🤔
every-data-scientist-should-know A collection of (mostly) technical things every data scientist should know
This is a s/programmer/data-scientist/g
of every-programmer-should-know by @mtdvio. That means two things: 1. I take no credit for the idea of creating this page, I just wanted one for data-science and 2. things that are purely related to software development will not be the focus of this page, though there may be limited duplication.
What do YOU think every data scientist should know?
This list is far from complete/correct. Add/remove as you wish...
Contributions)
(seeAs in the original, all of the following applies:
Highly opinionated
Comes in no particular order
U like it?
P.S. You don't need to know all of that by heart to be a data scientist.
But knowing the stuff will help you become better!
P.P.S. Contributions are welcome!*
Ethics
📜 📜 Theme issue ‘The ethical impact of data science’ Phil. Trans. R. Soc.📖 Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy
Base Math for Data Science
Statistics
Open Source / Open Data
Machine Learning
📇 Machine Learning Flashcards📖 Gaussian Processes for Machine Learning🔗 10 Machine Learning Algorithms You Should Know to Become a Data Scientist🔗 🏫 Cheatsheets for Machine/Deep Learning for Stanford's CS 229
Artificial Intelligence / Neural Networks / Deep Learning
🎥 But what is a Neural Network? | Chapter 1, deep learning🔗 Awesome Tensorflow🏫 Stanford Course on Deep Learning for NLP
Visualisation
"Data Constructs" - Data Structures & Relational Databases
"Big Data" Processing/Managment Technologies & Operalization
📜 Machine Learning: The High Interest Credit Card of Technical Debt🔗 Hadoop HDFS Architecture Explained🔗 Awesome Big Data
Specific Programming Languages for Data Science
📖 R for Data Science📖 Python Data Science Handbook📖 You Don't Know JS (Not Data Science Specific)🔗 Hyperpolyglot - Programming Languages - commonly used features in a side-by-side format (Not Data Science Specific)
Career
Meta-Lists
- Trello Data Science
- Hadley Wickhams - Stats 337: Readings in Applied Data Science
- Open Source Data Science Masters
Blogs/Tweeps you should follow
- the morning paper / @adriancolyer
- @BecomingDataSci
- @DynamicWebPaige
- @zeynep
- @hardmaru
- @KordingLab
- @math_rachel
- @hmason
This work is licensed under a Creative Commons Attribution 4.0 International License.