Extremesarova/ds_resources


Data Science Resources

For interview preparation and learning

Table of Contents:

• Algorithms and Data Structures
    • Platforms
    • Courses
    • Resources
    • Articles
    • Books
• Python
    • Clean Code
    • Theory
    • Questions
    • Other
    • Practice
• SQL
    • Courses
    • Practice
• Machine Learning
    • Sites
    • Courses
    • Books
    • Cheat sheets
    • Articles
    • Applied ML
    • Feature Engineering
    • Tutorials
    • Blog posts
    • Other
• Deep Learning
    • Books
    • Courses
    • Tutorials
    • Blog posts
    • Other
• NLP
    • Books
    • Courses
    • General
    • Large Language Models (LLMs) / Transformers
    • Reading papers with AI
    • Prompt Engineering
    • Tutorials
    • Blog posts
    • Articles

        • Word2Vec, Mikolov et al., Efficient Estimation of Word Representations in Vector Space
        • FastText, Bojanowski et al., Enriching Word Vectors with Subword Information
        • Attention, Bahdanau et al., Neural Machine Translation by Jointly Learning to Align and Translate
        • Transformers, Vaswani et al., Attention Is All You Need
        • BERT, Devlin et al., BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
        • GPT-2, Radford et al., Language Models are Unsupervised Multitask Learners
        • GPT-3, Brown et al., Language Models are Few-Shot Learners
        • LaBSE, Feng et al., Language-agnostic BERT Sentence Embedding
        • CLIP, Radford et al., Learning Transferable Visual Models From Natural Language Supervision
        • RoPE, Su et al., RoFormer: Enhanced Transformer with Rotary Position Embedding
        • LoRA, Hu et al., LoRA: Low-Rank Adaptation of Large Language Models
        • InstructGPT, Ouyang et al., Training language models to follow instructions with human feedback
        • Scaling laws, Hoffmann et al., Training Compute-Optimal Large Language Models
        • FlashAttention, Dao et al., FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
        • NLLB, NLLB team, No Language Left Behind: Scaling Human-Centered Machine Translation
        • Q8, Dettmers et al., LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
        • Self-instruct, Wang et al., Self-Instruct: Aligning Language Models with Self-Generated Instructions
        • Alpaca, Taori et al., Alpaca: A Strong, Replicable Instruction-Following Model
        • LLaMA, Touvron et al., LLaMA: Open and Efficient Foundation Language Models

• Computer Vision
• Graphs
• Reinforcement Learning
• RecSys
    • Courses
    • Books
    • Other
• Time Series
• Big Data
    • Books
    • Other
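As a small companion to the attention/Transformer papers listed above (Bahdanau et al., Vaswani et al.), here is a minimal NumPy sketch of scaled dot-product attention, the core operation of the Transformer. The function name, shapes, and toy inputs are illustrative only, not taken from any of the linked resources:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

    Q, K: (seq_len, d_k) query/key matrices; V: (seq_len, d_v) value matrix.
    """
    d_k = Q.shape[-1]
    # Similarity of every query to every key, scaled to stabilize the softmax
    scores = Q @ K.T / np.sqrt(d_k)              # (seq_len, seq_len)
    # Row-wise softmax (subtracting the max for numerical stability)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output row is a convex combination of the value vectors
    return weights @ V                           # (seq_len, d_v)

# Toy example: 3 tokens, dimension 4
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 4)
```

Interview questions on Transformers often start from exactly this formula, so it is worth being able to write it from memory; multi-head attention just runs several of these in parallel on learned projections of Q, K, and V.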