Skip to content

raunak-r/data-science-notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Notes

Documenting my learnings for Data Science, and how anyone can approach learning ML/AI.

Who am I?

See my qualifications on Linkedin, Personal Website.

My Data Science Experience

I was a part of the Applied AI team in Deloitte USI, with experiences on analysis, cleansing and understanding the story of a dataset to use Machine Learning to build and run custom models, evaluate performances, at times also requiring software development to build relevant dashboards, for various clients.

How to start Data Science?

There are tons of articles, courses (paid and free), degrees available in this field. But, at the end of the day, everything you read should fall into one of these topics.

This article is aimed at teaching those few people who are starting their Data Science from scratch, or are at beginner or intermediate levels.

Beginners can follow through everything provided, and intermediate experienced people can try to identify where all their ML learning really fits in the large canvas of Machine Learning.

Set a weekly target for each of these to atleast have a basic idea of the ML Universe in 5 weeks maximum.

  1. Programming in Python
  2. Statistics
  3. Maths
  4. Data Analysis
  5. Data Science

1. Programming in Python

Setup

  • Setup and get familiar with Jupyter Notebook Usage

Practice

This resource by w3 is really great, and will help you build basic python fundamentals and provide syntax-level comfort. You will need to solve the first 10 questions, and subsequently every 2nd/4th question to practice more efficient questions.

- Python Basic (Part -I)
- Python Basic (Part -II)
- Python built-in Modules
- Python Data Types - String
- Python JSON
- Python Data Types - List
- Python Data Types - Dictionary
- Python Conditional statements and loops
- Python functions
- Python Lambda

Give it a foundation

Follow everything on this Roadmap.

2. Statistics

There are literally no shortcuts here, and if you've no idea what to read, just download this book, and start reading the index page to find out topics that you may have left before.

Books to Read -

  1. The Complete Idiot's Guide to Statistics by Donnelly Robert

Other Links

3. Maths

Although, a strong fundamentals over Maths is required, you may skip this section entirely for now, and revisit this once you will have more experience on the overall ML concepts.

Links

4. Data Analysis

You will need to cover the basic libraries like Pandas, MatPlotLib/Seaborn, Numpy to grasp a basic syntax level understanding of how you can work on dataframe & datasets in large and in Python.

Basics

Notes

Links

5. Data Science

Understand the Industry

You will find these few applications on a very large scale, either in projects, and in interviews both.

- NLP (wherever there is text data)
- Churn Modelling (everywhere)
- Customer Segmentation (everywhere)
- Retail
    - Sales Forecasting
    - UpSell & CrossSell
- Predictive Maintenance

How to Work on Mini-Projects/UseCases?

Pick any of these approaches, or mix them up. The idea here is to give you enough understanding of the ML universe, so that you can start picking up things on your own.

- Dataset Specific
    (Some famous beginner datasets are given below)
    - AirBNB Price Prediction (Regression)
    - IMDB Sentiment Analysis (Classification)
    - Titanic (Classification)
- Algorithm/Concept Specific
    - Classification
    - Regression
    - Clustering
    - Deep Learning
    - Decision Trees
    - NLP

More Links to Read:

About

A collection of data science concepts, datasets, industry-applications, walk-through's & notes. See ISSUES tab for in-depth studies by topics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published