This repository is a complete hands-on learning path for Data Science and Machine Learning using Python. It covers fundamentals, advanced analytics, visualization, machine learning algorithms, deep learning, NLP, and Big Data with Spark, with daily practice and projects.
- Python Crash Course
- Python for Data Analysis
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Pandas Built-in Visualization
- Plotly & Cufflinks
- Geographical Plotting
- Data Cleaning
- Feature Engineering
- Visualization Techniques
- Capstone Data Analysis Project
- Introduction to Machine Learning
- Linear Regression
- Cross Validation
- BiasβVariance Tradeoff
- Logistic Regression
- K-Nearest Neighbors (KNN)
- Decision Trees
- Random Forest
- Support Vector Machines (SVM)
- K-Means Clustering
- Principal Component Analysis (PCA)
- Recommendation Systems
- Natural Language Processing (NLP)
- Neural Networks & Deep Learning
- Introduction to Big Data
- Apache Spark with Python (PySpark)
To build a strong industry-ready foundation in:
- Data Analysis
- Machine Learning
- Deep Learning
- Big Data Engineering
with real-world datasets, projects, and interview-focused implementations.
- Data Science & AI students
- Machine Learning aspirants
- Interview preparation
- Portfolio building
- Hands-on learners
- Python
- NumPy, Pandas
- Matplotlib, Seaborn, Plotly
- Scikit-learn
- TensorFlow / PyTorch (Deep Learning)
- PySpark
Each folder contains:
- Concept notebooks
- Code implementations
- Visualizations
- Mini projects
- Notes and explanations
π Maintained by Chirag Bharadwaj β M.Sc. in A.I. & Data Science, P.E.S. University π Daily coding practice to build a strong Data Scientist portfolio