You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A structured collection of machine learning notebooks covering core algorithms from supervised and unsupervised learning to NLP and time series forecasting. Each notebook includes line-by-line comments explaining both the code and the underlying concepts, with references where applicable.
Textbook exercises following a structured data mining workflow — from data loading and preparation through EDA, modeling, and evaluation. Uses the Bank Marketing and Churn datasets from the UCI Machine Learning Repository.
Tools and Libraries
Python 3.11
scikit-learn — model training and evaluation
statsmodels — logistic and Poisson regression
keras / tensorflow — neural network construction
pandas, numpy — data manipulation
matplotlib, seaborn — visualization
pmdarima — auto-ARIMA for time series
Notes
All notebooks were developed and run on Google Colab.
The NaiveBayes_FromScratch notebook implements the algorithm without sklearn to demonstrate understanding of the underlying math.
Datasets used are either publicly available (UCI ML Repository, Kaggle) or course-provided.
About
A curated collection of Jupyter Notebooks containing implementations and evaluations of foundational machine learning algorithms and data science techniques.