Skip to content

ShrikarRamesh/FDP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

41 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation


🧠 Machine Learning Faculty Development Program – Complete Repository Guide

Welcome to the Machine Learning Mini-Workshop Repository! This workshop is designed especially for MBA practitioner and non-technical learners who want a simple, practical, and visual introduction to Machine Learning.

This README will guide you from installation to running notebooks, and walk you through every folder in this repository.


πŸ“Œ 1. Prerequisites

Before starting, ensure you have the following:

βœ” A laptop

βœ” Stable internet

βœ” Basic familiarity with Excel-level data

βœ” No programming experience required


🧰 2. Install Required Tools

2.1 Install Git

Git helps you download and manage this project.

Windows

Download and install: https://git-scm.com/download/win

Mac

Git is pre-installed, but if needed: https://git-scm.com/download/mac

Linux

sudo apt install git

2.2 Create a GitHub Account

You need this to access and store your projects.

Create your account: https://github.com/


2.3 Install Python (3.10 or above)

Download from: https://www.python.org/downloads/

βœ” During installation on Windows, check β€œAdd Python to PATH”.


2.4 Install Jupyter Notebook / JupyterLab

Open a terminal (CMD / PowerShell / Terminal) and run:

pip install jupyterlab notebook ipykernel

After installation, launch with:

jupyter lab

or

jupyter notebook

πŸ“¦ 3. Download the Workshop Repository

In your terminal:

git clone https://github.com/<your-username>/<repository-name>.git
cd <repository-name>

(Replace with your GitHub repo link.)


πŸ“₯ 4. Install All Required Python Libraries

Run this inside your repo folder:

pip install pandas numpy scikit-learn matplotlib seaborn \
requests beautifulsoup4 lxml opendatasets kaggle \
tensorflow torch notebook jupyterlab transformers \
sentence-transformers nltk spacy

This installs everything needed for:

βœ” data preprocessing βœ” regression, classification, clustering models βœ” visualizations βœ” notebook execution


πŸ—‚ 5. Repository Structure

This workshop repository contains multiple folders. Each folder represents a complete part of the learning pipeline.


πŸ“ /01-Data-Discovery

This folder covers how to find datasets and research papers.

You will learn:

  • How to download datasets (Kaggle, UCI, Google Dataset Search)
  • How to structure research data
  • Tools like Semantic Scholar, arXiv
  • Spreadsheet basics

Material inside:

  • data_sources.md
  • Example dataset links
  • Small sample CSV datasets

πŸ“ /02-Data-Quality

This contains the data cleaning workbook and explanations.

You will learn:

  • Handling missing values
  • Removing duplicates
  • Encoding categories
  • Scaling numeric data
  • Train/test split

Files included:

  • data_cleaning_workbook.ipynb
  • sales_data.csv (demo dataset)
  • Completed code solution
  • Worksheets you can teach from

πŸ“ /03-Model-Selection

This folder introduces the 2 important Supervised ML families:

βœ” Linear Regression

For predicting continuous values (e.g., sales, profit)

βœ” Classification

For predicting labels (e.g., churn: yes/no)


πŸ“˜ Every model has:

  • A ready-to-run Jupyter Notebook
  • Step-by-step explanation
  • Intuition with simple plots (scatter plots, clusters, decision boundaries)
  • Visual explanations for MBA participants

Example files:

  • linear_regression.ipynb
  • classification.ipynb

πŸ“ /04-Training-Validation

This section focuses on how models learn.

Includes:

  • Train/test split explanation
  • Gradient descent intuition (MBA-friendly)
  • Avoiding overfitting
  • Hands-on training scripts

Files:

  • training_basics.ipynb
  • Visual plots for training curves

πŸ“ /05-Production-Readiness

This folder shows how to evaluate models properly.

You will learn:

  • Accuracy
  • Precision
  • Recall
  • F1-score
  • Confusion Matrix
  • Cross-validation

Files:

  • validation_metrics.ipynb
  • Interactive graphs with matplotlib & seaborn

πŸš€ 6. Running the Notebooks

Open your terminal inside the repo folder:

jupyter notebook

or

jupyter lab

Then click on any .ipynb file to open and run it.


🎯 7. Learning Outcome

By the end of the workshop, participants will be able to:

βœ” Understand data cleaning

βœ” Build simple regression, classification, clustering models

βœ” Interpret model results

βœ” Read plots and explain insights

βœ” Work with Jupyter Notebook

βœ” Apply ML thinking in business settings


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors