NBA Data Fetcher and Predictor

A machine learning system for NBA player statistics prediction and prop bet analysis.

Project Overview

This project implements a complete pipeline for:

Collecting NBA player statistics
Processing and engineering features
Training prediction models
Analyzing betting propositions
Visualizing results and insights

Project Structure

nba-data-fetcher/
├── src/
│   ├── data/
│   │   ├── raw/              # Raw NBA statistics data
│   │   ├── features/         # Feature-engineered datasets
│   │   └── analysis/         # Prop analysis results
│   │
│   ├── models/              # Trained models and metrics
│   │   ├── {stat}_model_YYYYMMDD.joblib
│   │   ├── {stat}_metrics_YYYYMMDD.json
│   │   └── feature_groups.joblib
│   │
│   ├── docs/               # Documentation
│   │   ├── technical_docs.md
│   │   └── model_performance.md
│   │
│   └── scripts/
│       ├── data_collection/
│       │   └── nba_historical_stats_fetcher.py
│       │
│       ├── preprocessing/
│       │   └── feature_engineering.py
│       │
│       ├── modeling/
│       │   └── train_and_save_models.py
│       │
│       ├── analysis/
│       │   └── prop_analyzer.py
│       │
│       ├── odds/
│       │   └── odds_api.py
│       │
│       └── run_pipeline.py
│
├── requirements.txt        # Python dependencies
└── .env                   # Environment variables (not in repo)

Model Performance

Current model performance metrics (as of February 9, 2025):

Statistic	R² Score	RMSE	MAE
Points	0.710	1.903	0.279
Rebounds	0.680	1.154	0.230
Assists	0.886	0.197	0.054
Three-Points	0.189	0.221	0.056

For detailed model analysis, see Model Performance Documentation.

Setup and Installation

Clone the repository:

git clone https://github.com/yourusername/nba-data-fetcher.git
cd nba-data-fetcher

Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Create a .env file with required API keys:

ODDS_API_KEY=your_key_here

Usage

Run the complete pipeline:

python src/scripts/run_pipeline.py

Run individual components:

# Data collection
python src/scripts/data_collection/nba_historical_stats_fetcher.py

# Data cleaning
python src/scripts/preprocessing/clean_raw_data.py

# Feature engineering
python src/scripts/preprocessing/feature_engineering.py

# Model training
python src/scripts/modeling/train_and_save_models.py

# Prop analysis
python src/scripts/analysis/prop_analyzer.py

Key Features

Comprehensive NBA statistics collection
Advanced feature engineering
Gradient Boosting models with optimized feature selection
Time series cross-validation
Prop bet edge analysis
Performance visualization tools

Recent Improvements

Optimized feature selection using gradient boosting importance
Enhanced preprocessing pipeline with robust scaling
Improved handling of categorical variables
Reduced training time with optimized number of trials (20)
Implementation of mean threshold for feature selection

Contributing

See CONTRIBUTING.md for guidelines on how to contribute to this project.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/PULL_REQUEST_TEMPLATE		.github/PULL_REQUEST_TEMPLATE
src		src
tools		tools
.cursorrules		.cursorrules
.gitignore		.gitignore
.gitmessage		.gitmessage
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
commit_msg.txt		commit_msg.txt
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NBA Data Fetcher and Predictor

Project Overview

Project Structure

Model Performance

Setup and Installation

Usage

Key Features

Recent Improvements

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

chevyphillip/nba-data-fetcher

Folders and files

Latest commit

History

Repository files navigation

NBA Data Fetcher and Predictor

Project Overview

Project Structure

Model Performance

Setup and Installation

Usage

Key Features

Recent Improvements

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages