F1 Markov Model Project

Dirichlet-Multinomial Markov and Bayesian state-space models for predicting F1 finishing positions, trained on historical race results (2015-2025).

Environment Setup

# Install dependencies with uv
uv sync

# Run any script
uv run python <script>

# Launch the dashboard
uv run streamlit run dashboard.py

Model Stages

Three production models are trained on 2015-2025 data and ensembled:

Stage	Description	Strength
6	Year-weighted constructor (Dirichlet-Multinomial)	Best calibration (LL/race = -59.0)
8	Time-varying Plackett-Luce	Best ranking (Spearman rho = 0.800)
9	Bayesian state-space (MAP)	Best top-3 accuracy (1.59/3)

A composite model blends Stage 6 + Stage 9 probabilities (Stage 8 excluded when degenerate).

Archived models (stages 1-5, 7) are preserved in archive/ for reference.

Race Prediction Workflow

Quick start: predict the next race

# 1. Add prior race results to generate_2026_data.py, then populate CSVs:
uv run python generate_2026_data.py

# 2. Run simulation for the target round:
uv run python simulate_race.py --season 2026 --round 3

# 3. Launch dashboard to view predictions:
uv run streamlit run dashboard.py

Adding race results

Edit generate_2026_data.py and add entries to QUALIFYING_RESULTS and RACE_RESULTS dicts:

QUALIFYING_RESULTS[3] = [
    ("Driver Name", "Constructor Name", grid_position),
    ...
]
RACE_RESULTS[3] = [
    ("Driver Name", "Constructor Name", position, "posText", laps, "time", points, statusId),
    ...
]
# statusId: 1=Finished, 130=DNF, 20=DNS

Then run uv run python generate_2026_data.py to append to CSVs.

Simulation CLI

# Auto-detect everything from CSVs:
uv run python simulate_race.py --season 2026 --round 3

# Explicit metadata for races not yet in CSVs:
uv run python simulate_race.py --season 2026 --round 3 \
    --race-name "Japanese Grand Prix" --circuit "Suzuka Circuit" --date 2026-03-29

Results are saved to data/sim_*.json and auto-discovered by the dashboard.

Key Files

File	Description
`simulate_race.py`	General-purpose Monte Carlo race simulation CLI
`config_2026.py`	2026 season constants (drivers, constructors, calendar)
`generate_2026_data.py`	Append 2026 race results to Ergast CSVs
`dashboard.py`	Streamlit multi-race dashboard
`models/stage{3,6,8,9}_*.py`	Production model implementations
`data/sim_*.json`	Simulation results consumed by dashboard
`pyproject.toml`	Python dependencies (managed by uv)

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.claude/skills		.claude/skills
.devcontainer		.devcontainer
archive		archive
data		data
models		models
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
README.md		README.md
config_2026.py		config_2026.py
dashboard.py		dashboard.py
evaluate_season_models.py		evaluate_season_models.py
generate_2025_data.py		generate_2025_data.py
generate_2026_data.py		generate_2026_data.py
predict_2026_shanghai.py		predict_2026_shanghai.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
simulate_race.py		simulate_race.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

F1 Markov Model Project

Environment Setup

Model Stages

Race Prediction Workflow

Quick start: predict the next race

Adding race results

Simulation CLI

Key Files

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

F1 Markov Model Project

Environment Setup

Model Stages

Race Prediction Workflow

Quick start: predict the next race

Adding race results

Simulation CLI

Key Files

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages