# NFL Fantasy Football Point Projections: An Introduction

## Project Overview

As avid NFL fans, we've spent countless hours analyzing player performance, debating draft strategies, and watching our fantasy teams succeed or fail based on real-world outcomes. Fantasy football transforms every Sunday into a personal investment in player performance, where statistical analysis meets genuine passion for the game. The unpredictable nature of the NFL—with its coaching changes, injuries, rookie developments, and evolving team dynamics—creates a fascinating challenge: can machine learning models capture the nuances that even dedicated football fans struggle to predict?

## Why This Topic Was Chosen

This project stems from our genuine love of football and curiosity about how data science compares to traditional football analysis. As dedicated fans who follow the sport closely, we chose this topic because:

### 1. **Testing Our Football Knowledge Against Data**
As fans who watch games regularly, analyze player matchups, and follow team trends, we're curious whether our intuitive understanding of football can be systematized and improved through machine learning. The available NFL statistics span 24 seasons (1999-2022) and more than 215,000 observations, enough to test whether the patterns we think we see actually exist.

### 2. **Real Stakes, Real Validation**
Unlike theoretical projects, fantasy football provides immediate feedback. When our model predicts a breakout season for an underrated running back or identifies a quarterback's decline, we can watch it play out on Sunday. This real-world validation makes every prediction meaningful.

### 3. **The Challenge of NFL Unpredictability**
Football is inherently chaotic: injuries, weather, game-script changes, and countless other variables trip up even seasoned analysts. Building models that can navigate this complexity while incorporating the football knowledge we've developed as fans presents a genuine technical challenge.

## What Makes This Project Compelling

### **Fan Intuition vs. Machine Learning**
As dedicated football fans, we've developed intuitions about player performance—we can spot when a receiver is getting more red zone targets, notice when an offensive line is improving, or sense when a running back is losing a step. This project lets us test whether machine learning can capture these subtle patterns that we think we see, or if it can find signals we've missed entirely.

### **Every Sunday Is a Test**
The beauty of football prediction is that results come quickly and definitively. Unlike academic models validated on static datasets, our predictions face the ultimate test every weekend. When we predict that a third-year wide receiver is due for a breakout season, we get to watch it unfold (or fail spectacularly) in real time.

### **The Complexity Football Fans Appreciate**
Football fans understand that quarterback performance differs fundamentally from running back consistency, that offensive line play affects multiple skill positions, and that coaching changes ripple through entire offenses. Our approach acknowledges this complexity by developing position-specific models and reliability metrics that reflect the nuanced way football fans actually evaluate players.

## Why This Matters Beyond Our Fantasy Leagues

While our primary motivation is personal curiosity and football passion, the implications extend beyond weekend entertainment:

### **Professional Sports Applications**

**1. NFL Team Scouting and Draft Strategy**
The same methods could extend beyond fantasy football into professional team management. NFL franchises invest millions in player evaluation, and predictive models could help identify:
- **Undervalued draft prospects** who outperform their draft position
- **"Bust potential"** of highly touted players whose skills may not translate to professional success
- **Optimal contract valuations** based on projected performance rather than past achievements

**2. Player Development and Career Management**
Teams can use similar models to:
- Identify which young players are most likely to improve
- Predict career longevity and plan roster construction accordingly
- Make data-driven decisions about player trades and free agency

### **Technical and Methodological Contributions**

**3. Advancing Sports Analytics**
Our approach contributes to the broader field by:
- Demonstrating **reliability index calculations** that balance performance and consistency
- Showing how **position-specific normalization** can enable cross-positional comparisons
- Developing **hybrid scoring systems** that combine multiple evaluation criteria

**4. Machine Learning Innovation**
The techniques we develop—particularly around handling **seasonal variance**, **career trajectory modeling**, and **multi-dimensional performance evaluation**—have applications beyond sports in fields like:
- Financial forecasting (predicting stock performance with similar volatility)
- Employee performance evaluation (balancing consistency vs. peak performance)
- Healthcare outcomes (predicting patient response variability)

## What We Hope to Discover

This project represents our attempt to bridge the gap between football fandom and data science. We want to understand whether the patterns we think we see as dedicated viewers—the subtle signs of player development, the impact of system changes, the reliability indicators that separate consistent performers from boom-bust players—can be quantified and systematized.

More fundamentally, we're testing whether machine learning can enhance rather than replace football knowledge. Can models help us identify undervalued players we might have overlooked? Can they warn us about declines we're too emotionally invested to see? Every prediction becomes a test of whether data science and football passion can work together to better understand the game we love.

# Figures

This section presents key visualizations that illustrate our approach to NFL fantasy football prediction, from initial data exploration through model development and validation.

## Figure 1: Dataset Composition and Positional Distribution

*Data Source: `data_exploration.ipynb`*

Our analysis begins with understanding the structure of NFL player data across positions. The position distribution reveals important insights about roster composition and the relative scarcity of talent at different positions—a critical factor in fantasy football valuation.

**Key Insights:**
- Wide Receivers comprise the largest segment of skill position players
- Running Backs and Tight Ends represent smaller but crucial position groups
- This distribution directly impacts our position-specific modeling approach

## Figure 2: Fantasy Points Consistency Analysis by Position

*Data Source: `data_exploration.ipynb`*

Understanding the variability in fantasy scoring by position is fundamental to building reliable predictive models. This analysis measures consistency through standard deviation of weekly fantasy points.

**Legend:**
- Lower bars indicate more consistent, predictable scoring
- Higher bars represent positions with greater "boom-or-bust" potential
- Y-axis shows standard deviation of weekly fantasy points

**Interpretation:**
- Quarterbacks typically show the most consistent scoring patterns
- Wide Receivers and Tight Ends exhibit higher variability
- This variance analysis directly informs our reliability index calculations
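
As a rough illustration of the consistency measure described above, the sketch below computes the standard deviation of weekly fantasy points for each position. The rows are made-up placeholders rather than values from our dataset, and two details are assumptions on our part: pooling all weekly scores for a position into one group, and using the sample (not population) standard deviation.

```python
from collections import defaultdict
from statistics import stdev

# Hypothetical (position, weekly fantasy points) rows; not real data.
weekly_points = [
    ("QB", 18.0), ("QB", 20.0), ("QB", 22.0),
    ("WR", 4.0), ("WR", 12.0), ("WR", 26.0),
]


def weekly_std_by_position(rows):
    """Return {position: sample standard deviation of weekly points}."""
    grouped = defaultdict(list)
    for position, points in rows:
        grouped[position].append(points)
    # stdev needs at least two observations, so skip smaller groups.
    return {pos: stdev(vals) for pos, vals in grouped.items() if len(vals) >= 2}


position_std = weekly_std_by_position(weekly_points)
```

In this toy input the wide receiver scores are far more spread out than the quarterback scores, so the WR bar would be the taller one in a chart like Figure 2.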

## Figure 3: Quarterback Performance Analysis Across Career Trajectories

*Data Source: `stat_projections.ipynb`*

### Figure 3a: Age vs. Performance Relationship
Polynomial regression analysis revealing the typical career arc for NFL quarterbacks, showing peak performance windows and decline patterns.
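
A minimal sketch of this kind of fit is shown below, using `numpy.polyfit` on invented (age, season points) pairs. The degree-2 polynomial is an assumption made for illustration; the degree used in the notebook may differ, and the numbers are not taken from our data.

```python
import numpy as np

# Hypothetical (age, season fantasy points) pairs for illustration only.
ages = np.array([23, 25, 27, 29, 31, 33, 35], dtype=float)
points = np.array([180, 240, 280, 290, 270, 230, 170], dtype=float)

# Fit points ~ a*age^2 + b*age + c (degree 2 is an assumption).
a, b, c = np.polyfit(ages, points, deg=2)
age_curve = np.poly1d([a, b, c])  # callable curve, e.g. age_curve(30.0)

# For a concave curve (a < 0), the vertex gives the estimated peak age.
peak_age = -b / (2 * a)
```

With a concave fit like this one, `peak_age` marks the top of the career arc, and evaluating `age_curve` on either side of it traces the rise and decline.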

### Figure 3b: Elite Quarterback Progression (2024 Active Players)
Longitudinal analysis of top-tier quarterbacks who played in 2024, demonstrating how elite players maintain production over time compared to league averages.

### Figure 3c: Performance Percentile Analysis
Quarterbacks grouped by performance percentiles across seasons, showing how different tiers of players age and develop differently.

**Key Findings:**
- Clear age-performance curves exist for quarterbacks
- Elite quarterbacks maintain performance longer than average players
- Performance percentile is a strong predictor of career longevity

## Figure 4: Running Back Predictive Model Development

*Data Source: `firstModel.ipynb`*

### Figure 4a: Feature Correlation Analysis
Multi-panel scatter plot analysis examining the relationship between 2023 performance metrics and 2024 fantasy outcomes for running backs.

**Features Analyzed:**
- Offensive snap count (opportunity metric)
- Rushing yards (volume metric)
- Rushing touchdowns (scoring metric)
- Yards per carry (efficiency metric)
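
To make the feature-versus-outcome comparison concrete, the sketch below computes the Pearson correlation between each prior-season feature and next-season fantasy points. The feature names and all values are hypothetical stand-ins, and the use of Pearson correlation is our assumption about how the scatter plots would be summarized.

```python
import numpy as np

# Hypothetical running back rows: prior-season features (illustrative values).
features = {
    "offense_snaps": np.array([650, 420, 780, 300, 560], dtype=float),
    "rushing_yards": np.array([1100, 600, 1350, 380, 900], dtype=float),
    "rushing_tds": np.array([9, 4, 12, 2, 7], dtype=float),
    "yards_per_carry": np.array([4.6, 4.1, 4.9, 3.8, 4.4], dtype=float),
}
# Hypothetical next-season fantasy points for the same five players.
next_season_points = np.array([230, 130, 290, 80, 190], dtype=float)

# np.corrcoef returns a 2x2 matrix; the off-diagonal entry is Pearson's r.
correlations = {
    name: float(np.corrcoef(values, next_season_points)[0, 1])
    for name, values in features.items()
}
```

Each entry in `correlations` corresponds to one scatter panel: values near 1 suggest the feature carries over strongly to the following season, while values near 0 suggest little linear relationship.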

### Figure 4b: Support Vector Regression Model Performance
Validation plot comparing actual vs. predicted fantasy points for the SVR model, with separate visualization of training and test performance.

**Model Interpretation:**
- Points on the diagonal line represent perfect predictions
- Points above the line indicate model overestimation (predicted points are plotted on the vertical axis)
- Points below the line indicate model underestimation
- Player names are labeled to enable individual performance analysis
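
The sketch below shows one way a Support Vector Regression model with a train/test split could be set up in scikit-learn. Everything specific here is an assumption for illustration: the data is synthetic, and the RBF kernel, `C`, `epsilon`, the 75/25 split, and the scaling step are placeholder choices rather than the settings used in `firstModel.ipynb`.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Synthetic stand-in for prior-season features (snaps, rush yards, TDs, YPC).
X = rng.uniform(low=[200, 200, 0, 3.0], high=[900, 1500, 15, 5.5], size=(80, 4))
# Synthetic target loosely tied to yards and touchdowns, plus noise.
y = 0.1 * X[:, 1] + 6.0 * X[:, 2] + rng.normal(0, 15, size=80)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# SVR is sensitive to feature scale, so standardize before fitting.
model = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=100.0, epsilon=1.0))
model.fit(X_train, y_train)

# Predictions for both splits, as plotted in an actual-vs-predicted chart.
train_pred = model.predict(X_train)
test_pred = model.predict(X_test)
test_mae = float(np.mean(np.abs(y_test - test_pred)))
```

Plotting `y_train` against `train_pred` and `y_test` against `test_pred` in separate panels gives the kind of validation view described for this figure, and a large gap between training and test error would point to overfitting.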

### Figure 4c: Position-Filtered Model Results
Model performance after restricting the analysis to RB1/RB2-level players (team depth chart positions 1-2), showing improved accuracy for fantasy-relevant players.

## Figure 5: Feature Engineering and Player Evaluation Framework

*Data Source: `feature_eng.ipynb`*

### Figure 5a: Reliability Index Calculation
Visualization of our custom reliability index that combines average performance with consistency metrics, creating a comprehensive player evaluation framework.

**Formula:** `Reliability Index = Average Fantasy Points / (Standard Deviation + 1)`
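
The formula can be sketched directly in Python. The function name `reliability_index` and the sample scores are illustrative, and the use of the sample standard deviation (with a spread of zero for a single game) is an assumption, since the formula above does not say which variant is used.

```python
from statistics import mean, stdev


def reliability_index(weekly_points):
    """Average fantasy points divided by (standard deviation + 1)."""
    avg = mean(weekly_points)
    # Sample standard deviation is an assumption; one game has no spread.
    spread = stdev(weekly_points) if len(weekly_points) > 1 else 0.0
    return avg / (spread + 1)


# Two hypothetical players with the same average (15) but different spread.
steady = reliability_index([14.0, 15.0, 16.0])   # standard deviation 1
volatile = reliability_index([5.0, 15.0, 25.0])  # standard deviation 10
```

Both players average 15 points, but the steady player scores 15 / (1 + 1) = 7.5 while the volatile one scores 15 / (10 + 1), or about 1.36. The `+ 1` in the denominator keeps the index finite for a player whose scores never vary.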

### Figure 5b: Position-Specific Normalization
Z-score and percentile rank analysis showing how players compare within their respective positions, enabling cross-positional value comparisons.

### Figure 5c: Comprehensive Value Score Rankings
Integration of performance metrics and reliability indices into a single value score, representing our final player evaluation framework.

**Components:**
- 60% Performance Z-score (relative to position)
- 40% Reliability Z-score (consistency within position)
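
A minimal sketch of the 60/40 weighting follows, assuming z-scores are computed within a single position group. The helper names, the sample players, and the choice of population standard deviation are all illustrative assumptions rather than details taken from `feature_eng.ipynb`.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Standardize values against their own group mean and spread."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        # Every player is identical on this metric, so no one stands out.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def value_scores(avg_points, reliability, w_perf=0.6, w_rel=0.4):
    """Blend performance and reliability z-scores with the 60/40 weights."""
    perf_z = z_scores(avg_points)
    rel_z = z_scores(reliability)
    return [w_perf * p + w_rel * r for p, r in zip(perf_z, rel_z)]


# Hypothetical players at one position (same order in both lists).
rb_avg_points = [18.0, 12.0, 9.0]
rb_reliability = [3.0, 4.0, 2.0]
rb_values = value_scores(rb_avg_points, rb_reliability)
```

Because each z-score is measured relative to a player's own position group, the resulting value scores sit on a common scale, which is what makes comparisons across positions possible.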

## Figure 6: Model Validation and Real-World Application

### Figure 6a: Cross-Validation Results
Performance metrics across different model configurations and hyperparameter settings, demonstrating model stability and optimization.

### Figure 6b: Prediction Confidence Intervals
Uncertainty quantification for player predictions, showing model confidence levels for different types of players and situations.

## Statistical Methodology Summary

Our visualization approach follows established sports analytics practices:

1. **Exploratory Analysis** (Figures 1-2): Understanding data structure and inherent patterns
2. **Position-Specific Modeling** (Figure 3): Acknowledging that different positions require different analytical approaches
3. **Feature Engineering** (Figure 5): Creating meaningful metrics that capture football-specific insights
4. **Model Development** (Figure 4): Building and validating predictive models with proper train/test splits
5. **Performance Evaluation** (Figure 6): Rigorous assessment of model accuracy and reliability