# Project Setup – Knee Osteoarthritis KL Grading

**Project:** AI-Driven Detection and Grading of Knee Osteoarthritis  
**Objective:** Prepare the environment and define the experimental setup for knee X-ray–based KL grading.

This notebook focuses on:
- Environment setup
- Project configuration
- Dataset planning
- Methodology outline

No model training is performed at this stage.


In [1]:
# Basic environment check
import sys
print("Python version:", sys.version)

Python version: 3.12.12 (main, Oct 10 2025, 08:52:57) [GCC 11.4.0]


In [2]:
# Core scientific libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Deep learning libraries (used in later notebooks)
import torch
import torchvision

print("Libraries imported successfully")

Libraries imported successfully


In [3]:
# Reproducibility
import random

def set_seed(seed=42):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    if torch.cuda.is_available():
        torch.cuda.manual_seed_all(seed)

set_seed()
print("Random seed set")

Random seed set


In [4]:
# Project configuration
CONFIG = {
    "image_size": (224, 224),
    "batch_size": 16,
    "num_classes": 5,  # KL grades 0–4
    "learning_rate": 1e-4,
    "epochs": 30,
    "model_backbone": "ResNet50",
}

CONFIG

{'image_size': (224, 224),
 'batch_size': 16,
 'num_classes': 5,
 'learning_rate': 0.0001,
 'epochs': 30,
 'model_backbone': 'ResNet50'}

## Dataset Planning

The primary dataset identified for this project is the **Osteoarthritis Initiative (OAI)** dataset, which provides knee X-ray images annotated with Kellgren–Lawrence (KL) grades.

Planned steps:
1. Register and request access to the OAI dataset
2. Inspect image formats and metadata
3. Analyze class distribution of KL grades
4. Design preprocessing and augmentation pipeline

Backup datasets (e.g., the Multicenter Osteoarthritis Study, MOST) will be considered if OAI access is delayed.
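
As a rough preview of step 3, the class distribution check might look like the sketch below. The labels are synthetic placeholders rather than OAI data, and the column name `kl_grade` is an assumption for illustration; the real metadata may use a different name.

```python
import pandas as pd

# Synthetic placeholder labels, NOT real OAI data. Replace with the KL grade
# column from the actual metadata once access is granted.
labels = pd.Series([0, 0, 0, 0, 1, 1, 2, 2, 2, 3, 4], name="kl_grade")

# Count images per grade, keeping all five grades in order even if one is absent.
counts = labels.value_counts().reindex(range(5), fill_value=0)

# Summarize absolute counts and each grade's share of the dataset.
summary = pd.DataFrame({"count": counts, "share": counts / counts.sum()})
print(summary.round(3))

# Ratio between the largest and smallest non-empty class, a quick imbalance indicator.
nonzero = counts[counts > 0]
print("Imbalance ratio:", nonzero.max() / nonzero.min())
```

A large imbalance ratio on the real labels would indicate how much attention the augmentation design needs to give to under-represented grades.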


## Methodology Outline

- Use transfer learning with a pre-trained CNN (e.g., ResNet50 or DenseNet121)
- Replace final classification layer for 5-class KL grading
- Apply data augmentation, particularly for under-represented KL grades, to help mitigate class imbalance
- Evaluate using accuracy, precision, recall, and F1-score
- Use Grad-CAM for model interpretability

Detailed implementation will follow in subsequent notebooks.
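
As a preview of the evaluation step, the listed metrics can be computed from a confusion matrix. The sketch below uses macro averaging, which is an assumption rather than a decision made in this notebook, and the function name `macro_metrics` and the toy labels are invented for illustration.

```python
import numpy as np


def macro_metrics(y_true, y_pred, num_classes=5):
    """Accuracy plus macro-averaged precision, recall, and F1 (hypothetical helper)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)

    # Confusion matrix: rows are true grades, columns are predicted grades.
    cm = np.zeros((num_classes, num_classes), dtype=int)
    np.add.at(cm, (y_true, y_pred), 1)

    tp = np.diag(cm).astype(float)
    predicted = cm.sum(axis=0)  # how often each grade was predicted
    actual = cm.sum(axis=1)     # how often each grade truly occurs

    # Per-class scores; classes with a zero denominator score 0.
    precision = np.divide(tp, predicted, out=np.zeros_like(tp), where=predicted > 0)
    recall = np.divide(tp, actual, out=np.zeros_like(tp), where=actual > 0)
    denom = precision + recall
    f1 = np.divide(2 * precision * recall, denom, out=np.zeros_like(tp), where=denom > 0)

    return {
        "accuracy": tp.sum() / cm.sum(),
        "precision": precision.mean(),
        "recall": recall.mean(),
        "f1": f1.mean(),
        "confusion_matrix": cm,
    }


# Toy labels for illustration only, not model output.
y_true = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
y_pred = [0, 1, 1, 1, 2, 3, 3, 3, 4, 2]
result = macro_metrics(y_true, y_pred)
print({k: round(float(v), 3) for k, v in result.items() if k != "confusion_matrix"})
```

Macro averaging weights every KL grade equally, so poor performance on a rare grade is not hidden by the majority classes. Inspecting the confusion matrix also shows whether errors fall between adjacent grades.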


## Next Steps

- Finalize dataset access and download
- Perform exploratory data analysis (EDA)
- Implement baseline CNN model
- Conduct initial training experiments
- Analyze results and refine approach
