Excited to introduce CourtKeyNet, an open-source deep learning architecture for sports video analysis. Built for state-of-the-art court detection, CourtKeyNet offers the following features.
- High-Fidelity Feature Extraction: It captures both fine court details and global structural context using a novel Octave Feature Extractor.
- Robust Boundary Attention: It enables precise boundary localization by mapping spatial relationships in polar coordinates via our Polar Transform Attention.
- Geometric Consistency & Open Access: It produces structurally valid outputs, enforcing proper quadrilateral properties through a dedicated Constraint Module and Geometric Consistency Loss. We provide public access to the code and pre-trained models, and we believe this release will empower the community with practical applications such as sports video analysis, match statistics generation, and automated broadcasting systems.
The datasets utilized for CourtKeyNet are located in the datasets folder, which is linked as a submodule to the primary dataset repository:
Note: The dataset contains badminton court images for keypoint detection, and the main repository contains the custom annotation tool for geometric keypoints labeling.
Set up a conda environment and install dependencies:

```bash
# 1. Clone the repository
git clone https://github.com/adithyanraj03/CourtKeyNet.git
cd CourtKeyNet

# 2. Create and activate a Conda environment
conda create -n courtkeynet python=3.10 -y
conda activate courtkeynet

# 3. Install requirements
pip install -r requirements.txt

# 4. Log in to Weights & Biases (optional, for experiment tracking)
wandb login
```

How Confidence Detection Works (Visual Explanation)
CourtKeyNet works like this:

Problem: the model has no built-in "court detector"; it assumes every input image IS a court! Any confidence score therefore has to be derived from the model's own outputs.
When you run model(image), it returns a dictionary with these components:

```python
outputs = {
    'heatmaps':     Tensor[B, 4, 160, 160],  # 4 Gaussian peaks (one per corner)
    'kpts_init':    Tensor[B, 4, 2],         # Initial keypoints from heatmaps
    'kpts_refined': Tensor[B, 4, 2],         # Final refined keypoints
    'features':     Tensor[B, 256, 20, 20],  # Feature maps (optional)
}
```

For a real court image, each corner heatmap (e.g., Corner 0, top-left) shows a single sharp peak. For a non-court image (e.g., a random person), the heatmaps are diffuse, with no clear peak.
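The initial keypoints can be recovered from the heatmaps by taking the per-channel peak. Here is a minimal numpy sketch of that decoding step; `decode_keypoints` and the toy heatmap below are illustrative, not code from the repo:

```python
import numpy as np

def decode_keypoints(heatmaps):
    """Decode one (x, y) peak per corner channel via argmax.

    heatmaps: array of shape [4, H, W], one channel per corner.
    Returns:  array of shape [4, 2] with (x, y) pixel coordinates.
    """
    n, h, w = heatmaps.shape
    flat_idx = heatmaps.reshape(n, -1).argmax(axis=1)  # peak index per channel
    ys, xs = np.unravel_index(flat_idx, (h, w))        # back to 2-D coordinates
    return np.stack([xs, ys], axis=1).astype(np.float32)

# Toy example: one synthetic peak per corner channel
hm = np.zeros((4, 160, 160), dtype=np.float32)
peaks = [(40, 30), (120, 30), (120, 130), (40, 130)]   # (x, y) per corner
for c, (x, y) in enumerate(peaks):
    hm[c, y, x] = 1.0
kpts_init = decode_keypoints(hm)                       # shape [4, 2]
```

In the real model the batch dimension is carried along and the peaks are Gaussian blobs rather than single pixels, but the argmax idea is the same.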
What it measures: how "peaky" each heatmap is.

```python
max_values = heatmaps.amax(dim=(2, 3))  # highest value in each corner heatmap
conf_heatmap = max_values.mean()        # average across the 4 corners
```
What it measures: how "spread out" the heatmap's probability mass is (its entropy).

```python
# Entropy = -Σ(p * log(p)) over the normalized heatmap
# Low entropy  = focused peak (good)
# High entropy = random noise (bad)
```
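As a rough illustration, the entropy check could be computed like this. This is a hedged numpy sketch; `heatmap_entropy` is a hypothetical helper, not part of the repo's API:

```python
import numpy as np

def heatmap_entropy(heatmap, eps=1e-8):
    """Shannon entropy of a heatmap treated as a probability map."""
    p = heatmap.ravel().astype(float)
    p = p / (p.sum() + eps)              # normalize to sum to ~1
    return float(-(p * np.log(p + eps)).sum())

focused = np.zeros((160, 160))
focused[80, 80] = 1.0                    # one sharp peak -> entropy near 0
noisy = np.random.default_rng(0).random((160, 160))
# spread-out values -> entropy approaches the uniform maximum log(160*160) ~ 10.15
```

Thresholding this value (low entropy = confident localization) is one simple way to flag non-court inputs.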
What it checks: Does the quad look like a real court?
Checklist:
✓ Are corners in correct positions? (TL upper-left, BR lower-right)
✓ Is the quad convex? (no crossed lines)
✓ Is the area reasonable? (not too tiny, not entire image)
✓ Is aspect ratio court-like? (not a thin line)
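The checklist above can be sketched as a single plausibility test. Everything here (the function name, the thresholds such as `min_area_frac`) is illustrative, not the repo's actual implementation:

```python
import numpy as np

def is_plausible_quad(kpts, img_size=640, min_area_frac=0.05, max_area_frac=0.95):
    """Sanity-check a corner quad, ordered TL, TR, BR, BL, in pixel coords.
    (The TL-upper-left / BR-lower-right ordering check is omitted for brevity.)"""
    # 1. Convexity: z-components of consecutive edge cross products share a sign
    z = []
    for i in range(4):
        a, b, c = kpts[i], kpts[(i + 1) % 4], kpts[(i + 2) % 4]
        z.append((b[0] - a[0]) * (c[1] - b[1]) - (b[1] - a[1]) * (c[0] - b[0]))
    convex = all(v > 0 for v in z) or all(v < 0 for v in z)

    # 2. Area via the shoelace formula, as a fraction of the image area
    x, y = kpts[:, 0], kpts[:, 1]
    area = 0.5 * abs(np.dot(x, np.roll(y, -1)) - np.dot(y, np.roll(x, -1)))
    area_ok = min_area_frac <= area / img_size ** 2 <= max_area_frac

    # 3. Bounding-box aspect ratio: reject near-degenerate (line-like) quads
    w, h = x.max() - x.min(), y.max() - y.min()
    aspect_ok = 0.2 < w / max(h, 1e-6) < 5.0

    return bool(convex and area_ok and aspect_ok)

court = np.array([[100, 100], [540, 100], [540, 500], [100, 500]], float)
bowtie = np.array([[100, 100], [540, 500], [540, 100], [100, 500]], float)  # crossed
tiny = np.array([[300, 300], [310, 300], [310, 310], [300, 310]], float)    # too small
```

A bowtie (crossed edges) fails the convexity test, and a ten-pixel quad fails the area test, even though both have four corners.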
```
CourtKeyNet/
├── courtkeynet/
│   ├── configs/
│   │   ├── courtkeynet.yaml   # Model hyperparameters
│   │   └── dataset.yaml       # Dataset configuration
│   ├── models/
│   │   ├── __init__.py
│   │   ├── courtkeynet.py     # Main architecture
│   │   ├── octave.py          # Octave Feature Extractor
│   │   ├── polar.py           # Polar Transform Attention
│   │   └── qcm.py             # Quadrilateral Constraint Module
│   ├── losses/
│   │   ├── __init__.py
│   │   └── geometric_loss.py  # Geometric Consistency Loss
│   ├── utils/
│   │   ├── __init__.py
│   │   ├── dataloader.py      # Dataset loader
│   │   └── metrics.py         # Evaluation metrics
│   ├── train.py               # Training script with wandb
│   ├── finetune.py            # Finetuning script
│   └── inference.py           # Inference/visualization
├── datasets/                  # Datasets (submodule)
├── requirements.txt
└── README.md
```
| Model | Details | Resolution | Download Links |
|---|---|---|---|
| CourtKeyNet-Base | Full Architecture | Native | 🤗 HuggingFace |
| CourtKeyNet-Fast | Light Architecture | Native | To be released |
Download models using huggingface-cli:

```bash
pip install "huggingface_hub[cli]"
huggingface-cli download Cracked-ANJ/CourtKeyNet --local-dir ./courtkeynet-base
```

The training scripts are located within the courtkeynet/ directory.
If you downloaded the HuggingFace weights, use finetune.py to adapt the model to your specific dataset:

```bash
cd courtkeynet
python finetune.py
```

(Note: The pre-trained weights are trained specifically on badminton court and tennis court images.)
To train a completely new model from scratch, use train.py:

```bash
cd courtkeynet
python train.py --data_root path/to/dataset
```

(Note: Training from scratch requires a minimum of 140,000 images for the model to learn features effectively. Fine-tuning to any court or similar geometry requires only 5,000 to 7,000 images.)
Depending on your workflow, you can specify custom configurations. There are two provided configurations:
- configs/courtkeynet.yaml: the primary configuration for training from scratch.
- configs/finetune.yaml: tailored for fine-tuning pre-trained weights (loaded automatically by finetune.py).
```bash
cd courtkeynet
python train.py \
    --data_root path/to/dataset \
    --cfg configs/courtkeynet.yaml \
    --data_cfg configs/dataset.yaml
```

To resume training from a checkpoint:

```bash
cd courtkeynet
python train.py \
    --data_root path/to/dataset \
    --resume runs/courtkeynet/exp/epoch_50.pt
```

To open the CourtKeyNet Inference Studio (GUI), run:
```bash
cd courtkeynet
python inference.py
```

(Note: The application will open a graphical interface allowing you to select both your model weights and target media files directly through the GUI menus.)
The Octave Feature Extractor processes visual information at multiple frequency bands:
- High-frequency path: Fine court details using Court-Specific Shape Kernels. (Note: To optimize learning, explicit L-shaped boundary kernels are turned off when training from scratch, as enforcing them too early hinders the model's ability to grasp broader contextual features and textures. They are meant to be utilized primarily during fine-tuning).
- Mid-frequency path: Structural patterns with Non-Local Self-Similarity
- Low-frequency path: Global context via Fourier Feature Encoder
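The actual module lives in courtkeynet/models/octave.py and uses learned kernels, non-local similarity, and Fourier features. As a toy intuition for the three-band split only, the hedged numpy sketch below decomposes an image into high/mid/low frequency bands with simple box blurs; none of these helpers come from the repo:

```python
import numpy as np

def box_blur(img, k):
    """Separable box blur with kernel width 2k+1 and edge padding,
    implemented with running sums along each axis."""
    out = np.asarray(img, dtype=float)
    n = 2 * k + 1
    for axis in (0, 1):
        pad = [(0, 0), (0, 0)]
        pad[axis] = (k, k)
        p = np.pad(out, pad, mode='edge')
        c = np.cumsum(p, axis=axis)
        zero = np.zeros_like(np.take(c, [0], axis=axis))
        c = np.concatenate([zero, c], axis=axis)      # prefix sums with leading 0
        hi = np.take(c, range(n, c.shape[axis]), axis=axis)
        lo = np.take(c, range(0, c.shape[axis] - n), axis=axis)
        out = (hi - lo) / n                           # windowed mean
    return out

rng = np.random.default_rng(0)
img = rng.random((64, 64))
low = box_blur(img, 8)            # global context   (low-frequency band)
mid = box_blur(img, 2) - low      # structural scale (mid-frequency band)
high = img - box_blur(img, 2)     # fine details     (high-frequency band)
# The three bands sum back to the original image exactly.
```

Each path of the real extractor then applies its own specialized operators (shape kernels, non-local self-similarity, Fourier encoding) to the band it is responsible for.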
Polar Transform Attention enhances boundary detection by processing features in polar coordinates, which are naturally suited to court boundaries radiating from the center.
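To illustrate why polar coordinates help, the sketch below resamples a feature map onto an (r, theta) grid, so boundaries at a fixed distance from the center become roughly horizontal bands. This nearest-neighbor version is purely illustrative; the actual Polar Transform Attention is defined in courtkeynet/models/polar.py:

```python
import numpy as np

def to_polar(feat, n_r=32, n_theta=64):
    """Nearest-neighbor resample of a 2-D map onto an (r, theta) grid
    centered on the map center."""
    h, w = feat.shape
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    rs = np.linspace(0.0, min(cy, cx), n_r)                  # radii: 0 .. edge
    ts = np.linspace(0.0, 2 * np.pi, n_theta, endpoint=False)
    R, T = np.meshgrid(rs, ts, indexing='ij')
    ys = np.clip(np.round(cy + R * np.sin(T)).astype(int), 0, h - 1)
    xs = np.clip(np.round(cx + R * np.cos(T)).astype(int), 0, w - 1)
    return feat[ys, xs]                                      # shape [n_r, n_theta]

# A square boundary around the center becomes a near-horizontal band in
# the polar view, which is easier for attention to pick out.
feat = np.zeros((41, 41))
feat[10, 10:31] = feat[30, 10:31] = 1.0   # top/bottom edges of a square
feat[10:31, 10] = feat[10:31, 30] = 1.0   # left/right edges
polar = to_polar(feat)
```

In the model this transform is applied to feature channels (with interpolation rather than rounding), and attention weights are computed over the polar grid.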
Combined loss function:
L_total = λ_kpt·L_kpt + λ_hm·L_hm + λ_edge·L_edge + λ_diag·L_diag + λ_angle·L_angle
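A hedged numpy sketch of how the geometric terms could be computed on a 4-corner quad. The exact definitions and λ weights are given in the paper; the forms below (L1 distances on edge lengths, diagonals, and interior angles) and the example weights are illustrative:

```python
import numpy as np

def quad_geometry_losses(pred, gt):
    """Geometric terms on 4 ordered corners (TL, TR, BR, BL), shape [4, 2].
    Returns (l_kpt, l_edge, l_diag, l_angle); illustrative definitions only."""
    def edges(k):                                  # 4 side lengths
        return np.linalg.norm(np.roll(k, -1, axis=0) - k, axis=1)
    def diags(k):                                  # 2 diagonal lengths
        return np.array([np.linalg.norm(k[2] - k[0]),
                         np.linalg.norm(k[3] - k[1])])
    def angles(k):                                 # 4 interior angles (radians)
        out = []
        for i in range(4):
            u = k[(i - 1) % 4] - k[i]
            v = k[(i + 1) % 4] - k[i]
            cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-8)
            out.append(np.arccos(np.clip(cos, -1.0, 1.0)))
        return np.array(out)
    l_kpt = np.abs(pred - gt).mean()               # stand-in for the keypoint loss
    l_edge = np.abs(edges(pred) - edges(gt)).mean()
    l_diag = np.abs(diags(pred) - diags(gt)).mean()
    l_angle = np.abs(angles(pred) - angles(gt)).mean()
    return l_kpt, l_edge, l_diag, l_angle

# Hypothetical weights; L_hm (the heatmap term) is omitted here for brevity.
lam = dict(kpt=1.0, edge=0.5, diag=0.5, angle=0.25)
quad = np.array([[0, 0], [4, 0], [4, 3], [0, 3]], float)
zero_losses = quad_geometry_losses(quad, quad)          # identical quads -> all 0
shift_losses = quad_geometry_losses(quad + 1.0, quad)   # translation: only l_kpt > 0
```

Note that the edge, diagonal, and angle terms are invariant to translation, which is exactly why they act as shape regularizers rather than duplicating the keypoint loss.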
| GPU | Batch Size | Total Duration (120 epochs) |
|---|---|---|
| RTX 5090 | 32 | ~48 hours (2 Days) |
| RTX 4090 | 24 | ~72 hours (3 Days) |
| RTX 3090 | 16 | ~108 hours (4.5 Days) |
| GPU | Batch Size | Total Duration |
|---|---|---|
| RTX 5090 | 32 | ~35 hours (~1.4 Days) |
| RTX 4090 | 24 | ~52 hours (~2.2 Days) |
| RTX 3090 | 16 | ~79 hours (~3.3 Days) |
If you use CourtKeyNet in your research, please cite:
Paper Link: CourtKeyNet: A novel octave-based architecture for precision badminton court detection with geometric constraints
DOI: https://doi.org/10.1016/j.mlwa.2026.100884
```bibtex
@article{NRAJ2026100884,
  title   = {CourtKeyNet: A novel octave-based architecture for precision badminton court detection with geometric constraints},
  journal = {Machine Learning with Applications},
  volume  = {24},
  pages   = {100884},
  year    = {2026},
  issn    = {2666-8270},
  doi     = {10.1016/j.mlwa.2026.100884},
  url     = {https://www.sciencedirect.com/science/article/pii/S2666827026000496},
  author  = {Adithya N Raj and Prethija G.}
}
```

This project is released under the MIT License, suitable for both academic and commercial use.
For questions or collaboration opportunities, please contact adithyanraj03@gmail.com.







