
Automated Leaderboard System for Hackathon Evaluation Using Large Language Models

The code for the paper Automated Leaderboard System for Hackathon Evaluation Using Large Language Models

Architecture

[Figure: Architecture diagram]

Overview of the system architecture, illustrating the processing pipeline for Jupyter notebook submissions, from initial raw data intake to the final predicted results.

Results

[Figure: Bland-Altman plot]

The Bland-Altman analysis [22] reveals a mean difference (bias) of 27.5 points: the LLM scores are, on average, 27.5 points higher than the technical scores, roughly 6.9% of the maximum technical score. The 95% limits of agreement (−6.83 to 61.83) indicate that most differences fall within a 68.66-point range, which aligns with typical inter-rater variability in manual grading and supports the reliability of our hybrid evaluation approach.
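
For reference, these numbers are consistent with the standard Bland-Altman construction, in which the limits of agreement are the mean difference plus or minus 1.96 standard deviations of the per-submission differences d_i = LLM_i − technical_i; the spread shown below is inferred from the reported limits rather than stated in the paper:

$$\mathrm{LoA} = \bar{d} \pm 1.96\,s_d = 27.5 \pm 34.33 = (-6.83,\ 61.83), \qquad s_d \approx \frac{34.33}{1.96} \approx 17.5$$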

Deployment

How to download the submissions manually

Prepare your Kaggle API credentials: create an API token on your Kaggle account page and place the downloaded kaggle.json in ~/.kaggle/. Then install the client:

pip install kaggle

Run retrieve-competition.py; it downloads all the submission files and converts each one to a .md file. A rough sketch of this flow is shown below.
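
The following is a minimal sketch of the download-and-convert flow, not the actual script: it assumes the kaggle and nbconvert Python packages, a kaggle.json token in place, and a placeholder competition slug my-competition. The public API's competition file download stands in here for whatever retrieval retrieve-competition.py actually performs.

```python
# Sketch: download competition files and convert notebooks to Markdown.
# "my-competition" is a placeholder slug; the real script may differ.
import zipfile
from pathlib import Path

from kaggle.api.kaggle_api_extended import KaggleApi
from nbconvert import MarkdownExporter

api = KaggleApi()
api.authenticate()  # reads ~/.kaggle/kaggle.json

out = Path("submissions")
out.mkdir(exist_ok=True)

# Download and unpack the competition archive.
api.competition_download_files("my-competition", path=str(out))
for archive in out.glob("*.zip"):
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(out)

# Convert every notebook to a .md file alongside it.
exporter = MarkdownExporter()
for nb in out.rglob("*.ipynb"):
    body, _ = exporter.from_filename(str(nb))
    nb.with_suffix(".md").write_text(body, encoding="utf-8")
```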

How to launch the server

You need Node.js installed on your machine. You can also create the SQLite database file results.db ahead of time (see the sketch after the commands below).

npm install
node index.js
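
If you want to create results.db up front, here is a minimal sketch using Python's standard sqlite3 module. The table name and columns are illustrative only, not the schema index.js actually expects; check the server code for the real layout.

```python
# Sketch: create an empty results.db. The schema below is illustrative;
# consult the server code for the columns it actually uses.
import sqlite3

con = sqlite3.connect("results.db")
con.execute(
    """CREATE TABLE IF NOT EXISTS results (
           submission TEXT PRIMARY KEY,
           score      REAL,
           feedback   TEXT
       )"""
)
con.commit()
con.close()
```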

How to run the marking manually

node mark.js

Citation

@software{Li_Automated_Leaderboard_System_2025,
author = {Li, Bowen and Cheng, Bohan and Talyor, Patrick and Osborne, Dale and Han, Fengling and Shen, Robert and Gondal, Iqbal},
doi = {<>},
month = mar,
title = {{Automated Leaderboard System for Hackathon Evaluation Using Large Language Models}},
url = {https://github.com/SkywardAI/hackathon-leaderboard},
version = {1.0.0},
year = {2025}
}