EduScale

EduScale is a research monorepo for improving the readability of low-resolution educational videos on budget Android devices.

The study combines slide-aware super-resolution, OCR-aware evaluation, lightweight SPAN models, and a packaged Android app for offline enhancement.

The goal is practical: make lecture slides, classroom recordings, and screen-captured educational videos easier to read after upscaling from 240p or 360p to 720p.

Accepted Research Proposal

Title: EduScale: A Segment-Based Video Super-Resolution Systems for Budget Android Devices in Philippine Educational Settings
Authors: Vanessa S. Bermil, James Ryan S. Gallego, John Harold C. Patron, Alexander R. Santos
Institution: Partido State University, Bachelor of Science in Computer Science
Adviser: Shane C. Briones
Date: April 2026

Study Snapshot

Area	Current status
Retained model run folders	30
Runs with training metrics	30
Completed training epoch rows	1,359
Held-out x2 evaluation pairs	291
Held-out x3 evaluation pairs	291
Benchmark summary files	33
Per-sample evaluation CSVs	20
Android-packaged TFLite models	2

Primary evidence files:

Training Metric Charts

The charts below compare the retained x2 and x3 SPAN education model metrics over training epochs.

Quick Start

1. Clone the repository

git clone https://github.com/james719-code/EduScale.git
cd EduScale

2. Install Python dependencies

Create or activate a virtual environment, then install the project dependencies:

.\.venv\Scripts\python.exe -m pip install -r requirements.txt
.\.venv\Scripts\python.exe -m pip install -r requirements-optional.txt

3. Run the Android app build

cd android
.\gradlew.bat :app:assembleDebug

4. Run benchmark evaluation

Return to the repository root, then run:

.\.venv\Scripts\python.exe scripts/benchmark.py

For Evaluators

Start with:

TRAINING_REPORT.md for full training and benchmark history.
benchmarks/results for saved metric outputs.
android/app/src/main/assets for packaged TFLite model files.
datasets/manifests/heldout for held-out evaluation pairs.
docs/DATASET_SOURCES.md for dataset citations and redistribution notes.

Android App Status

Area	Status
Platform	Android
Model runtime	TensorFlow Lite
Processing mode	Offline / on-device
Current packaged models	x2 and x3 SPAN TFLite models
Target content	Lecture slides, classroom recordings, and screen-captured educational videos
Target devices	Budget Android phones
Cloud requirement	None for local enhancement

Tested Environment

Component	Environment
Android test device	Realme Note 50
Android model format	TensorFlow Lite
Training/evaluation OS	Windows
Python environment	Virtual environment
Android build tool	Gradle

Current Android Models

EduScale TensorFlow Lite model exports are also available on Hugging Face: jimzzzz/EduScale.

Scale	Source checkpoint	Packaged model	Size
x2	`models/span/education-finetuned/x2-v3-tpgsr-refine-20260411/last_model.pt`	`android/app/src/main/assets/span_education_x2.tflite`	971 KB
x3	`models/span/education-finetuned/x3-real-refine-20260412/best_model.pt`	`android/app/src/main/assets/span_education_x3.tflite`	1.18 MB

Expected use:

Input	Model	Target
360p educational video	x2 SPAN	720p readable output
240p educational video	x3 SPAN	720p readable output

Selected Model Rationale

The x2 model x2-v3-tpgsr-refine-20260411/last_model.pt was selected for Android packaging because it achieved the strongest readability-oriented result among the selected x2 candidates, reaching 91.2295 OCR confidence and 0.0765 CER on the 291-sample updated held-out set.

The x3 model x3-real-refine-20260412/best_model.pt was selected for stronger enhancement of lower-resolution 240p educational videos, balancing readability improvement and runtime.

Benchmark Highlights

These numbers are from saved repository artifacts, not a fresh live rerun. OCR confidence is higher-is-better. Character error rate, or CER, is lower-is-better.

Updated Held-Out Evaluation

Model run	Scale	Samples	PSNR	SSIM	Runtime / image	OCR confidence	CER	Evidence
`x2-all-20260410`	x2	291	29.9002	0.9813	719.159 ms	90.9905	0.1016	`x2-all-20260410-heldout-updated-ocr-summary.json`
`x2-all-20260410-ocr-refine`	x2	291	29.3532	0.9758	762.6484 ms	90.9805	0.0814	`x2-all-20260410-ocr-refine-heldout-updated-ocr-summary.json`
`x2-v3-tpgsr-refine-20260411`	x2	291	29.4579	0.9786	572.7827 ms	91.2295	0.0765	`x2-v3-tpgsr-refine-20260411-last-heldout-updated-summary.json`
`x3-real-refine-20260412`	x3	291	26.6166	0.9642	292.9324 ms	88.8627	0.1918	`x3-real-refine-20260412-heldout-updated-summary.json`
`x3-real-refine-20260412` last	x3	291	26.6002	0.9642	288.0681 ms	89.1655	0.2058	`x3-real-refine-20260412-last-heldout-updated-summary.json`

Improvement Over Time

Date	Change	Result
2026-03-08 to 2026-03-11	Baseline x2/x3 SPAN experiments, TPGSR variants, real-video runs, and synthetic-plus-PPT mixes	Established the first reproducible quality and OCR benchmark matrix
2026-04-09 to 2026-04-10	Expanded x2 training with larger all-source and raw PowerPoint manifests	`x2-all-20260410` reached the strongest saved x2 PSNR/SSIM on the 291-sample updated held-out set: 29.9002 PSNR and 0.9813 SSIM
2026-04-10	Added OCR-focused x2 refinement	CER improved from 0.1016 on `x2-all-20260410` to 0.0814 on `x2-all-20260410-ocr-refine`
2026-04-11 to 2026-04-12	Refined app-backed x2 TPGSR checkpoint	`x2-v3-tpgsr-refine-20260411/last_model.pt` reached 91.2295 OCR confidence and 0.0765 CER on the updated held-out set
2026-04-12	Refined x3 real-video checkpoint	Compared with `x3-real`, `x3-real-refine-20260412` improved PSNR by 1.6661, SSIM by 0.0326, and CER by 0.0285 on the same updated held-out set
2026-04-13 to 2026-04-14	Continued x3 v4 refinement and OCR sweep experiments	Added longer-running ablation evidence for x3 model selection in `TRAINING_REPORT.md`

Study Design

EduScale evaluates super-resolution as a readability problem, not only as an image reconstruction problem.

Metric	Why it matters
PSNR	Measures pixel-level reconstruction quality against the high-resolution target
SSIM	Measures structural similarity, useful for slide edges, diagrams, and text blocks
Runtime	Tracks whether a model is practical for mobile or offline processing
OCR confidence	Estimates whether text becomes more readable to an OCR engine
CER	Measures text recognition errors; lower CER usually means more usable educational content

The active held-out manifests are:

datasets/manifests/heldout/heldout_real_x2.csv
datasets/manifests/heldout/heldout_real_x3.csv

Dataset Status

EduScale uses a combination of public super-resolution data, Kaggle PPT slide data, and author-created educational slide/video-frame data.

Dataset component	Status
Held-out x2 manifest	Available in `datasets/manifests/heldout/heldout_real_x2.csv`
Held-out x3 manifest	Available in `datasets/manifests/heldout/heldout_real_x3.csv`
PPT source data	Some slide data comes from Kaggle's Dataset for ppt
Author-created educational frames	Used for readability-focused evaluation
Full raw dataset	Not committed to Git
Source and redistribution notes	Documented in docs/DATASET_SOURCES.md

The full raw dataset is not included in the repository because of size, licensing, and redistribution restrictions.

Monorepo Layout

Path	Role in the study
android	Kotlin Android app, app assets, workers, and offline model packaging
models	SPAN implementation, training code, checkpoints, conversion scripts, and optimized TFLite models
datasets	Dataset manifests, held-out splits, and data preparation utilities
benchmarks	Benchmark scripts plus saved PSNR, SSIM, runtime, OCR confidence, and CER outputs
scripts	End-to-end training, evaluation, statistics, and pipeline helpers
tools	Desktop analysis tools, video analyzer, quality comparator, reporting utilities, and document helpers
experiments	Research notebooks for ablations, optimization, frame selection, and OCR analysis
docs	User guide, development guide, Android notes, dataset docs, model docs, and benchmark docs
thesis	Thesis chapters, references, figures, proposal materials, and generated exports
config	App, model, device, and processing configuration

Reproduce The Main Evaluation

Create or activate a virtual environment, then install the Python dependencies:

.\.venv\Scripts\python.exe -m pip install -r requirements.txt
.\.venv\Scripts\python.exe -m pip install -r requirements-optional.txt

Run an x2 held-out evaluation:

.\.venv\Scripts\python.exe -m models.training.evaluation `
  --pairs_csv datasets/manifests/heldout/heldout_real_x2.csv `
  --checkpoint models/span/education-finetuned/x2-v3-tpgsr-refine-20260411/last_model.pt `
  --output_csv benchmarks/results/x2-v3-tpgsr-refine-20260411-last-heldout-updated-eval.csv `
  --output_summary_json benchmarks/results/x2-v3-tpgsr-refine-20260411-last-heldout-updated-summary.json `
  --include_ocr_metrics

Run an x3 held-out evaluation:

.\.venv\Scripts\python.exe -m models.training.evaluation `
  --pairs_csv datasets/manifests/heldout/heldout_real_x3.csv `
  --checkpoint models/span/education-finetuned/x3-real-refine-20260412/best_model.pt `
  --output_csv benchmarks/results/x3-real-refine-20260412-heldout-updated-eval.csv `
  --output_summary_json benchmarks/results/x3-real-refine-20260412-heldout-updated-summary.json `
  --include_ocr_metrics

Run the broader benchmark matrix:

.\.venv\Scripts\python.exe scripts/benchmark.py

Android Build

Compile the Android app from the repository root:

cd android
.\gradlew.bat :app:compileDebugKotlin

The app models are expected in:

android/app/src/main/assets/span_education_x2.tflite
android/app/src/main/assets/span_education_x3.tflite

To refresh app assets from optimized model exports:

Copy-Item models/span/optimized/span_education_x2.tflite android/app/src/main/assets/span_education_x2.tflite -Force
Copy-Item models/span/optimized/span_education_x3.tflite android/app/src/main/assets/span_education_x3.tflite -Force

Common Commands

Task	Command
Install base dependencies	`pip install -r requirements.txt`
Install optional dependencies	`pip install -r requirements-optional.txt`
Run benchmark matrix	`python scripts/benchmark.py`
Build Android debug app	`Push-Location android; .\gradlew.bat :app:assembleDebug; Pop-Location`
Compile Android Kotlin	`Push-Location android; .\gradlew.bat :app:compileDebugKotlin; Pop-Location`
Copy optimized x2 model to Android assets	`Copy-Item models/span/optimized/span_education_x2.tflite android/app/src/main/assets/span_education_x2.tflite -Force`
Copy optimized x3 model to Android assets	`Copy-Item models/span/optimized/span_education_x3.tflite android/app/src/main/assets/span_education_x3.tflite -Force`

Development Workflow

Prepare or update dataset manifests in datasets.
Train SPAN checkpoints with models/training.
Evaluate PSNR, SSIM, runtime, OCR confidence, and CER through benchmarks.
Convert selected checkpoints through models/conversion.
Package TFLite assets into android/app/src/main/assets.
Record results in benchmarks/results and summarize progress in TRAINING_REPORT.md.

Documentation Map

Need	Start here
Install and run the project	docs/user-guide/installation.md
Dataset organization	datasets/README.md
Model training and conversion	models/README.md
Benchmarking workflow	benchmarks/README.md
Dataset citations and redistribution	docs/DATASET_SOURCES.md
Android packaging	docs/guides/android.md
Tooling	docs/guides/tools.md
Thesis materials	thesis/README.md
Full training history	TRAINING_REPORT.md
Public release checklist	docs/PUBLIC_RELEASE_CHECKLIST.md
Details to confirm	docs/RESEARCH_QUESTIONS.md

License, Citation, And Contributions

Need	File
Source license	LICENSE
Contribution guidelines	CONTRIBUTING.md
Citation metadata	CITATION.cff
Dataset source and redistribution notes	docs/DATASET_SOURCES.md

What Is Included

Android app source code
TensorFlow Lite model assets
SPAN training/evaluation code
Dataset manifests
Held-out evaluation CSVs
Benchmark summaries
Thesis/research documentation

What Is Not Included

Full raw datasets
Generated video outputs
Large model checkpoints not required for Android packaging
Temporary logs and local experiment outputs
Thesis export files such as PDF, DOCX, or HTML builds

Repository Maturity

Component	Maturity
Android app	Prototype / thesis implementation
TFLite model packaging	Working research snapshot
Training pipeline	Research workflow
Benchmarking pipeline	Reproducible from saved manifests and scripts
Dataset pipeline	Manifest-based, raw data excluded
Deployment	Not yet production-released

Repository Hygiene

This repository keeps source code, manifests, documentation, and benchmark summaries visible. Generated data and heavy local artifacts are ignored by .gitignore, including:

PDF, HTML, Office, and thesis export files
Python caches, pytest temp folders, and virtual environments
Android and Gradle build outputs
generated datasets, raw slides, rendered frames, and logs
model checkpoints, exported binaries, and downloaded installers

Keep benchmark summaries and manifest CSVs when they are needed to reproduce a reported result. Keep raw datasets, generated videos, and temporary exports outside Git.

Research and Release Status

EduScale is shared as a research prototype and implementation monorepo for an Android-based educational video enhancement system. The repository is intended for source review, thesis evaluation, and reproducibility review rather than production deployment.

The public repository includes the Android application source, training and evaluation workflows, benchmark summaries, dataset manifests, documentation, and the Android-packaged TensorFlow Lite model assets. TensorFlow Lite exports are also mirrored on Hugging Face.

Raw datasets, generated video outputs, large intermediate checkpoints, and thesis export files are intentionally excluded from Git because of size, licensing, and redistribution constraints. Dataset sources and reuse notes are documented in docs/DATASET_SOURCES.md.

Source code and documentation are released under the MIT License. Academic citation metadata is available in CITATION.cff, and contribution guidance is available in CONTRIBUTING.md.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
android		android
benchmarks		benchmarks
config		config
datasets		datasets
docs		docs
experiments		experiments
images		images
models		models
scripts		scripts
tests		tests
thesis		thesis
tools		tools
.gitignore		.gitignore
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
TRAINING_REPORT.md		TRAINING_REPORT.md
requirements-optional.txt		requirements-optional.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

EduScale

Study Snapshot

Training Metric Charts

Quick Start

1. Clone the repository

2. Install Python dependencies

3. Run the Android app build

4. Run benchmark evaluation

For Evaluators

Android App Status

Tested Environment

Current Android Models

Selected Model Rationale

Benchmark Highlights

Updated Held-Out Evaluation

Improvement Over Time

Study Design

Dataset Status

Monorepo Layout

Reproduce The Main Evaluation

Android Build

Common Commands

Development Workflow

Documentation Map

License, Citation, And Contributions

What Is Included

What Is Not Included

Repository Maturity

Repository Hygiene

Research and Release Status

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages