Layout Parser + Handwriting Recognition

Detects layout regions (titles, paragraphs, lists) in handwritten note images using DocLayout-YOLO, then runs a custom character-level OCR model to convert them into structured Markdown text.

Requirements

Python 3.11 — TensorFlow does not support 3.12+ on Windows yet
Git

Setup

1. Clone the repo

git clone https://github.com/Gilliooo/Layout-Parser-ComputerVision.git
cd Layout-Parser-ComputerVision

2. Install Python 3.11

Windows:

winget install Python.Python.3.11

Mac / Linux: Download from python.org

3. Create a virtual environment

Windows (PowerShell):

Set-ExecutionPolicy -Scope CurrentUser -ExecutionPolicy RemoteSigned
py -3.11 -m venv .venv --without-pip
.venv\Scripts\Activate.ps1
python -m ensurepip --upgrade

Mac / Linux:

python3.11 -m venv .venv
source .venv/bin/activate

4. Install dependencies

python -m pip install -r requirements.txt

5. Run the app

python -m streamlit run app.py

On first run with DocLayout-YOLO mode selected, the model weights (~100 MB) are automatically downloaded from Hugging Face and cached locally.

Notes

handwriting_recognition_model.keras and classes.json are included in the repo — no manual download needed.
The DocLayout-YOLO weights are fetched from juliozhao/DocLayout-YOLO-DocStructBench on first use.
Use python -m streamlit instead of just streamlit to ensure the venv's Python is used.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
Computer Vision Project_Report.pdf		Computer Vision Project_Report.pdf
README.md		README.md
app.py		app.py
classes.json		classes.json
doclayout_module.py		doclayout_module.py
handwriting_recognition_model.keras		handwriting_recognition_model.keras
ocr_module.py		ocr_module.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Layout Parser + Handwriting Recognition

Requirements

Setup

1. Clone the repo

2. Install Python 3.11

3. Create a virtual environment

4. Install dependencies

5. Run the app

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Layout Parser + Handwriting Recognition

Requirements

Setup

1. Clone the repo

2. Install Python 3.11

3. Create a virtual environment

4. Install dependencies

5. Run the app

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages