OCR-Challenge

An end-to-end MLops pipeline project focused on optical character recognition problems. Monitored, fully reproducible, and compliant with auditing needs.

Directory Overview:

ml: Jupyter notebooks, scripts, and supporting libs for training text extraction models from images.
dataset: .gitignore'd directory for download and processing data during development.
model: .gitignore'd directory for outputting/downloading models for deployment/EDA.
service: FastAPI service to serve inference results over HTTP.
deployment: Terraform scripts and Dockerfiles used to deploy the model and service to GCP.

Details are available in each directory's README.md.

Setup venv

python3.11 -m venv .venv-textflow
source .venv-textflow/bin/activate
pip install --upgrade pip
pip install -e .

Dependencies are tracked in pyproject.toml. Keep the list lean to avoid bloated Docker images.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OCR-Challenge

Directory Overview:

Setup venv

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
api		api
build/lib/ml		build/lib/ml
dataset		dataset
deployment		deployment
ml		ml
model		model
pipeline		pipeline
service		service
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

michaelhsj/TextFlow

Folders and files

Latest commit

History

Repository files navigation

OCR-Challenge

Directory Overview:

Setup venv

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages