WarpLess Docs

Production-ready document cleanup API for real-world camera-captured forms.

Frame correction → shadow removal → template-handle dewarping → adaptive histogram normalization → single final image output.

What is WarpLess Docs?

WarpLess Docs turns a noisy phone photo of a document into a clean, scanner-like output.

It is built as a modular computer-vision pipeline with an HTTP API, progress tracking, and a single final image result.

The pipeline handles:

camera perspective and document frame correction
shadow removal with an ONNX document model
template feature matching and handle-based dewarping
adaptive histogram normalization for a brighter, more readable final scan

Pipeline Preview

Input	Frame	Deshadow	Template Handles	Dewarp	Final

Processing Stages

input image
    ↓
1. Frame correction
    ↓
2. Shadow removal
    ↓
3. Template-handle dewarping
    ↓
4. Adaptive histogram normalization
    ↓
single final PNG result

The API returns one final output file. Intermediate images are only used internally.

Project Structure

WarpLess-Docs/
├── api.py
├── run_pipeline.py
├── scan_document_frame.py
├── main.py
├── dewarp_with_template_handles.py
├── enhance_document.py
├── input/
│   ├── samples/
│   └── template/
├── models/
│   └── docshadow_sd7k.onnx
├── outputs/
│   ├── api_jobs/
│   ├── framed/
│   ├── deshadowed/
│   ├── template_rectified/
│   ├── template_dewarped/
│   └── final/
└── src/
    └── warpless_docs/
        ├── document_frame/
        ├── shadow_removal/
        ├── template_rectification/
        ├── enhancement/
        └── pipeline/

Installation

git clone https://github.com/ehsanwwe/WarpLess-Docs.git
cd WarpLess-Docs
python -m venv .venv

Windows PowerShell:

.\.venv\Scripts\Activate.ps1
pip install -r requirements.txt

Linux / macOS:

source .venv/bin/activate
pip install -r requirements.txt

Recommended Python:

Python 3.11 or 3.12

Download the Shadow Removal Model

The ONNX model is not committed because it is a large binary file.

mkdir -p models
curl -L "https://github.com/fabio-sim/DocShadow-ONNX-TensorRT/releases/download/v1.0.0/docshadow_sd7k.onnx" -o "models/docshadow_sd7k.onnx"

Windows PowerShell:

Invoke-WebRequest `
  -Uri "https://github.com/fabio-sim/DocShadow-ONNX-TensorRT/releases/download/v1.0.0/docshadow_sd7k.onnx" `
  -OutFile "models\docshadow_sd7k.onnx"

Expected path:

models/docshadow_sd7k.onnx

Prepare the Template

Put the clean flat template image here:

input/template/template_page_1.png

The template should match the same form layout as the uploaded document.

Run as an API

Start the server:

uvicorn api:app --reload

Open the interactive docs:

http://127.0.0.1:8000/docs

Health check:

curl http://127.0.0.1:8000/health

API Workflow with Progress

1. Upload an image

curl -X POST \
  -F "file=@input/samples/Untitled5.jpg" \
  http://127.0.0.1:8000/api/v1/process

Response:

{
  "job_id": "2f4c...",
  "status": "queued",
  "progress": 0,
  "stage": "Queued",
  "status_url": "/api/v1/jobs/2f4c...",
  "result_url": "/api/v1/jobs/2f4c.../result"
}

2. Poll progress

curl http://127.0.0.1:8000/api/v1/jobs/YOUR_JOB_ID

Example progress response:

{
  "status": "running",
  "progress": 70,
  "stage": "Applying handle-based template dewarping"
}

Progress stages:

10%  Frame correction
30%  Shadow removal
55%  Template feature matching
70%  Handle-based dewarping
90%  Adaptive normalization
100% Completed

3. Download the final result

curl -L \
  -o result.png \
  http://127.0.0.1:8000/api/v1/jobs/YOUR_JOB_ID/result

The result endpoint returns only one file:

result.png

Synchronous API for Quick Tests

For demos or scripts where progress is not needed:

curl -X POST \
  -F "file=@input/samples/Untitled5.jpg" \
  -o result.png \
  http://127.0.0.1:8000/api/v1/process-sync

API Endpoints

Method	Path	Description
`GET`	`/health`	Service health check
`POST`	`/api/v1/process`	Upload image and start async processing
`GET`	`/api/v1/jobs/{job_id}`	Read status and progress
`GET`	`/api/v1/jobs/{job_id}/result`	Download final PNG
`GET`	`/api/v1/jobs/{job_id}/metadata`	Get technical stage metadata
`DELETE`	`/api/v1/jobs/{job_id}`	Delete job files

Environment Variables

WARPLESS_MODEL_PATH      defaults to models/docshadow_sd7k.onnx
WARPLESS_TEMPLATE_PATH   optional explicit template image
WARPLESS_API_OUTPUT_DIR  defaults to outputs/api_jobs
WARPLESS_CPU             set to 1 to force CPU

Example:

WARPLESS_CPU=1 uvicorn api:app --reload

Windows PowerShell:

$env:WARPLESS_CPU="1"
uvicorn api:app --reload

Run the Pipeline Locally

The API uses the same pipeline as the CLI scripts.

python run_pipeline.py

Manual execution:

python scan_document_frame.py
python main.py
python dewarp_with_template_handles.py
python enhance_document.py

Python API

from pathlib import Path
from warpless_docs.pipeline import DocumentPipeline

pipeline = DocumentPipeline()

image_bytes = Path("input/samples/Untitled5.jpg").read_bytes()

final_image, metadata = pipeline.process_bytes(
    image_bytes=image_bytes,
    progress=lambda percent, stage: print(percent, stage),
)

final_image.save("result.png")

Why this project is useful

WarpLess Docs is designed like a real production preprocessing layer for document AI:

it accepts imperfect camera photos
it reports progress for UI integration
it returns a single clean output
it keeps heavy ML inference behind a simple API
it exposes metadata for debugging and evaluation
it is modular enough to replace or improve each stage independently

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
input		input
models		models
outputs		outputs
src/warpless_docs		src/warpless_docs
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api.py		api.py
detect_warp.py		detect_warp.py
dewarp_with_template_handles.py		dewarp_with_template_handles.py
enhance_document.py		enhance_document.py
main.py		main.py
pyproject.toml		pyproject.toml
rectify_with_template.py		rectify_with_template.py
requirements-api.txt		requirements-api.txt
requirements.txt		requirements.txt
run_pipeline.py		run_pipeline.py
scan_document_frame.py		scan_document_frame.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WarpLess Docs

What is WarpLess Docs?

Pipeline Preview

Processing Stages

Project Structure

Installation

Download the Shadow Removal Model

Prepare the Template

Run as an API

API Workflow with Progress

1. Upload an image

2. Poll progress

3. Download the final result

Synchronous API for Quick Tests

API Endpoints

Environment Variables

Run the Pipeline Locally

Python API

Why this project is useful

Roadmap

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

WarpLess Docs

What is WarpLess Docs?

Pipeline Preview

Processing Stages

Project Structure

Installation

Download the Shadow Removal Model

Prepare the Template

Run as an API

API Workflow with Progress

1. Upload an image

2. Poll progress

3. Download the final result

Synchronous API for Quick Tests

API Endpoints

Environment Variables

Run the Pipeline Locally

Python API

Why this project is useful

Roadmap

License

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages