Release Japan OCR Mini Benchmark v0.1.0 · K10124/japan-ocr-mini-benchmark-public

Initial public sample release of Japan OCR Mini Benchmark.

This release includes:

5 synthetic noisy Japanese receipt images
Ground-truth JSON files
Qwen3.6 35B A3B model output JSON files
Python evaluation script
Experiment log
Failure case notes
License and synthetic data notice

The benchmark focuses on Japanese receipt OCR/VLM extraction, including item names, amounts, tax target fields, discounts, point usage, payment amount, cash received, and change.

Known failure case:
Qwen3.6 35B A3B made measurable errors on receipt_005_noisy.png, including small tax target amount errors and dakuten/handakuten item-name errors.

Note: Qwen3.6 35B A3B results were generated using a Q4_K_M GGUF quantized model in LM Studio.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Japan OCR Mini Benchmark v0.1.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!