Japan OCR Mini Benchmark v0.1.0
Initial public sample release of Japan OCR Mini Benchmark.
This release includes:
- 5 synthetic noisy Japanese receipt images
- Ground-truth JSON files
- Qwen3.6 35B A3B model output JSON files
- Python evaluation script
- Experiment log
- Failure case notes
- License and synthetic data notice
The benchmark focuses on Japanese receipt OCR/VLM extraction, including item names, amounts, tax target fields, discounts, point usage, payment amount, cash received, and change.
Known failure case:
Qwen3.6 35B A3B made measurable errors on receipt_005_noisy.png, including small tax target amount errors and dakuten/handakuten item-name errors.
Note: Qwen3.6 35B A3B results were generated using a Q4_K_M GGUF quantized model in LM Studio.