Japan OCR Mini Benchmark v0.1.0

@K10124 K10124 released this 06 Jun 14:44
· 19 commits to main since this release

Initial public sample release of Japan OCR Mini Benchmark.

This release includes:

  • 5 synthetic noisy Japanese receipt images
  • Ground-truth JSON files
  • Qwen3.6 35B A3B model output JSON files
  • Python evaluation script
  • Experiment log
  • Failure case notes
  • License and synthetic data notice

The benchmark focuses on Japanese receipt OCR/VLM extraction, including item names, amounts, tax target fields, discounts, point usage, payment amount, cash received, and change.
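
As a rough illustration of how extracted fields like these could be scored against ground truth, here is a minimal sketch. It is not the evaluation script shipped in this release: the field names (`payment_amount`, `cash_received`, `change`) and the sample values are hypothetical, and the actual JSON schema may differ.

```python
# Hypothetical field names; the release's actual JSON schema may differ.
SCALAR_FIELDS = ["payment_amount", "cash_received", "change"]


def field_accuracy(truth: dict, pred: dict, fields=SCALAR_FIELDS) -> float:
    """Return the fraction of ground-truth fields that the prediction matches exactly."""
    # Only score fields that actually exist in the ground truth.
    scored = [f for f in fields if f in truth]
    if not scored:
        return 0.0
    # A missing predicted field counts as a miss because pred.get() returns None.
    hits = sum(1 for f in scored if pred.get(f) == truth[f])
    return hits / len(scored)


# Made-up values for illustration only, not taken from the benchmark data.
truth = {"payment_amount": 1280, "cash_received": 2000, "change": 720}
pred = {"payment_amount": 1280, "cash_received": 2000, "change": 700}
print(field_accuracy(truth, pred))  # 2 of the 3 fields match
```

Exact matching is the strictest choice; a real evaluation might score item lists and amounts separately.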

Known failure case:
Qwen3.6 35B A3B made measurable errors on receipt_005_noisy.png, including small errors in tax target amounts and errors in the dakuten/handakuten (voiced and semi-voiced sound marks) of item names.
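
One way to flag item-name mismatches that differ only in dakuten/handakuten is to compare the strings after Unicode decomposition, with the combining marks U+3099 and U+309A removed. The sketch below is an assumption about how such errors could be detected, not how this release classified them; the function names and example strings are hypothetical.

```python
import unicodedata

# Combining dakuten (U+3099) and combining handakuten (U+309A).
VOICING_MARKS = {"\u3099", "\u309a"}


def strip_voicing_marks(text: str) -> str:
    """Decompose kana (NFD) and drop combining dakuten/handakuten."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if ch not in VOICING_MARKS)
    return unicodedata.normalize("NFC", stripped)


def is_voicing_mark_error(truth: str, pred: str) -> bool:
    """True if the strings differ, but only in dakuten/handakuten."""
    same_text = unicodedata.normalize("NFC", truth) == unicodedata.normalize("NFC", pred)
    return (not same_text) and strip_voicing_marks(truth) == strip_voicing_marks(pred)


# Made-up examples for illustration only, not taken from the benchmark data.
print(is_voicing_mark_error("ビール", "ヒール"))  # dakuten dropped
print(is_voicing_mark_error("ポテト", "ボテト"))  # handakuten read as dakuten
print(is_voicing_mark_error("ビール", "ビーノ"))  # a different kind of error
```

This handles precomposed full-width kana; half-width katakana and the standalone marks U+309B/U+309C would need extra normalization.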

Note: The Qwen3.6 35B A3B results were generated with a Q4_K_M-quantized GGUF model running in LM Studio.