-
Notifications
You must be signed in to change notification settings - Fork 0
FAQ
Gaurav14cs17 edited this page Jun 21, 2026
·
1 revision
FlashOCR-m achieves 400+ FPS on GPU and 60+ FPS on edge devices with only 1.5M parameters for text recognition.
Any custom charset can be specified. Default is alphanumeric (0-9a-z). You can include uppercase, special characters, or non-Latin scripts.
Yes, use the registry system to register custom backbones:
from flashocr.registry import BACKBONES
@BACKBONES.register("my_backbone")
class MyBackbone(nn.Module):
...flashocr export --model best.pth --output model.onnx --simplifyThen convert ONNX to TFLite, CoreML, or NCNN as needed.
- CTC: Faster inference, simpler training, best for fixed-charset recognition
- Attention: Handles variable-length outputs better, more accurate for complex scripts
- standard: Classic low-rank adapters
- dora: Better generalization via weight decomposition
- adalora: Automatically adjusts rank per layer
- lora_plus: Different LR for A and B matrices
A TSV file with tab-separated image paths and text labels:
image_001.jpg hello
image_002.jpg world