In this tutorial we will perform line-level text detection using Surya toolkit and OpenVINO.
Model used for line-level text detection based on Segformer. It has the following features:
- It is specialized for document OCR. It will likely not work on photos or other images.
- It is for printed text, not handwriting.
- The model has trained itself to ignore advertisements.
- Languages with very different character sets may not work well.
- Fetch test image.
- Run PyTorch inference.
- Convert model to OpenVINO Intermediate Representation (IR) format.
- Run OpenVINO model.
- Apply post-training quantization using NNCF
- Prepare dataset
- Quantize model
- Run quantized OpenVINO model
- Interactive inference.
This is a self-contained example that relies solely on its own code.
We recommend running the notebook in a virtual environment. You only need a Jupyter server to start.
For details, please refer to Installation Guide.