This repo contains the demo code for OCR on Signboard images from the ArT dataset using PaddleOCR' EAST + CRNN, MMOCR's ASTER and CRAFT. Google Colab is used for this demonstration.
- Setup on Google Colab or ➡️Click here to kickstart this demo directly at Google Colab
- HMean Calculation for Text Detection & Spotting Tasks
- Accuracy & Normalized Edit Distance Calculation for Text Recognition Task
- Data Exploration for the ArT dataset
- Text Detection, Recognition & Spotting on the ArT dataset
Do note that you need to run the ArT notebook before going into Signboard_OCR for the first time.
- You will learn how the HMean is calculated for the text detection and text spotting tasks with detailed examples.
- You will learn how the accuracy, number of correctly recognized words, and normalized edit distance are calculated for the text recognition task.
- You will learn how to make use of the ArT, a commonly used scene text dataset. We will compare the results of using pre-trained regular and irregular scene text methods using the evaluation methods mentioned above. Then, we can analyze the performance gap between the regular and irregular scene text methods on signboard images.