A regional language and location detection application that combines Google's TensorFlow Object Detection API, a Convolutional Recurrent Neural Network (CRNN), and a 2-layer neural network for language classification.
The application runs three modules in sequence:
- Google's TensorFlow Object Detection API for text localisation
- CRNN for text recognition
- 2-layer neural network for language classification

Location is determined from the predicted text using Geotext.
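The sequential flow described above can be sketched roughly as follows. This is only an illustration and not the code in main.py: `run_pipeline` and the stage callables (`localise`, `recognise`, `classify`, `locate`) are hypothetical names, and each stage is passed in as a plain function so the sketch runs without TensorFlow or PyTorch installed.

```python
from typing import Any, Callable, Dict, List


def run_pipeline(
    image: Any,
    localise: Callable[[Any], List[Any]],
    recognise: Callable[[Any], str],
    classify: Callable[[str], str],
    locate: Callable[[str], List[str]],
) -> Dict[str, Any]:
    """Chain the stages: text regions -> strings -> language and places.

    All four callables are hypothetical stand-ins for the real models.
    """
    # Stage 1: the detector returns the text regions found in the image.
    regions = localise(image)
    # Stage 2: the recogniser turns each region into a string.
    words = [recognise(region) for region in regions]
    text = " ".join(words)
    # Stage 3: the classifier labels the language of the recognised text.
    language = classify(text)
    # The location lookup runs on the same recognised text.
    places = locate(text)
    return {"text": text, "language": language, "places": places}


if __name__ == "__main__":
    # Dummy stages so the sketch runs end to end without any model.
    result = run_pipeline(
        image="sign.jpg",
        localise=lambda img: ["crop-1", "crop-2"],
        recognise=lambda crop: {"crop-1": "Mumbai", "crop-2": "Station"}[crop],
        classify=lambda text: "English",
        locate=lambda text: [w for w in text.split() if w == "Mumbai"],
    )
    print(result)
```

With Geotext installed, the `locate` stage could plausibly be `lambda text: GeoText(text).cities`. The other stages would wrap the detector, the CRNN, and the classifier.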
- Edit the image path in main.py, then run the script from the Image-Location-Detector/ directory:
python3 main.py
- [Python 3.6]
- [TensorFlow 1.2]
- [PyTorch]
- [OpenCV] - either build from source or install with apt:
sudo apt-get install python-opencv
- [Geotext]
pip3 install geotext
- Pretrained CRNN weights (crnn.pth):
wget https://www.dropbox.com/s/l0vo83hmvv2aipn/crnn.pth
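Before running main.py, it may help to confirm that the dependencies listed above can be imported. The sketch below uses only the standard library. The import names (`tensorflow`, `torch`, `cv2`, `geotext`) are the usual ones for these packages but are an assumption here, and `check_modules` is a hypothetical helper, not part of this repository.

```python
import importlib.util
from typing import Dict, Iterable

# Import names assumed for the dependencies listed above.
REQUIRED_MODULES = ("tensorflow", "torch", "cv2", "geotext")


def check_modules(names: Iterable[str] = REQUIRED_MODULES) -> Dict[str, bool]:
    """Map each top-level module name to whether Python can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_modules().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

This only checks that each module can be located. It does not check versions, so a TensorFlow release other than 1.2 would still be reported as present.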
- Currently the ssd_mobilenet_v1_coco_11_06_2017 model is being used, so bounding box prediction for text detection will be poor. Try building your own custom dataset using labelImg.
- Train on a custom dataset for text detection
- Notes on generating custom dataset
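As a starting point for working with a custom dataset, labelImg saves annotations as Pascal VOC XML by default, with one file per image. The sketch below reads the bounding boxes from one such annotation using the standard library. `read_voc_boxes` is a hypothetical helper, and the sample annotation (file name, `text` label, coordinates) is made up for illustration.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

Box = Tuple[str, int, int, int, int]  # (label, xmin, ymin, xmax, ymax)

# Made-up annotation in the Pascal VOC layout that labelImg writes.
SAMPLE = """<annotation>
  <filename>sign.jpg</filename>
  <object>
    <name>text</name>
    <bndbox><xmin>48</xmin><ymin>30</ymin><xmax>210</xmax><ymax>85</ymax></bndbox>
  </object>
</annotation>"""


def read_voc_boxes(xml_text: str) -> List[Box]:
    """Extract labelled bounding boxes from one Pascal VOC annotation."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        label = obj.findtext("name")
        bnd = obj.find("bndbox")
        # Coordinates are parsed via float() in case a tool wrote decimals.
        coords = [
            int(float(bnd.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        ]
        boxes.append((label, *coords))
    return boxes


if __name__ == "__main__":
    print(read_voc_boxes(SAMPLE))  # [('text', 48, 30, 210, 85)]
```

Boxes in this form would still need converting to whatever input format the detection training scripts expect.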