A Python tool to extract and search Hindi/Marathi names from voter-list PDFs using OCR.
- Extracts names from PDFs
- Supports OCR for scanned files
- Transliteration (Hindi → English)
- Fast caching system
pip install -r requirements.txtpython hindi_name_finder.py /path/to/folder
python hindi_name_finder.py /path/to/file.pdf- Python 3.x
- Tesseract OCR installed