GitHub - Evaan2001/Passport_OCR_App: My Webapp to extract data from a passport photo

Here's a web-app where users can upload a photo of a passport and extract the essential details!

I was freelancing for a security company in Austin, TX, and was tasked with writing code for an embedded device to extract someone's information from a photo of their passport. I took that software and created a web-app using Streamlit so that others can also use my work. Click here to use my app!

What Information Can We Get?

Given a photo of a passport, my software can retrieve the following:

Passport Issuing Country (It's almost always the country of citizenship of the passport holder)
Full Name
Passport Number
Date of Birth
Sex

What's The Process?

Every passport has a Machine Readable Zone, or MRZ for short. The MRZ is a standardized format for encoding essential passport holder information so that it can be easily read by a machine. It consists of two or three lines of text at the bottom of the personal information page of a passport. Here's a good article on how we can decode an MRZ.
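To make the decoding step concrete, here is a minimal sketch of parsing the two-line, 44-character MRZ layout (TD3 in ICAO Doc 9303) used on passport booklets. It is not the code in functions.py; the function names and the returned dictionary keys are my own assumptions, and the sample lines are the well-known ICAO specimen (fictional country code UTO). It pulls out the five fields listed earlier and verifies two of the check digits.

```python
from typing import Dict

# ICAO 9303 check-digit weights repeat in the order 7, 3, 1.
_WEIGHTS = (7, 3, 1)


def mrz_check_digit(field: str) -> int:
    """Return the ICAO 9303 check digit for one MRZ field."""
    total = 0
    for i, ch in enumerate(field):
        if "0" <= ch <= "9":
            value = int(ch)
        elif "A" <= ch <= "Z":
            value = ord(ch) - ord("A") + 10  # A=10 ... Z=35
        elif ch == "<":
            value = 0  # filler character counts as zero
        else:
            raise ValueError(f"invalid MRZ character: {ch!r}")
        total += value * _WEIGHTS[i % 3]
    return total % 10


def parse_td3_mrz(line1: str, line2: str) -> Dict[str, object]:
    """Parse a two-line TD3 MRZ (hypothetical helper, TD3 layout only)."""
    if len(line1) != 44 or len(line2) != 44:
        raise ValueError("a TD3 MRZ has two lines of 44 characters")
    # Name field: surname and given names are separated by '<<'.
    surname, _, given = line1[5:44].partition("<<")
    number, dob = line2[0:9], line2[13:19]
    return {
        "issuing_country": line1[2:5].replace("<", ""),
        "surname": surname.replace("<", " ").strip(),
        "given_names": given.replace("<", " ").strip(),
        "passport_number": number.replace("<", ""),
        "date_of_birth": dob,  # YYMMDD as printed in the MRZ
        "sex": line2[20],
        # Each of these fields is followed by its own check digit.
        "checks_ok": (
            line2[9] == str(mrz_check_digit(number))
            and line2[19] == str(mrz_check_digit(dob))
        ),
    }


# ICAO specimen MRZ, padded to 44 characters with the '<' filler.
SAMPLE_LINE1 = "P<UTOERIKSSON<<ANNA<MARIA".ljust(44, "<")
SAMPLE_LINE2 = "L898902C36UTO7408122F1204159ZE184226B<<<<<10"
```

Calling parse_td3_mrz(SAMPLE_LINE1, SAMPLE_LINE2) gives the issuing country, name, passport number, date of birth and sex, plus a checks_ok flag. A failed check digit is a cheap signal that the OCR misread a character.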

PyImageSearch has a good article on how we can isolate the MRZ using traditional Computer Vision methods, so no fancy ML models. The author then suggests using HP/Google's Tesseract OCR engine to OCR the identified MRZ area (OCR simply means getting all the words/characters in an image). However, PyImageSearch's MRZ-identifying code proved unreliable in my tests. Additionally, Tesseract often made errors while OCRing characters in the passport font.

I then ran into PassportEye, a Python library and a command-line tool that claims to be able to extract the relevant details from a photo of a passport. It is also rotation-invariant, so it works for images rotated by 90°, 180°, or 270°. However, I found that the author's algorithm works well only if we have a close-up photo of a passport. Anything in the background triggers a lot of errors. Plus, it uses the Tesseract OCR engine which, as discussed above, is not the best for the passport font.

I then discovered the Free OCR API by ocr.space. Its OCR Engine 2 works really well for the passport font. However, my testing revealed that the OCR loses accuracy when working on a close-up shot of a passport. So, I decided to combine this with the PassportEye library to get the following algorithm:

Make the API call to ocr.space with the uploaded passport photo
Decode the OCR'ed text
If decoding is proving troublesome or the OCR seems inaccurate, use the PassportEye Library
Show output
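The four steps above can be sketched roughly as follows. This is an illustration, not the code in functions.py: every function name here is hypothetical, the ocr.space request is based on my reading of its public API documentation (endpoint, apikey and OCREngine parameters, ParsedResults in the response), and the PassportEye wrapper assumes its read_mrz function, which returns None when no MRZ is found. The third-party imports are deferred so the control flow can be read and tried without them installed.

```python
from typing import Callable, Optional, Tuple


def ocr_space_text(image_path: str, api_key: str) -> str:
    """Step 1: send the photo to ocr.space using OCR Engine 2 (assumed API shape)."""
    import requests  # third-party, imported lazily

    with open(image_path, "rb") as fh:
        response = requests.post(
            "https://api.ocr.space/parse/image",
            files={"file": fh},
            data={"apikey": api_key, "OCREngine": 2},
            timeout=60,
        )
    response.raise_for_status()
    payload = response.json()
    if payload.get("IsErroredOnProcessing"):
        raise RuntimeError(f"ocr.space error: {payload.get('ErrorMessage')}")
    return payload["ParsedResults"][0]["ParsedText"]


def find_mrz_lines(ocr_text: str) -> Optional[Tuple[str, ...]]:
    """Step 2 helper: pick the last two 44-character lines containing '<'."""
    candidates = []
    for raw in ocr_text.splitlines():
        line = raw.replace(" ", "").upper()
        if len(line) == 44 and "<" in line:
            candidates.append(line)
    return tuple(candidates[-2:]) if len(candidates) >= 2 else None


def passporteye_fields(image_path: str) -> Optional[dict]:
    """Step 3: fall back to PassportEye when the OCR route fails."""
    from passporteye import read_mrz  # third-party, imported lazily

    mrz = read_mrz(image_path)
    return None if mrz is None else mrz.to_dict()


def extract_fields(
    image_path: str,
    ocr: Callable[[str], str],
    decode: Callable[[str], Optional[dict]],
    fallback: Callable[[str], Optional[dict]],
) -> Optional[dict]:
    """Run OCR and decode; use the fallback if either step fails."""
    try:
        fields = decode(ocr(image_path))
    except Exception:
        # A sketch-level catch-all: network errors and decode errors
        # are both treated as "try the fallback".
        fields = None
    if fields is None:
        fields = fallback(image_path)
    return fields  # Step 4: the caller shows this to the user
```

Passing the OCR, decode, and fallback steps in as callables keeps the fallback logic independent of either service, so each piece can be swapped or tested on its own.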

What's Streamlit?

Streamlit is an open-source Python library that allows you to create web applications for machine learning and data science projects quickly and easily. It is designed to make the process of building interactive web applications as simple as writing Python scripts. There are 2 parts to using Streamlit –

The super simple Front-End Support which makes designing the UI of the web-app a breeze
Hosting the web-app on the free Streamlit Cloud so that people can use your app at no cost

Limitations

The free version of the ocr.space API is restricted to 500 calls per month. As such, I'm currently limiting usage of my app to 75 uploads in a week to provide equal access to everybody.
Because I don't have any control over the traffic present on ocr.space's servers or on the Streamlit Cloud, the process can sometimes take longer than 30 seconds, though the average processing time is about 15 seconds.
Accuracy depends on the picture quality and the correctness of the OCR'ed text, so my software is not 100% accurate. The current accuracy is 95%.
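The weekly upload cap from the first limitation could be enforced with a small on-disk counter. The sketch below is one way to do it, assuming a JSON file of upload timestamps and a rolling 7-day window. The actual format of images_processed.txt is not described here, so the file layout and the function name are my own assumptions.

```python
import json
import time
from pathlib import Path
from typing import Optional

WEEK_SECONDS = 7 * 24 * 3600
WEEKLY_LIMIT = 75  # uploads allowed per rolling week


def try_record_upload(
    counter_path: str,
    now: Optional[float] = None,
    limit: int = WEEKLY_LIMIT,
) -> bool:
    """Record one upload if the weekly quota allows it.

    Returns True when the upload is accepted and False when the
    quota for the last 7 days is already used up.
    """
    now = time.time() if now is None else now
    path = Path(counter_path)
    try:
        stamps = json.loads(path.read_text())
    except (FileNotFoundError, ValueError):
        stamps = []  # missing or unreadable file: start fresh
    # Keep only the uploads that happened within the last week.
    stamps = [t for t in stamps if now - t < WEEK_SECONDS]
    accepted = len(stamps) < limit
    if accepted:
        stamps.append(now)
    path.write_text(json.dumps(stamps))
    return accepted
```

The app would call this before making the ocr.space request and show a "try again later" message when it returns False. Note that this simple version is not safe against two uploads arriving at exactly the same time.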

Files

Here's what you'll find –

Demo_Images – A list of images I used to test my algorithm
app-venv – The virtual environment I used to develop this app
functions.py – A file that has a bunch of helper functions I used frequently (including my string processing functions for OCR'ed text)
images_processed.txt – A text file that records how many images my web-app has processed in the last week
packages.txt – A list of Linux dependencies outside the Python environment, installed by Streamlit via apt-get for app deployment
requirements.txt – A list of Python packages installed by Streamlit for app deployment
streamlit_app.py – The Python script that manages the front-end of the app and calls the relevant algorithm/function contained in functions.py

You can ignore the other files
