This project is to design a piece of software to record information from scanned forms for Prevention Point. This project utilizes openCV and the Microsoft Azure API. Please contact the project lead on slack on how to get access the API key.
There are several different types of forms that need to be scanned. The form intake interface should allow the user to select which type of form is being scanned.
In order for the bounding box section of the project to work. The images for each form need to be in the same orientation and scale. The current thinking is to use the prevention point logo on each page to orient and scale.
For each of the forms bounding boxes need to be created around text fields and marks. The collected data should be deposited in a csv that corresponds to the variables in the variable dictionary for that form.
- Text fields use the Microsoft Azure Read API
- Mark fields use openCV to determine if a box is marked.
The client (Prevention Point) has flagged "Name" and "UniqueID" as the fields that need the most accurate information. The Microsoft Azure Read API returns a certainty number. If the certainty number is below a certain threshold. The software should display the form and a field that allows for manual entry and correction.