Skip to content

OCR-Text-Scanner is an application for extracting data from images of Indian passports and driving licenses. Built with a Flask backend using Tesseract for OCR and regex for data extraction, and a React frontend with Tailwind CSS, it provides a streamlined, user-friendly interface for retrieving key document information accurately.

Notifications You must be signed in to change notification settings

arif05khan/Document-Capture-OCR-Text-Scanner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

#Document-Capture-OCR-Text-Scanner

Document-Capture-OCR-Text-Scanner is a robust tool designed to extract text from images, focusing specifically on Indian passports and driving licenses. With an intuitive interface and seamless data extraction, it simplifies the process of OCR (Optical Character Recognition) for Indian government-issued documents.

Key Features: Image-to-Text Extraction: Efficiently extracts data from images of Indian passports and driving licenses using OCR technology. User-Friendly Interface: Built with React and styled using Tailwind CSS, ensuring a smooth and responsive user experience.

Technology Stack:

Backend: Flask: Handles the backend API for managing OCR operations. Tesseract: Powers the OCR engine for text extraction from images. Regex: Ensures accurate extraction and parsing of specific information from the OCR output.

Frontend: React: Used for building the user interface, ensuring a dynamic and responsive experience. Tailwind CSS: Provides styling for a clean, modern look and feel.

Installation Guide: To get started with Document-Capture-OCR-Text-Scanner, follow the steps below:

  1. Clone the Repository Clone the repository to your local machine: git clone <repository_url> cd Document-Capture-OCR-Text-Scanner

  2. Install Backend Dependencies Navigate to the backend directory and install the necessary packages: cd backend pip install -r requirements.txt

  3. Start the Backend Server Launch the Flask server: python server.py

  4. Install Frontend Dependencies Go to the frontend directory and install the required packages: cd frontend npm install

  5. Run the Frontend Application Start the React app: npm start

  6. Open the Application Access the application by visiting http://localhost:3000 in your web browser.

Screenshots: UI - Upload Page A simple UI to upload the document image for OCR processing.

Extracted Data Displays extracted details such as name, document number, and dates after uploading.

Data Validation Allows users to review and confirm the parsed details.

About

OCR-Text-Scanner is an application for extracting data from images of Indian passports and driving licenses. Built with a Flask backend using Tesseract for OCR and regex for data extraction, and a React frontend with Tailwind CSS, it provides a streamlined, user-friendly interface for retrieving key document information accurately.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •