GitHub - arif05khan/Document-Capture-OCR-Text-Scanner: OCR-Text-Scanner is an application for extracting data from images of Indian passports and driving licenses. Built with a Flask backend using Tesseract for OCR and regex for data extraction, and a React frontend with Tailwind CSS, it provides a streamlined, user-friendly interface for retrieving key document information accurately.

#Document-Capture-OCR-Text-Scanner

Document-Capture-OCR-Text-Scanner is a robust tool designed to extract text from images, focusing specifically on Indian passports and driving licenses. With an intuitive interface and seamless data extraction, it simplifies the process of OCR (Optical Character Recognition) for Indian government-issued documents.

Key Features: Image-to-Text Extraction: Efficiently extracts data from images of Indian passports and driving licenses using OCR technology. User-Friendly Interface: Built with React and styled using Tailwind CSS, ensuring a smooth and responsive user experience.

Technology Stack:

Backend: Flask: Handles the backend API for managing OCR operations. Tesseract: Powers the OCR engine for text extraction from images. Regex: Ensures accurate extraction and parsing of specific information from the OCR output.

Frontend: React: Used for building the user interface, ensuring a dynamic and responsive experience. Tailwind CSS: Provides styling for a clean, modern look and feel.

Installation Guide: To get started with Document-Capture-OCR-Text-Scanner, follow the steps below:

Clone the Repository Clone the repository to your local machine: git clone <repository_url> cd Document-Capture-OCR-Text-Scanner
Install Backend Dependencies Navigate to the backend directory and install the necessary packages: cd backend pip install -r requirements.txt
Start the Backend Server Launch the Flask server: python server.py
Install Frontend Dependencies Go to the frontend directory and install the required packages: cd frontend npm install
Run the Frontend Application Start the React app: npm start
Open the Application Access the application by visiting http://localhost:3000 in your web browser.

Screenshots: UI - Upload Page A simple UI to upload the document image for OCR processing.

Extracted Data Displays extracted details such as name, document number, and dates after uploading.

Data Validation Allows users to review and confirm the parsed details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

arif05khan/Document-Capture-OCR-Text-Scanner

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages