#Document-Capture-OCR-Text-Scanner
Document-Capture-OCR-Text-Scanner is a robust tool designed to extract text from images, focusing specifically on Indian passports and driving licenses. With an intuitive interface and seamless data extraction, it simplifies the process of OCR (Optical Character Recognition) for Indian government-issued documents.
Key Features: Image-to-Text Extraction: Efficiently extracts data from images of Indian passports and driving licenses using OCR technology. User-Friendly Interface: Built with React and styled using Tailwind CSS, ensuring a smooth and responsive user experience.
Technology Stack:
Backend: Flask: Handles the backend API for managing OCR operations. Tesseract: Powers the OCR engine for text extraction from images. Regex: Ensures accurate extraction and parsing of specific information from the OCR output.
Frontend: React: Used for building the user interface, ensuring a dynamic and responsive experience. Tailwind CSS: Provides styling for a clean, modern look and feel.
Installation Guide: To get started with Document-Capture-OCR-Text-Scanner, follow the steps below:
-
Clone the Repository Clone the repository to your local machine: git clone <repository_url> cd Document-Capture-OCR-Text-Scanner
-
Install Backend Dependencies Navigate to the backend directory and install the necessary packages: cd backend pip install -r requirements.txt
-
Start the Backend Server Launch the Flask server: python server.py
-
Install Frontend Dependencies Go to the frontend directory and install the required packages: cd frontend npm install
-
Run the Frontend Application Start the React app: npm start
-
Open the Application Access the application by visiting http://localhost:3000 in your web browser.
Screenshots: UI - Upload Page A simple UI to upload the document image for OCR processing.
Extracted Data Displays extracted details such as name, document number, and dates after uploading.
Data Validation Allows users to review and confirm the parsed details.