BizCardX is a Python-based tool designed to simplify the extraction of data from business cards using Optical Character Recognition (OCR). This project streamlines the process of digitizing business card information, making it efficient and user-friendly.
-
OCR Extraction: Utilizes Optical Character Recognition to extract text data from business card images.
-
Output Organization: Provides a structured and organized output format for the extracted information.
-
Image Format Support: Supports various image formats commonly used for business cards.
Ensure you have the following dependencies installed before using BizCardX:
- Python 3.x
- Tesseract OCR (Ensure it's in your system's PATH)
-
Clone the repository:
git clone https://github.com/your-username/BizCardX.git cd BizCardX
-
Install required Python packages:
pip install -r requirements.txt
-
Place business card images in the
images/
directory. -
Run the following command:
python extract_data.py
-
Extracted data will be saved in
output.csv
in the project directory.
Customize BizCardX by modifying settings in the config.py
file. Adjust OCR settings, output format, or other parameters to suit your requirements.
We welcome contributions! If you'd like to contribute to BizCardX, please follow the contribution guidelines.
This project is licensed under the MIT License.
Thanks to the Tesseract OCR project for providing the OCR engine.
For questions or issues, please open an issue.