PDF Form AutoFill System

An intelligent PDF form auto-fill system that combines OCR, computer vision, and an LLM to detect form fields automatically and fill them with the data you provide.

Features

Box Detection: Uses OpenCV to detect input fields in PDF forms
OCR Text Extraction: Extracts field labels using the OCR.space API
LLM Mapping: Uses Azure OpenAI to intelligently map JSON data to form fields
Accurate Placement: Fixed coordinate system for precise text placement
Multiple Field Types: Supports letter-by-letter filling (one character per box) and whole-text filling
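
The two fill modes can be illustrated with a small sketch. This is not the project's code: the function name plan_fill, the (x, y, width, height) box format, and the 4-pixel padding are all assumptions made for the example. It only shows the idea of placing the whole value in a single box versus one character per box.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in image pixels (assumed format)


def plan_fill(value: str, boxes: List[Box]) -> List[Tuple[str, int, int]]:
    """Return (text, x, y) placements for one field (hypothetical helper).

    One box: the whole value is placed near the box's left edge.
    Several boxes: one character per box, left to right; characters
    beyond the number of boxes are dropped.
    """
    if not boxes:
        return []
    if len(boxes) == 1:
        x, y, w, h = boxes[0]
        # 4 px left padding is an arbitrary choice for this sketch
        return [(value, x + 4, y + h // 2)]
    ordered = sorted(boxes, key=lambda b: b[0])  # left-to-right order
    placements = []
    for ch, (x, y, w, h) in zip(value, ordered):
        # anchor each character at the center of its box
        placements.append((ch, x + w // 2, y + h // 2))
    return placements


print(plan_fill("AB", [(40, 10, 20, 20), (10, 10, 20, 20)]))
print(plan_fill("John", [(0, 0, 100, 20)]))
```

The returned coordinates are anchor points only; how the text is aligned around them depends on the drawing call used afterwards.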

Setup

Install Dependencies:

pip install opencv-python pdf2image pytesseract PyMuPDF requests pillow

Configure API Keys:
- Copy config_template.py to config.py
- Add your API keys:
  - OCR.space API key
  - Azure OpenAI API key and endpoint

Run the System:

python llm_autofill_mapper.py

How It Works

PDF to Image: Converts the PDF to an image for processing
Field Detection: Detects input boxes using OpenCV contour detection
OCR Extraction: Extracts text labels using the OCR.space API
LLM Mapping: Uses Azure OpenAI to map JSON data to the detected fields
Text Placement: Writes text on the image at the detected field coordinates
PDF Generation: Converts the filled image back to PDF
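
As a rough illustration of the field detection step, the sketch below shows one way rectangles found by contour detection could be grouped into fields. It assumes the rectangles are already available as (x, y, width, height) tuples, the shape returned by OpenCV's cv2.boundingRect. The function name group_character_boxes and the max_gap and y_tol thresholds are hypothetical and not taken from the project. Boxes that sit side by side on the same row form one letter-by-letter field; an isolated box is a whole-text field.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height), as from cv2.boundingRect


def group_character_boxes(
    boxes: List[Box], max_gap: int = 6, y_tol: int = 5
) -> List[List[Box]]:
    """Group boxes that lie on the same row and are horizontally adjacent.

    max_gap: largest horizontal distance (px) between neighbouring boxes.
    y_tol:   largest vertical offset (px) still treated as the same row.
    Both thresholds are assumptions chosen for this sketch.
    """
    groups: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b[0]):  # scan left to right
        x, y, w, h = box
        for group in groups:
            lx, ly, lw, lh = group[-1]
            same_row = abs(y - ly) <= y_tol
            gap = x - (lx + lw)  # distance from the previous box's right edge
            if same_row and abs(gap) <= max_gap:
                group.append(box)
                break
        else:
            groups.append([box])  # no neighbour found: start a new field
    # order fields top to bottom, then left to right
    return sorted(groups, key=lambda g: (g[0][1], g[0][0]))


detected = [(54, 10, 20, 20), (10, 10, 20, 20), (32, 11, 20, 20), (10, 60, 200, 20)]
for field in group_character_boxes(detected):
    kind = "letter-by-letter" if len(field) > 1 else "whole text"
    print(kind, field)
```

Real forms would likely need thresholds tuned to the rendering DPI, since box spacing in pixels scales with resolution.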

Output

output/autofilled_form.pdf - Final filled PDF
output/filled_image.png - Filled image for verification
output/field_mappings.json - Complete mapping details
output/detected_all_field_types.png - Visualization of detected fields

Configuration

Edit config.py to customize:

API keys and endpoints
File paths
Processing settings (DPI, font size, etc.)
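
For orientation, a config.py might look roughly like the sketch below. Every name and value here is a hypothetical placeholder (including the input file name and the DPI and font size defaults); the actual setting names are whatever config_template.py defines. Reading keys from environment variables is one option that keeps secrets out of the file.

```python
import os

# All names and values below are hypothetical placeholders;
# use the setting names defined in config_template.py.

# API keys and endpoints (read from the environment so they are not hard-coded)
OCR_SPACE_API_KEY = os.environ.get("OCR_SPACE_API_KEY", "")
AZURE_OPENAI_API_KEY = os.environ.get("AZURE_OPENAI_API_KEY", "")
AZURE_OPENAI_ENDPOINT = os.environ.get("AZURE_OPENAI_ENDPOINT", "")

# File paths (the input file name is an assumed example)
INPUT_PDF = os.path.join("input", "form.pdf")
OUTPUT_DIR = "output"

# Processing settings (assumed example values)
DPI = 300
FONT_SIZE = 12


def missing_keys():
    """Return the names of required settings that are still empty."""
    required = {
        "OCR_SPACE_API_KEY": OCR_SPACE_API_KEY,
        "AZURE_OPENAI_API_KEY": AZURE_OPENAI_API_KEY,
        "AZURE_OPENAI_ENDPOINT": AZURE_OPENAI_ENDPOINT,
    }
    return [name for name, value in required.items() if not value]


if missing_keys():
    print("Missing settings:", ", ".join(missing_keys()))
```

Checking for empty keys at startup gives a clearer error than a failed API call later in the pipeline.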

Requirements

Python 3.7+
OpenCV
PyMuPDF
OCR.space API key
Azure OpenAI API key
