Screenshot & OCR Extractor

A Python-based OCR tool that extracts text from images and saves it to Google Docs automatically.
Supports Google Drive sharing, text formatting, and document organization.

Features

Extracts text from screenshots & images using Tesseract OCR
Automatically saves text to Google Docs
Append text to existing docs or create new ones
Share documents via email (Google Drive API)
Formatted text output with bold headings & structured layout
Handles API authentication & permissions efficiently

How This Project Was Built

Idea & Concept

I wanted to automate text extraction from screenshots and make the process seamless.
After exploring Notion, PDFs, and Google Docs, I chose Google Docs for better API support.

Development Process

Setting Up the Environment

Installed Tesseract OCR, configured paths, and handled dependencies.
Set up Google Cloud API credentials for authentication.

OCR & Image Processing

Used pytesseract to extract text from images.
Cleaned & formatted the extracted text.

Google Docs Integration

Automated document creation & updating via the Google Docs API.
Implemented document sharing via Google Drive API (Users can add their email).

Debugging & Fixes

Solved Windows permission errors for accessing credentials.json.
Fixed Google Drive API permission issues when sharing docs.

AI-Assisted Development

I leveraged AI for guidance, but I actively:

Debugged errors manually
Decided which features to implement
Researched & understood Google API workflows
Customized the formatting & user interaction flow

What I Learned

How OCR works in Python
How to authenticate & interact with Google Docs API
How to handle API-based document sharing
How to troubleshoot API permission issues
How AI can assist in development while still requiring critical thinking

Getting Started

Install Dependencies

pip install pytesseract pillow google-auth google-auth-oauthlib google-auth-httplib2 google-api-python-client

Install Tesseract OCR

Windows:

Download & install Tesseract OCR.
Add Tesseract to your system PATH.
Find the installation path (e.g., C:\Program Files\Tesseract-OCR\tesseract.exe).

Linux/macOS:

sudo apt install tesseract-ocr  # Ubuntu/Debian
brew install tesseract          # macOS

Run the Script

python screenshot_ocr.py

Select an Image

Choose an image file containing text.
OCR will extract the text and save it to Google Docs.

Enter Your Email (Optional)

If you want document access, enter your Google email.
Otherwise, the document will remain private.

Future Enhancements

AI-powered text correction (fix OCR errors using GPT)
Export to multiple platforms (Notion, Trello, PDFs)
Auto-detect text language & translate it
Hotkey-based screenshot capture & auto-processing

Credits & Acknowledgment

AI-Assisted Development:
I used ChatGPT as an assistant for debugging, research, and structuring API calls,
but every decision, problem-solving step, and customization was done manually.

Links

GitHub Repository: Coderanger08

Google Docs API Guide: Google Docs API Docs

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
config.py		config.py
file_manager.py		file_manager.py
google_docs_api.py		google_docs_api.py
main.py		main.py
ocr_extractor.py		ocr_extractor.py
requirements.txt		requirements.txt
text_processing.py		text_processing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Screenshot & OCR Extractor

Features

How This Project Was Built

Idea & Concept

Development Process

Setting Up the Environment

OCR & Image Processing

Google Docs Integration

Debugging & Fixes

AI-Assisted Development

What I Learned

Getting Started

Install Dependencies

Install Tesseract OCR

Windows:

Linux/macOS:

Run the Script

Select an Image

Enter Your Email (Optional)

Future Enhancements

Credits & Acknowledgment

Links

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Coderanger08/Textractor

Folders and files

Latest commit

History

Repository files navigation

Screenshot & OCR Extractor

Features

How This Project Was Built

Idea & Concept

Development Process

Setting Up the Environment

OCR & Image Processing

Google Docs Integration

Debugging & Fixes

AI-Assisted Development

What I Learned

Getting Started

Install Dependencies

Install Tesseract OCR

Windows:

Linux/macOS:

Run the Script

Select an Image

Enter Your Email (Optional)

Future Enhancements

Credits & Acknowledgment

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages