GitHub - Har23s/Document-Text-Extraction

This script uses the DeepSeek-OCR modell from Hugging Face to convert document images into readable text. It runs entirely on CPU by disabling CUDA and mixed-precision operations for compatibility.

The script first scans the images/ folder for .jpg, .jpeg, or .png files and prompts the user to select one. It then loads the DeepSeek-OCR model and tokenizer, prepares a text-based prompt, and performs inference to extract text from the selected image.

The recognized text and visualization results are saved in the output/ directory. The code ensures stable performance on any system by forcing all computations to use float32.

In short, this script provides a step-by-step, CPU and GPU-compatible OCR pipeline that converts document images into readable text using the DeepSeek-AI model.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
images		images
output		output
1.py		1.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages