This tool allows you to download images and extract overlay text from a public Scribd document and generate a combined PDF file with both visual and textual content.
- ✅ Downloads all pages from a Scribd document
- 🖼️ Saves them as images
- 🔤 Uses OCR (Tesseract) to extract embedded text
- 📄 Combines both into a searchable PDF
- 🖥️ Simple and intuitive PyQt6-based GUI
- 🧼 Cleans up temporary files automatically
Install all dependencies using:
pip install -r requirements.txt