PDF-Text-Extractor

This GUI application lets you extract text from PDF files. The project is built using the PyPDF2 library for extracting text from PDFs and the tkinter library for creating the GUI.

Getting Started

To run the project, you will need to have Python and pip installed on your system.

Installation

  1. Clone or download the repository to your local machine.

    git clone https://github.com/SamAddy/PDF-Extract-Text.git
    
  2. Enter the working directory.

    cd PDF-Extract-Text
    
  3. Use pip to install the required libraries.

    pip install -r requirements.txt
    

Usage

  1. Run the app using the following command:

    python app.py
    
  2. A GUI window will appear, with a button to select the PDF file you want to extract text from.

  3. Once you have selected the file, the text will be extracted and displayed in the text box.

  4. You can also save the text to a file by clicking the 'Save' button.
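
The steps above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of app.py: the function names (`join_pages`, `extract_text`, `main`), the button labels and the widget layout are all assumptions, and it assumes a PyPDF2 version that provides `PdfReader` and `extract_text()` (2.x or 3.x).

```python
def join_pages(page_texts):
    """Join per-page strings into one block, separated by blank lines."""
    return "\n\n".join(text.strip() for text in page_texts)


def extract_text(pdf_path):
    """Return the text of every page in the PDF at pdf_path."""
    # Imported here so join_pages() can be used without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumes PyPDF2 2.x or 3.x

    reader = PdfReader(pdf_path)
    # extract_text() can yield nothing for image-only pages; fall back to "".
    return join_pages(page.extract_text() or "" for page in reader.pages)


def main():
    """Build a hypothetical window: select button, text box, save button."""
    import tkinter as tk
    from tkinter import filedialog, scrolledtext

    root = tk.Tk()
    root.title("PDF Text Extractor")
    text_box = scrolledtext.ScrolledText(root, wrap=tk.WORD, width=80, height=25)

    def select_pdf():
        path = filedialog.askopenfilename(filetypes=[("PDF files", "*.pdf")])
        if not path:  # dialog was cancelled
            return
        text_box.delete("1.0", tk.END)
        text_box.insert(tk.END, extract_text(path))

    def save_text():
        path = filedialog.asksaveasfilename(
            defaultextension=".txt", filetypes=[("Text files", "*.txt")]
        )
        if not path:  # dialog was cancelled
            return
        with open(path, "w", encoding="utf-8") as handle:
            # "end-1c" drops the trailing newline tkinter appends to the widget.
            handle.write(text_box.get("1.0", "end-1c"))

    tk.Button(root, text="Select PDF", command=select_pdf).pack(pady=5)
    text_box.pack(fill=tk.BOTH, expand=True)
    tk.Button(root, text="Save", command=save_text).pack(pady=5)
    root.mainloop()
```

Calling `main()` opens the window. Keeping `extract_text` separate from the GUI code means the extraction step can be reused or tested without a display.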

image1 image2

Note

Please keep in mind that not all PDFs are created equal: some store their text as scanned images or in other formats that PyPDF2 cannot extract.
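
One way to notice this situation is to check which pages come back with no text at all. The sketch below is an illustration only and not part of the project: `read_page_texts` and `pages_without_text` are hypothetical helper names, and the code assumes a PyPDF2 version that provides `PdfReader` (2.x or 3.x).

```python
def read_page_texts(pdf_path):
    """Return a list with the extracted text of each page."""
    # Imported here so pages_without_text() works without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumes PyPDF2 2.x or 3.x

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]


def pages_without_text(page_texts):
    """Return the 1-based numbers of pages whose text is empty or whitespace."""
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if not (text or "").strip()
    ]
```

For example, `pages_without_text(read_page_texts("scan.pdf"))` (with a hypothetical file name) lists the pages that likely hold only images. An empty result does not guarantee that the extracted text is complete or correctly ordered.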

Built With

  • Python - The programming language used.
  • PyPDF2 - A library for extracting text from PDF files.
  • Tkinter - A library for creating GUIs in Python.

Contributing

Contributions are absolutely welcome. If you have an idea for an improvement, please open an issue or submit a pull request.

Acknowledgement