Skip to content

Pradyumna744/Pdf-to-Excel-Using-Python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

PDF-TO-EXCEL-PYTHON

PDF to Excel Converter

This repository contains a Python application for converting PDF files to Excel files. The application uses various libraries to handle PDF extraction, OCR, and GUI functionalities.

Installation

To set up the environment for this project, you need to install the following Python libraries:

  1. pandas

    • Description: A powerful data manipulation and analysis library.
    • Installation:
      pip install pandas
  2. PyPDF2

    • Description: A pure Python PDF toolkit for splitting, merging, cropping, and transforming PDF pages.
    • Installation:
      pip install PyPDF2
  3. pdf2image

    • Description: A Python wrapper for pdftoppm and pdfinfo to convert PDF pages to images.
    • Installation:
      pip install pdf2image
  4. pytesseract

    • Description: An OCR tool for Python to recognize and "read" the text embedded in images.
    • Installation:
      pip install pytesseract
  5. Pillow

    • Description: The Python Imaging Library (PIL) fork that adds some user-friendly features.
    • Installation:
      pip install Pillow
  6. pdfplumber

    • Description: A Python library for extracting content from PDFs, built on top of pdfminer.six.
    • Installation:
      pip install pdfplumber
  7. openpyxl

    • Description: A Python library to read/write Excel 2010 xlsx/xlsm/xltx/xltm files.
    • Installation:
      pip install openpyxl

Built-in Modules

The following built-in Python modules are also used in the project:

  1. os

    • Description: Provides a way of using operating system dependent functionality like reading or writing to the file system.
  2. re

    • Description: Provides support for regular expressions in Python.
  3. tkinter

    • Description: The standard GUI library for Python.
  4. pathlib

    • Description: Provides an object-oriented interface for filesystem paths.

Usage

To use the application, follow these steps:

  1. Clone the repository:
    git clone https://github.com/Pradyumna744/PDF-TO-EXCEL-PYTHON-FOR-ADVICE-LETTERS.git

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages