This Python project is designed to analyze invoices by extracting information from PDF files, emails, and text files. It processes the data to calculate the total amount in euros.
- PDF to Text Conversion: Utilizes
Pdf2Txt.pyto convert PDF files to text format for easy analysis. - Email Parsing: Extracts relevant invoice data from emails using
ExtractMails.py. - Text Analysis: Analyzes text files containing invoice information with
AnalyzeTxtFINAL.py. - Total Calculation: Calculates the total amount in euros based on the extracted data.
-
Clone the Repository: Clone this repository to your local machine using:
git clone <repository_url> -
Install Dependencies: Ensure you have Python installed on your system. Install the required dependencies by running:
pip install -r requirements.txt -
Usage:
- Place your PDF files in the
Descargasdirectory. - Run the relevant Python scripts to process the invoices and generate the total amount in euros.
- Place your PDF files in the
This is the Input:
The script 'Pdf2Txt.py' converts the pdf to text so Python can analyze it and manipulate it.
Output:
- DescargasMail: Directory where PDFs files are located.
- OutputTxtsV2: Output directory for converted text files.
- AnalyzeTxtFINAL.py: Python script to analyze text files and calculate totals.
- ExtractMails.py: Python script to extract invoice information from emails.
- Pdf2Txt.py: Python script to convert PDF files to text format.
- Some items needed manual modification so the total woks well.
- Review
OutputTxtsV2for insights into the analyzed data and file descriptions.
This project is licensed under the MIT License.
Feel free to adjust the content and add more details as needed!

