PoE Textract is an Amazon Textract Tabular Formatted Parser Application for UoT Price of Empire project. The application can let users upload pdf or image files and scan the tabular data into csv files.
- GUI Python Tkinter
- Data All data is saved in Amazon S3 and Amazon DynamoDB
- Processing Amazon Textract
This app depends on AWS Textract service. An AWS account is needed. The default location of AWS_SHARED_CREDENTIALS_FILE
and AWS_CONFIG_FILE
are at '~/.aws/credentials'
and '~/.aws/config'
. Use
$ aws configure
to set up your credentials and default region. For more details about AWS configuration, check Boto3 1.28.5 documentation.
This project mainly depends on Tkinter
$ pip install -r requirements.txt
- Run the app:
$ python file_uploader.py
- The FileUploader window will open.
- Click the "Upload File" button to select a PDF or image file.
- Once the file is uploaded, the file name will be displayed.
- Click the "Download File" button to save the uploaded file to a desired location.