Skip to content

Price-of-Empire-Main-Repository/PoE-Textract

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PoE-Textract

PoE Textract is an Amazon Textract Tabular Formatted Parser Application for UoT Price of Empire project. The application can let users upload pdf or image files and scan the tabular data into csv files.

Table of Contents

Stack

  • GUI Python Tkinter
  • Data All data is saved in Amazon S3 and Amazon DynamoDB
  • Processing Amazon Textract

Pipeline

Setup

Set up AWS configuration

This app depends on AWS Textract service. An AWS account is needed. The default location of AWS_SHARED_CREDENTIALS_FILE and AWS_CONFIG_FILE are at '~/.aws/credentials' and '~/.aws/config'. Use

$ aws configure

to set up your credentials and default region. For more details about AWS configuration, check Boto3 1.28.5 documentation.

Install the required dependencies

This project mainly depends on Tkinter

$ pip install -r requirements.txt

Usage

  1. Run the app:
    $ python file_uploader.py
  2. The FileUploader window will open.
  3. Click the "Upload File" button to select a PDF or image file.
  4. Once the file is uploaded, the file name will be displayed.
  5. Click the "Download File" button to save the uploaded file to a desired location.

Deployment

Contributors

License

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages