Skip to content

anirbankonar123/documentai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Google Doc AI Demo

Getting started

Pre-requisites

python 3.9

pip install streamlit

Setup Google Cloud Vision using the instructions at https://cloud.google.com/vision/docs/ocr

Setup Google Document AI using the instructions at https://cloud.google.com/document-ai

Steps to folow

  1. To read the PDF file and create .csv files
export GOOGLE_APPLICATION_CREDENTIALS="path to security credentials json file"
python doc_ai_table.py --pdf <path to pdf file> --folder <output folder>

Check the csv files produced for each table detected in the PDF.
Also check the header.json produced based on form-fields (key value pairs) detected in first page of PDF
Customize the post-processing logic based on your need to write the Invoice.csv<br?

  1. Run the UI demo
streamlit run app.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages