Skip to content

isabella232/document-ai-samples

 
 

Google Cloud Document AI Samples

License GitHub Super-Linter Document AI

Welcome to the Google Cloud Document AI sample repository.

Overview

The repository contains samples and Community Samples that demonstrate how to analyze, classify and search documents using Google Cloud Document AI.

Samples

  • PDF Splitter Sample: This project uses the Document AI API to split PDF documents.
  • Web App Demo: This project is a fullstack application that uses Document AI to process different types of documents. This application currently supports Form, Invoice and OCR processors.
  • Tax Processing Pipeline: This project uses the Document AI API to classify, parse, and calculate a tax form using multiple document types.
  • Fraud Detection: This project uses the Document AI Invoice Parser with EKG and Google Maps to store document Entities in BigQuery.

Test Document Files

If you need Document Files to run the samples, you can access them from this publicly-accessible Google Cloud Storage Bucket.

gs://cloud-samples-data/documentai/

The directory is organized by solution and document type, you can see the folder structure listed here.

documentai/
├── ContractDocAI
├── GeneralProcessors
│   ├── FormParser
│   ├── OCR
│   └── Quality
├── IdentityDocAI
│   ├── Driver's License (USA)
│   └── Passport (USA)
├── LendingDocAI
│   ├── 1040 Parser
│   ├── 1099-DIV Parser
│   ├── 1099-INT Parser
│   ├── 1099-MISC Parser
│   ├── 1099-NEC Parser
│   ├── 1099-R Parser
│   ├── Bank Statement Parser
│   ├── Lending Document Splitter & Classifier
│   └── Pay Slip Parser
├── ProcurementDocAI
│   ├── Expense Parser
│   ├── Invoice Parser
│   ├── Procurement Document Splitter & Classifier
│   └── Utility Parser
├── codelabs
    ├── form-parser
    ├── hitl
    ├── ocr
    └── specialized-processors

Codelabs

Codelabs Logo

Community Samples


Disclaimer: Community samples are not officially maintained by Google.


Contributing

Contributions welcome! See the Contributing Guide.

Getting help

Please use the issues page to provide feedback or submit a bug report.

Disclaimer

This is not an officially supported Google product. The code in this repository is for demonstrative purposes only.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 52.3%
  • TypeScript 33.0%
  • HTML 7.5%
  • CSS 3.5%
  • Shell 1.7%
  • JavaScript 1.1%
  • Dockerfile 0.9%