OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.

wintermi/ocr-runner

OCR Runner

Description

A command-line application that recursively walks the input path and submits every image file for optical character recognition (OCR). Files are sent to the Google Cloud Vision API or, if a prediction endpoint is provided, to a Google Cloud Document AI processor. The application then writes the image information and annotations to a single newline-delimited JSON file.

USAGE:
    ocr-runner -i PATH -o FILE

ARGS:
  -endpoint string
        Document AI Prediction Endpoint  (Optional)
  -full
        Output full details to JSON
  -i string
        Input Path  (Required)
  -o string
        Output File  (Required)
  -verbose
        Display verbose or debug detail

Valid File Extensions

The application will automatically filter out all files that do not have one of the following extensions:

  • .bmp
  • .gif
  • .jpg
  • .jpeg
  • .pdf
  • .png
  • .tif
  • .tiff
  • .webp
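
The filtering rule above can be sketched in Go as a recursive walk combined with an extension check. This is a minimal sketch and not the application's actual code: the names `IsValidFile` and `CollectFiles` are hypothetical, and case-insensitive matching (so that `.PNG` is accepted) is an assumption.

```go
package main

import (
	"fmt"
	"io/fs"
	"path/filepath"
	"strings"
)

// validExt mirrors the extension list above.
var validExt = map[string]bool{
	".bmp": true, ".gif": true, ".jpg": true, ".jpeg": true, ".pdf": true,
	".png": true, ".tif": true, ".tiff": true, ".webp": true,
}

// IsValidFile reports whether path ends in one of the listed extensions.
// Lowercasing the extension is an assumption about how matching works.
func IsValidFile(path string) bool {
	return validExt[strings.ToLower(filepath.Ext(path))]
}

// CollectFiles walks root recursively and returns every regular path
// that passes the extension filter.
func CollectFiles(root string) ([]string, error) {
	var files []string
	err := filepath.WalkDir(root, func(path string, d fs.DirEntry, err error) error {
		if err != nil {
			return err // stop on the first directory that cannot be read
		}
		if !d.IsDir() && IsValidFile(path) {
			files = append(files, path)
		}
		return nil
	})
	return files, err
}

func main() {
	files, err := CollectFiles(".")
	if err != nil {
		fmt.Println("walk failed:", err)
		return
	}
	for _, f := range files {
		fmt.Println(f)
	}
}
```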

License

ocr-runner is released under the Apache License 2.0 unless explicitly mentioned in the file header.
