Using the Vision API with Python

Learn how to use several features of the Google Cloud Vision API with Python by building four separate scripts, each highlighting a different feature of the API. This is the code repo for the "Using the Vision API with Python" codelab (a free, self-paced, hands-on tutorial) at http://g.co/codelabs/vision-python.

Prerequisites

A Gmail/Google account (Workspace accounts may require administrator approval)
A Google Cloud (Platform) project
An active GCP billing account
Basic Python skills

Supported versions: Python 2.6+ or 3.6+

Python 2 considerations

Python 2 was sunset by its community in January 2020. As such, support for Python 2 in most Google Cloud products will wane over time (with the exception of App Engine, which has expressed continued long-term support of legacy runtimes). This includes the Vision API: the final client library version supporting Python 2 is v1.0.0, and its use is no longer featured in the Vision API documentation.

To help accelerate upgrading to 3.x, the scripts in this tutorial support only Python 3 as-is, but commented-out code supporting both Python 2 and 3 is available if desired. Removing each "# Py2+3" marker in the code samples gives you a script that works under both Python 2 (with Vision client library v1.0.0) and Python 3 (with the latest Vision client library). No "Python 2-only" options are provided. The Vision API client library source can be found in its open source repo.
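As a rough illustration of what such dual-version code can look like, the sketch below picks the image message class depending on which client library is installed. It assumes that newer client libraries expose `vision.Image` while the Python 2-compatible v1.0.0 release exposes it as `vision.types.Image`; the helper name `new_image` is hypothetical and is not part of the sample scripts.

```python
def new_image(vision, image_uri):
    """Return a Vision API image message that points at image_uri.

    `vision` is the already-imported `google.cloud.vision` module.
    Assumption: newer releases provide `vision.Image`, while the
    Python 2-compatible v1.0.0 release provides `vision.types.Image`.
    """
    if hasattr(vision, 'Image'):
        image_cls = vision.Image          # newer client library (Python 3)
    else:
        image_cls = vision.types.Image    # v1.0.0 client library (Python 2 or 3)
    image = image_cls()
    image.source.image_uri = image_uri    # remote image, e.g. a gs:// or https:// URI
    return image
```

With the client library installed, this would be called as `new_image(vision, uri)` after `from google.cloud import vision`, and the same line then works under either library version.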

Description

The Google Cloud Vision API lets developers integrate vision detection features into applications, including image labeling, facial feature detection, landmark detection, optical character recognition (OCR), "safe search" (tagging of explicit content), product or corporate logo detection, and several others. These sample scripts demonstrate usage of the API from Python. The code in this repo can run under both Python 2 and 3 (see "Python 2 considerations"). Python 2 compatibility exists to help developers migrate to Python 3, and we recommend migrating to 3.x as soon as possible.
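The features listed above map onto per-feature helper methods of the Python client. The sketch below is one way to dispatch on a feature name; the `FEATURES` table and the `annotate` helper are hypothetical names introduced here for illustration, and the client is assumed to be a `vision.ImageAnnotatorClient` from the `google-cloud-vision` package.

```python
# Feature name -> (client helper method, response field holding the results).
FEATURES = {
    'labels': ('label_detection', 'label_annotations'),
    'faces': ('face_detection', 'face_annotations'),
    'landmarks': ('landmark_detection', 'landmark_annotations'),
    'text': ('text_detection', 'text_annotations'),
    'safe_search': ('safe_search_detection', 'safe_search_annotation'),
    'logos': ('logo_detection', 'logo_annotations'),
}


def annotate(client, image, feature):
    """Run one detection feature on an image and return its annotations.

    `client` is assumed to be a vision.ImageAnnotatorClient and `image`
    a Vision API image message; `feature` is a key of FEATURES.
    """
    method_name, field_name = FEATURES[feature]
    response = getattr(client, method_name)(image=image)
    # The API reports per-image failures in the response rather than raising.
    if response.error.message:
        raise RuntimeError(response.error.message)
    return getattr(response, field_name)
```

For example, `annotate(client, image, 'logos')` would return the logo annotations for the image, assuming valid credentials and an enabled Vision API in the project.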

Codelab

This repository consists of the sample scripts that correspond to the "Using the Vision API with Python" hands-on codelab. That codelab teaches developers how to use some of the features described above with the Cloud Vision API using Python, namely label annotations, OCR/text extraction, landmark detection, and detecting facial features.
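For orientation, a label annotation request might look roughly like the sketch below. It is not the codelab's script: the function names `format_labels` and `detect_labels` are hypothetical, and the code assumes a current `google-cloud-vision` release (Python 3), application credentials already configured, and the Vision API enabled in your project.

```python
def format_labels(labels):
    """Turn label annotations into 'description (score%)' strings."""
    return ['%s (%.2f%%)' % (label.description, label.score * 100.0)
            for label in labels]


def detect_labels(image_uri):
    """Request label annotations for a remote image and format them.

    Imported lazily so that format_labels() stays usable without the
    client library installed.
    """
    from google.cloud import vision

    client = vision.ImageAnnotatorClient()
    image = vision.Image()
    image.source.image_uri = image_uri   # e.g. a gs:// or public https:// URI
    response = client.label_detection(image=image)
    return format_labels(response.label_annotations)
```

Calling `detect_labels()` with the URI of a publicly readable image and printing each returned string gives a simple label report. The OCR, landmark, and face features follow the same request pattern with a different client method and response field.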

Cost

Use of the Vision API is not free; however, certain Google Cloud Platform (GCP) products feature an "Always Free" tier whose limits you must exceed before incurring charges. For the purposes of the codelab, each call to the Vision API counts against that free tier, and as long as you stay within its monthly limits in aggregate, you should not incur any charges.

Resources

Cloud Vision API

Google Cloud Vision API home page and live demo
Vision API label detection/annotation
Vision API facial feature recognition
Vision API landmark detection
Vision API optical character recognition (OCR)
Vision API "Safe Search"
Vision API product or corporate logo detection
Stack Overflow

Python and Google Cloud

Python on the Google Cloud Platform
GCP Python client libraries

Other languages

.NET/C#
Ruby (archived)

Support

If you've found an error in the codelab or the sample app, check the Issues tab to see if there's an open issue or file a new one. Patches are encouraged; please refer to CONTRIBUTING for details.
