Azure Cognitive Services - Speech to Text Batch Transcription

Microsoft's Azure Cognitive Services are a set of machine learning models for problems such as object recognition, sentiment analysis, entity extraction, and speech to text.

This demo uses the Speech API's Speech to Text Batch Transcription service to transcribe audio from an Azure Blob Storage Container

Prerequisites

Data must be uploaded into an Azure Blob Storage Container
Audio files must be in either WAV or MP3 format (with PCM Codec) or OGG (OPUS Codec), with a bitrate of 16-bit and a sample rate of either 8 or 16 kHz, in either mono or stero.
A Cognitive Services Speech Subscription will be required, using a standard tier subscription

Further details are available from the documentation

A Swagger is available for API testing and documentation (see "Custom Speech Transcriptions" section). Note you may need to change the region URL from uksouth to where you've provisioned the Speech API in the following section.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Speech Services Demo - API.ipynb		Speech Services Demo - API.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Speech Services Demo - API.ipynb

Speech Services Demo - API.ipynb

Repository files navigation

Azure Cognitive Services - Speech to Text Batch Transcription

Prerequisites

About

Releases

Packages

Languages

sconeill/Cognitive-Services

Folders and files

Latest commit

History

README.md

README.md

Speech Services Demo - API.ipynb

Speech Services Demo - API.ipynb

Repository files navigation

Azure Cognitive Services - Speech to Text Batch Transcription

Prerequisites

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages