Azure batch transcription PoC

This CLI tool is a working version of Azure Cognitive Speech Service with batch transcription. It can be a perfect example of call center transcription. It detects the language, timestamp, and speaker recognition/diarization. Just store your audio files in a directory and the tool will correct the audio file's bitness and change the audio channel to mono (if it's stereo), then it'll upload the verified audio files in a storage container and collect the blob links and submit those links for batch transcription. When the job is done, it'll create an output directory containing your transcriptions as txt files.

Installation

Use the package manager pip to install foobar.

pip install -r requirements.txt

Configurations

Grab your Azure Client id, Client secret, Tenant id, Service region, subscription key (Azure Speech), and Azure storage container connection string from the portal and put them into the .env file accordingly.

Set your folder containing audio files in the TRANSCRIPTION_DIRECTORY variable from .env Set the VERBOSE variable to True if you want to see every step in CLI

Usage

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.idea		.idea
.gitignore		.gitignore
example.env		example.env
main.py		main.py
readme.md		readme.md
requirements.txt		requirements.txt
workflow.png		workflow.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

.gitignore

.gitignore

example.env

example.env

main.py

main.py

readme.md

readme.md

requirements.txt

requirements.txt

workflow.png

workflow.png

Repository files navigation

Azure batch transcription PoC

Installation

Configurations

Usage

About

Releases

Packages

Languages

kazirahiv/AzureTranscribePoC

Folders and files

Latest commit

History

Repository files navigation

Azure batch transcription PoC

Installation

Configurations

Usage

About

Topics

Resources

Stars

Watchers

Forks

Languages