Audio Transcriber API

This is a simple flask API that accepts a ogg base64 data (although it may be compatible to other types of audio formats), converts to WAV (using librosa and soundfile) and then transcribe using vosk, returning the text transcribed.

The current code is with the Portuguese-BR model, however, it can be easily changed to other vosk model (https://alphacephei.com/vosk/models).

How to run (development env)

Install packages

pip install -r requirements

Go to flask API folder

cd ./flaskapp

Start flask server (http://localhost:5000)

flask run

How to run (production env)

Instead running a flask server, use gunicorn WSGI HTTP server

gunicorn -w 1 --bind 0.0.0.0:3800 wsgi

Create docker image

To create a docker image, build it with:

docker build -t audiotranscriberapi .

Then run it port-forwarding the required port

docker run -p 3800:3800 audiotranscriberapi

How to use

It's recommended to use an API tool like Postman.

On Headers: Include the key Content-Type with value application/json as we will send the base64 audio data using a JSON format.

In Body: Create a JSON where the data key has the base64 audio data, for example:

{
  "data": "BASE64DATA"
}

Finally on URL field, select the POST method and send the JSON to the following address: http://localhost:5000/transcribe.

If successful, it will return a JSON with code 200 and the transcribed text in data.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
Deployment		Deployment
flaskapp		flaskapp
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Transcriber API

How to run (development env)

How to run (production env)

Create docker image

How to use

About

Releases

Packages

Languages

rmazzine/AudioTranscriberAPI

Folders and files

Latest commit

History

Repository files navigation

Audio Transcriber API

How to run (development env)

How to run (production env)

Create docker image

How to use

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages