Skip to content

n0th1ng-else/voice-to-text-bot

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
e2e
 
 
k8s
 
 
src
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Telegram Bot Converts Voice Messages Into Text

Bot logo

This is the simple bot that converts voice into text. I tried to review available public speech recognition services and the results you can see below in the table.

Tech Stack

Service model

flowchart BT
  subgraph tg[Telegram]
    voice[Voice message]
    audio[Audio]
    video[Video note]
    text[Text message]
    bot[AudioMessBot API]
  end
  subgraph cluster[Replicas]
    r1{{Replica 1}}
    ar{{Active replica}}
    r2{{Replica N}}
  end
  voice-->bot
  audio-->bot
  video-->bot
  bot-->text
  bot---ar
  ar---db[(PSQL\nDatabase)]
  ar---cloud((Cloud API provider))

Services overview

Service provider Russian lang Synchronous API Duration limitation File upload Speed
IBM Watson no no N/A Unknown Unknown
Microsoft Azure no no N/A Unknown Unknown
Amazon AWS yes no Unlimited S3 Minutes
Google Cloud yes yes 1 minute*[1] Direct / GDrive Instant*[2]
Wit.ai yes yes 5 minutes Direct Instant
  • For direct upload
    1 Unlimited for asynchronous upload via Google Drive
    2 Takes a while for asynchronous upload via Google Drive