Skip to content

sudo-Boris/whisperapp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Whisperapp

This little app showcases how simple it is to use a state-of-the-art machine learning model!

We are working with OpenAI's Whisper model, a transformer architecture that takes a voice recording as input, splits it into 30 second chunks, converts them into a special kind of spectrogram (by using a Fourier transformation) called Mel spectrogram, infers the language, and then transcribes or even translates the text to english!

Whisper Model Architecture

The model was trained on 680,000 hours of multilingual and multitask supervised data that was collected from the web! The model is capable of understanding and transcribing in 98 different languages!

Feel free to use this code and extend it to make the app prettier or maybe even implement new features!

I also have a YouTube video where I go talk more about the model and code and also play around by trying different languages!

About

Voice to text app using OpenAI's Whisper model.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages