Whisperapp

This little app showcases how simple it is to use a state-of-the-art machine learning model!

We are working with OpenAI's Whisper model, a transformer architecture that takes a voice recording as input and splits it into 30-second chunks. Each chunk is converted into a log-Mel spectrogram (a spectrogram computed with a short-time Fourier transform and mapped onto the Mel frequency scale). The model then infers the spoken language and transcribes the speech, or even translates it into English!
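The steps above can be sketched in a few lines of Python. This is a minimal illustration that assumes the `openai-whisper` package and `ffmpeg` are installed; it is not necessarily how the app in this repository is written. The helper names `split_into_chunks` and `transcribe`, the `"base"` model size, and the file name in the usage note are placeholders chosen for the example.

```python
SAMPLE_RATE = 16_000   # Whisper works on 16 kHz mono audio
CHUNK_SECONDS = 30     # length of one input window


def split_into_chunks(samples, sample_rate=SAMPLE_RATE, chunk_seconds=CHUNK_SECONDS):
    """Illustrative helper: cut samples into fixed-length chunks, zero-padding the last one."""
    chunk_len = sample_rate * chunk_seconds
    chunks = []
    for start in range(0, len(samples), chunk_len):
        chunk = list(samples[start:start + chunk_len])
        chunk += [0.0] * (chunk_len - len(chunk))  # pad the final chunk with silence
        chunks.append(chunk)
    return chunks


def transcribe(path, model_name="base", translate=False):
    """Run the pipeline on the first 30 seconds of one audio file."""
    import whisper  # imported lazily so the helper above works without the package

    model = whisper.load_model(model_name)
    audio = whisper.load_audio(path)        # decode to 16 kHz mono float32 (uses ffmpeg)
    audio = whisper.pad_or_trim(audio)      # fit exactly one 30-second window
    mel = whisper.log_mel_spectrogram(audio).to(model.device)

    _, probs = model.detect_language(mel)   # dict: language code -> probability
    language = max(probs, key=probs.get)

    task = "translate" if translate else "transcribe"
    options = whisper.DecodingOptions(task=task, fp16=False)  # fp16=False also runs on CPU
    result = whisper.decode(model, mel, options)
    return language, result.text
```

Calling `transcribe("recording.mp3", translate=True)` would return the detected language code and an English translation of the first 30 seconds. For longer recordings, the package's `model.transcribe(path)` method handles the windowing over the whole file for you.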

The model was trained on 680,000 hours of multilingual and multitask supervised data that was collected from the web! The model is capable of understanding and transcribing in 98 different languages!

Feel free to use this code and extend it to make the app prettier or maybe even implement new features!

I also have a YouTube video where I talk more about the model and the code, and play around with different languages!

sudo-Boris/whisperapp
