Join GitHub today
GitHub is home to over 40 million developers working together to host and review code, manage projects, and build software together.Sign up
Feature/Speech to text transcription #495
To keep things simple, the implementation for now uses html5 audio instead of something more sophisticated like wavesurfer. This can be improved in the future if the requirement arises. The
For ease-of-use, speech-to-text data can be imported either by posting audio files (MP3, WAV, etc.) or by uploading a JSONL manifest that encodes the audio as data URIs or URLs to the audio files.
To make it easier to identify and distinguish audio files, the document left-navigation has been updated to display a file name (instead of file content) if the