-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add scripts for speech-to-text using whisper and stt+forced alignment with whisperX #13
Conversation
hey @900miles this looks great so far. Could you try to play around with having the input to the functions be the Audio class. this is a nice way to zip the signal and sampling rate throughout the functions see here for when it is output, and two lines down from there where it is an input b2aiprep/src/b2aiprep/process.py Line 51 in b5b342f
|
hey @900miles do you mind adding the packages you use in your |
New commit should allow working directly with Audio objects. I've also added a requirements.txt but I've never really made one before so I'm not sure if I did it correctly. |
instead of a requirements.txt just add it to the |
also perhaps change the filename to |
Done and done! |
Adds two functions for using Whisper or WhisperX to transcribe an audio file, and can perform speaker diarization and forced alignment of text output if using WhisperX.