We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Might be a stretch, but would it be possible to label different speakers if audio has >1 person talking?
This would come handy for conference recordings with multiple presenters, etc.
The text was updated successfully, but these errors were encountered:
Thinking about possible implementation, the simplest one might be to label based on the audio channel.
Say we have a stereo recording:
Sorry, something went wrong.
Stereo-diarization is already implemented - see #64 I have some other ideas in mind for general diarization, but low-priority for the moment.
Add model_dir to arguments (ggerganov#202)
0b1ba3d
* Add model_dir to arguments * minor formatting change Co-authored-by: Jong Wook Kim <jongwook@openai.com>
No branches or pull requests
Might be a stretch, but would it be possible to label different speakers if audio has >1 person talking?
This would come handy for conference recordings with multiple presenters, etc.
The text was updated successfully, but these errors were encountered: