We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我做过相关AI字幕的工作,不方便上传代码,但有几个方向值得借鉴:
最后时间轴的结果里无需人为修正的准确结果可以达到80%+
The text was updated successfully, but these errors were encountered:
感谢,我抽空会研究一下
Sorry, something went wrong.
也可以参考一下whisper webui https://gitlab.com/aadnk/whisper-webui 它调用silero-vad先对音频进行分块然后喂给whisper,基本上可以完美解决莫名其妙反复重复某句话的bug。对于小语种特别有用,openai/whisper#397 这里讨论的例子也是日语。
whisperX 的 colab 使用 似乎必定會牽扯到要重啟環境
! pip install torch==2.0.0+cu118 torchvision==0.15.1+cu118 torchaudio==2.0.1 torchtext==0.15.1 --index-url https://download.pytorch.org/whl/cu118
! pip install git+https://github.com/m-bain/whisperx.git
因為環境涉及到重新安裝pytorch的樣子
No branches or pull requests
我做过相关AI字幕的工作,不方便上传代码,但有几个方向值得借鉴:
最后时间轴的结果里无需人为修正的准确结果可以达到80%+
The text was updated successfully, but these errors were encountered: