Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

识别前的声音分隔,能否使用断句分割 #336

Open
popdog0 opened this issue Apr 2, 2024 · 0 comments
Open

识别前的声音分隔,能否使用断句分割 #336

popdog0 opened this issue Apr 2, 2024 · 0 comments

Comments

@popdog0
Copy link

popdog0 commented Apr 2, 2024

几点建议:
1.识别前的声音分隔,能否使用断句分割,就是根据静音检测,把一句话分割到一个片断里,这样识别的效果可能更好一点。
2.能否去掉拟声词,如:啊、哦、噢、嗯等,以及笑声(哈、呵、嘿)、哭声(呜)等,因为这些没必要翻译
3.还有就是人们口语中的重复词,如:别动、别动、别动;其实也只写一个就行了,没必要多个

如果觉得有用,而且技术上也没问题,可以考虑再以后面面实现,觉得不好就算了。
下面是我静音检测的命令:
ffmpeg.exe -i test.mp3 -af silencedetect=noise=-30dB:d=0.5 -f null

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant