New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fixes to --preserve-punctuation #119
Conversation
Codecov Report
@@ Coverage Diff @@
## master #119 +/- ##
=========================================
Coverage 100.00% 100.00%
=========================================
Files 23 23
Lines 1152 1168 +16
=========================================
+ Hits 1152 1168 +16
Continue to review full report at Codecov.
|
@hadware I think this is good to go, but it definitely changes the output when preserving punctuation compared to previous versions of phonemizer. This way seems much more consistent to me – the output will have the same number of word separators whether punctuation is being preserved or not. See the tests I modified for examples of the new output. |
Alright, super cool. I'll review the PR probably tomorrow, and hopefully we can merge this. This is fantastic, thanks again for your work! |
Great! Let me know if you need me to make any changes. And thank you for this project. It's replaced my DIY solution of a transformers model trained on the CMUDict, and is so much faster and more accurate. |
This is looking good! I'm merging this. I'd like to add a test with the large e-book from #108 , then i'll release a new version (Probably |
Oh, btw. Since you seem to have a very good understanding of the whole lib, would you mind if I ask for your opinion on future PR's (from other github users)? I'm also making you "triager" for this repo. |
Happy to help as much as I can! |
This reworks a few things related to preserving punctuation.
preserve_punctuation
is True or False.A number of the tests had to be updated due to that first bullet point.