Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pronouncing.phones_for_word("a") returns ['AH0', 'EY1', 'EY1 F AO1 R T UW1 W AH1 N T UW1 EY1 T'] #34

Closed
denisfitz57 opened this issue Oct 23, 2017 · 7 comments

Comments

@denisfitz57
Copy link

The third result seems incorrect.

@hugovk
Copy link
Contributor

hugovk commented Oct 23, 2017

It does!

Looks like some oddness in the CMU dictionary, which begins:

A AH0
A(1) EY1
A'S EY1 Z
A. EY1
A.'S EY1 Z
A.S EY1 Z
A42128 EY1 F AO1 R T UW1 W AH1 N T UW1 EY1 T
AA EY2 EY1

@hugovk
Copy link
Contributor

hugovk commented Oct 23, 2017

Or the dictionary is fine, but A42128 is a bit of an odd word!

Rather the problem is here:

https://github.com/aparrish/pronouncingpy/blob/master/pronouncing/__init__.py#L30

Stripping the brackets and numbers means the word A42128 is interpreted as the word A.

@hugovk
Copy link
Contributor

hugovk commented Oct 23, 2017

See PR #36 for a fix with tests.

@aparrish
Copy link
Owner

weird, I wonder why "A42128" is even in there in the first place.

@hugovk
Copy link
Contributor

hugovk commented Oct 23, 2017

I've no idea, but it is quite a nice shade of red.

@denisfitz57
Copy link
Author

denisfitz57 commented Oct 23, 2017 via email

@hugovk
Copy link
Contributor

hugovk commented Oct 23, 2017

Ha, that video is just generated from the same CMU Pronouncing Dictionary!

Maybe it's a Mountweazel. https://en.m.wikipedia.org/wiki/Fictitious_entry

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants