Accentor doesn't always return all of the possible variants #28

Open
DinoTheDinosaur opened this issue Jul 10, 2018 · 2 comments

Comments

@DinoTheDinosaur
Collaborator

DinoTheDinosaur commented Jul 10, 2018

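When the parameter mode is set to 'many', Accentor does not always return all of the possible stress variants.
For example, both variants are returned here:
дорогой -> доро+гой and дорого+й
But only one variant is returned here:
дела -> дела+, окна -> о+кна
The variants де+ла and окна+ are missing, although the warning ' Word has too many accent variants!' appears in both the complete and the incomplete cases.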

@DinoTheDinosaur
Collaborator Author

The problem seems to be in the homograph dictionary.

@DinoTheDinosaur
Collaborator Author

DinoTheDinosaur commented Jul 10, 2018

Accents_new.json is missing some variants of words, although for every variant it does contain it always gives the precise morphological form. For dictionary generation, it is better to combine homographs_old.json with Wiktionary parsing. For transcription without variants, it is better to use homographs.json. This will be implemented as a switch: homograph_dict='old' or homograph_dict='new', respectively.
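
A minimal sketch of how such a switch could behave, not the actual Accentor code. The function names `select_homograph_dict` and `accent_variants`, the default mode value `'one'`, and the small in-memory dictionaries are all hypothetical stand-ins for the real JSON files. Only the `homograph_dict` values `'old'` and `'new'` and the mode `'many'` come from this issue.

```python
# Hypothetical stand-in for the fuller dictionary (all known stress variants).
OLD_HOMOGRAPHS = {
    "дорогой": ["доро+гой", "дорого+й"],
    "дела": ["де+ла", "дела+"],
    "окна": ["о+кна", "окна+"],
}

# Hypothetical stand-in for the newer dictionary, which lacks some variants.
NEW_HOMOGRAPHS = {
    "дорогой": ["доро+гой", "дорого+й"],
    "дела": ["дела+"],
    "окна": ["о+кна"],
}


def select_homograph_dict(homograph_dict="new"):
    """Return the dictionary chosen by the proposed homograph_dict switch."""
    if homograph_dict == "old":
        return OLD_HOMOGRAPHS
    if homograph_dict == "new":
        return NEW_HOMOGRAPHS
    raise ValueError("homograph_dict must be 'old' or 'new'")


def accent_variants(word, mode="one", homograph_dict="new"):
    """Look up stress variants; 'many' returns all of them, otherwise the first."""
    variants = select_homograph_dict(homograph_dict).get(word.lower(), [])
    if mode == "many":
        return list(variants)
    return variants[:1]


if __name__ == "__main__":
    for word in ("дорогой", "дела", "окна"):
        print(word, accent_variants(word, mode="many", homograph_dict="old"))
```

With the toy data above, `homograph_dict='old'` yields both variants for дела and окна in mode 'many', while `homograph_dict='new'` reproduces the incomplete output reported in this issue.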
