Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Idiosyncracies in each language. #41

Open
arnavkapoor opened this issue Aug 26, 2020 · 0 comments
Open

Idiosyncracies in each language. #41

arnavkapoor opened this issue Aug 26, 2020 · 0 comments

Comments

@arnavkapoor
Copy link
Collaborator

arnavkapoor commented Aug 26, 2020

I looked at German/French and both have some idiosyncrasies which needs to be handled. For French, using quatre-vingt for 80 and then allowing numbers from 1 to 19 as the suffix , eg) quatre-vingt-dix-neuf for 99, (80+19), would need to be handled. With German, it's a more fundamental issue, as they tend to build numbers from left to right. achtunddreißig (28) which is like 8 and twenty. Can refer to this for more details about this building method and other languages also use it. This might be fixed by reversing the list of tokens. (But need to look more into it). Also for larger numbers (greater than one thousand I believe) it does revert back to left to right.

While this does only mention two languages there would definitely be such cases and exceptions in other languages too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant