-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make a postprocess to handle capitalisation #67
Comments
At what stage exactly and on the basis of which information? I'm thinking about how dealing with the difference in French nouns like "allemand" (the language) and "Allemand" (a person). Currently, I do this in transfer. |
@ftyers we can use secondary tags to propagate the case till the post generator and then apply it there if needed. |
This is related: #75 |
@hectoralos I would do it in posttransfer using the LU and perhaps a 1-2 word context window. |
@ftyers basically only using dictionary case and "is this a sentence end"-context and ignoring input case? We'd lose the ability to keep UPPER CASE and Titles with Titlecase but maybe that's worth the code simplification … |
|
@mr-martian |
Processor added in 7e7004d |
Capitalisation should not be done in transfer, it should be done in a postprocess, much like "recasing" in SMT.
The text was updated successfully, but these errors were encountered: