Is it possible to train with just src, mt, ter? #46
Comments
Hi James! This was not something we originally considered, which is why all of the examples assume you have more data. We did get some requests for it, though, so we added that functionality, and you can train with just src, mt and TER. To do so, specify the following in your config file (alongside the rest of the parameters):
sentence-level: True
predict-gaps: False
predict-target: False
predict-source: False
Keep in mind that some config options based on word-level tags might not work when training only at sentence level. Let us know if you find any errors while training only with sentences!
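As a quick sanity check before launching training, the four options above can be verified against a parsed config. This is only a sketch: it assumes the YAML file has already been loaded into a plain Python dict by whatever loader you use, and `check_sentence_only` is a hypothetical helper, not part of OpenKiwi.

```python
# The four options quoted in the reply above, as they would look once the
# YAML config has been parsed into a dict (the parsing step is assumed).
SENTENCE_ONLY_OPTIONS = {
    "sentence-level": True,
    "predict-gaps": False,
    "predict-target": False,
    "predict-source": False,
}


def check_sentence_only(config):
    """Return one message per option that differs from the advice above."""
    problems = []
    for key, expected in SENTENCE_ONLY_OPTIONS.items():
        actual = config.get(key)  # None if the option is missing entirely
        if actual != expected:
            problems.append(f"{key}: expected {expected}, found {actual}")
    return problems


if __name__ == "__main__":
    # Hypothetical config with one option left at a word-level setting.
    config = {
        "sentence-level": True,
        "predict-gaps": False,
        "predict-target": True,
        "predict-source": False,
    }
    for message in check_sentence_only(config):
        print(message)
```

An empty list means the four options match; it says nothing about the other word-level options mentioned above, which may still need attention.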
Closing this since there have been no updates, feel free to re-open if you have further questions!
Thanks Miguel. I successfully trained a predictor-estimator model on WMT data following your advice. I did this using a modified version of the config file in the experiments directory. In case it's useful, here are the modifications I made:
OpenKiwi/experiments/train_estimator.yaml Line 32 in 715eba7
OpenKiwi/experiments/train_estimator.yaml Line 46 in 715eba7
OpenKiwi/experiments/train_estimator.yaml Line 108 in 715eba7
OpenKiwi/experiments/train_estimator.yaml Line 114 in 715eba7
Hi,
Thanks for making openkiwi available!
Can any of the openkiwi models be trained when the only data available is
[source sentence], [machine translation sentence], [TER score]
? As far as I can tell all the examples need more data, for example the tags. But maybe I missed something.
Thanks,
James
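For anyone starting from the same kind of data, here is a rough sketch of turning (source, MT, TER) triples into line-aligned files. It assumes the triples live in a tab-separated file with exactly three columns and that the training setup expects one example per line in parallel files. The input layout, the `.src`/`.mt`/`.ter` suffixes and the `split_triples` helper are all illustrative assumptions, not something confirmed in this thread.

```python
import csv


def split_triples(tsv_path, prefix):
    """Write <prefix>.src, <prefix>.mt and <prefix>.ter, one line per example.

    Returns the number of examples written.
    """
    count = 0
    with open(tsv_path, newline="", encoding="utf-8") as tsv, \
            open(prefix + ".src", "w", encoding="utf-8") as src_out, \
            open(prefix + ".mt", "w", encoding="utf-8") as mt_out, \
            open(prefix + ".ter", "w", encoding="utf-8") as ter_out:
        reader = csv.reader(tsv, delimiter="\t", quoting=csv.QUOTE_NONE)
        for line_no, row in enumerate(reader, start=1):
            if len(row) != 3:
                raise ValueError(f"line {line_no}: expected 3 columns, got {len(row)}")
            src, mt, ter = (field.strip() for field in row)
            float(ter)  # fail early if the score is not numeric
            src_out.write(src + "\n")
            mt_out.write(mt + "\n")
            ter_out.write(ter + "\n")
            count += 1
    return count
```

Keeping the three outputs strictly line-aligned matters more than the file names: line i of each file should describe the same example.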