
Good toy parameters? #572

Closed
bittlingmayer opened this issue Nov 5, 2018 · 3 comments

Comments

@bittlingmayer
Contributor

For teaching, baselines, and proofs of concept, it would be good to have toy parameters that can train a model in, for example, one hour on a CPU, on a relatively simple dataset. (We have parallel datasets for language pairs that are similar, or for the same language with different alphabets.) It can be word-level or character-level.

Can you suggest something?

@bittlingmayer
Contributor Author

By the way, some parameters in the example at https://aws.amazon.com/en/blogs/machine-learning/train-neural-machine-translation-models-with-sockeye/ are out of date (for example --attention-type).

Right now it is hard to know which combinations of parameters are valid and which values are the defaults (for example, rnn).

@mjpost
Contributor

mjpost commented Nov 5, 2018

You might find it helpful to start with the sequence copy model tutorial. It uses a very small vocabulary of integers, and the settings there should work fine.

If you want to update the tutorial (or add another one) and submit it as a PR against docs/, that would likely be welcomed!
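For reference, here is a minimal sketch of what such a toy setup could look like: generate parallel "copy" data over a digit vocabulary, then launch a small CPU-friendly training run. The sockeye.train flag names are assumptions based on the Sockeye 1.x CLI from around the time of this issue; check `python -m sockeye.train --help` for the exact names in your installed version.

```python
# Sketch of a sequence-copy toy setup (assumed Sockeye 1.x flag names).
import random
import subprocess

random.seed(12)

def write_copy_data(prefix, num_samples):
    """Write parallel source/target files where the target equals the source."""
    with open(prefix + ".source", "w") as src, open(prefix + ".target", "w") as trg:
        for _ in range(num_samples):
            seq = " ".join(str(random.randint(0, 9))
                           for _ in range(random.randint(1, 10)))
            src.write(seq + "\n")
            trg.write(seq + "\n")

write_copy_data("train", 100000)  # training data
write_copy_data("dev", 1000)      # validation data

# Small model intended to train quickly on a CPU. The flags below are
# assumptions; verify them against your Sockeye version's --help output.
subprocess.run([
    "python", "-m", "sockeye.train",
    "--source", "train.source",
    "--target", "train.target",
    "--validation-source", "dev.source",
    "--validation-target", "dev.target",
    "--num-embed", "32",
    "--rnn-num-hidden", "64",
    "--use-cpu",
    "--max-num-checkpoint-not-improved", "3",
    "--output", "seqcopy_model",
], check=True)
```

Because the task is trivial (copying short digit sequences), a model this small converges quickly, which makes it suitable for teaching and smoke tests rather than as a translation baseline.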

@bittlingmayer
Contributor Author

That works, thank you!
