
Good toy parameters? #572

Closed
bittlingmayer opened this issue Nov 5, 2018 · 3 comments

Comments

@bittlingmayer
Contributor

For teaching, baselines, and proofs of concept, it would be good to have toy parameters that can train a model in, for example, one hour on a CPU, on a relatively simple dataset. (We have parallel datasets for language pairs that are similar, or for the same language with different alphabets.) It can be word-level or character-level.

Can you suggest something?

@bittlingmayer
Contributor Author

By the way, some parameters in the example at https://aws.amazon.com/en/blogs/machine-learning/train-neural-machine-translation-models-with-sockeye/ are out of date (for example --attention-type).

Right now it is hard to know which combinations of parameters are valid and which values are the defaults (for example, rnn).

@mjpost
Contributor

mjpost commented Nov 5, 2018

You might find it helpful to start with the sequence copy model tutorial. It uses a very small vocabulary of integers, and the settings there should work fine.

If you want to update the tutorial (or add another one) and submit it as a PR against docs/, that would likely be welcomed!
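For reference, here is a minimal sketch of what such a toy setup could look like: generate parallel "copy" data over a digit vocabulary, then launch a small CPU-friendly training run. The sockeye.train flag names are assumptions based on the Sockeye 1.x CLI from around the time of this issue; check `python -m sockeye.train --help` for the exact names in your installed version.

```python
# Sketch of a sequence-copy toy setup (assumed Sockeye 1.x flag names).
import random
import subprocess

random.seed(12)

def write_copy_data(prefix, num_samples):
    """Write parallel source/target files where the target equals the source."""
    with open(prefix + ".source", "w") as src, open(prefix + ".target", "w") as trg:
        for _ in range(num_samples):
            seq = " ".join(str(random.randint(0, 9))
                           for _ in range(random.randint(1, 10)))
            src.write(seq + "\n")
            trg.write(seq + "\n")

write_copy_data("train", 100000)  # training data
write_copy_data("dev", 1000)      # validation data

# Small model intended to train quickly on a CPU. The flags below are
# assumptions; verify them against your Sockeye version's --help output.
subprocess.run([
    "python", "-m", "sockeye.train",
    "--source", "train.source",
    "--target", "train.target",
    "--validation-source", "dev.source",
    "--validation-target", "dev.target",
    "--num-embed", "32",
    "--rnn-num-hidden", "64",
    "--use-cpu",
    "--max-num-checkpoint-not-improved", "3",
    "--output", "seqcopy_model",
], check=True)
```

Because the task is trivial (copying short digit sequences), a model this small converges quickly, which makes it suitable for teaching and smoke tests rather than as a translation baseline.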

@bittlingmayer
Contributor Author

That works, thank you!
