Added tutorial about domain adaptation, including LHUC #409

Merged
fhieber merged 2 commits into master from adapt-tutorial on May 24, 2018

davvil (Contributor) commented May 23, 2018

Included tutorial about adapting NMT models, including LHUC.

Pull Request Checklist

  • Changes are complete (if posting work-in-progress code, prefix your pull request title with '[WIP]'
    until you can check this box).

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

davvil (Contributor Author) commented May 23, 2018

This is a nice use case for the recently introduced config files. :-)

fhieber (Contributor) left a comment

thanks for adding!

this tutorial, we show two methods on how to perform domain adaptation of a
general translation system using Sockeye.

We assume you already have an already trained Sockeye model, for example the

Contributor

'we assume you already have a trained Sockeye model'

general translation system using Sockeye.

We assume you already have an already trained Sockeye model, for example the
one trained in the [second tutorial](../wmt/README.md). We also assume that you

Contributor

'for example from the second tutorial.'


We assume you already have an already trained Sockeye model, for example the
one trained in the [second tutorial](../wmt/README.md). We also assume that you
have two training sets, one composed of general or out-of-domain data, and one

Contributor

I would introduce the abbreviations you use below here ('id' and 'ood')


## Continuation of training

This method consists in taking the parameters of an already trained system and

Contributor

maybe just: 'This method fine-tunes a trained model and starts a second training run on in-domain data, initialized with the parameters obtained from the out-of-domain data.'

This method consists in taking the parameters of an already trained system and
use them as initialization of a new training run only on the in-domain data.
Thus you "continue training" on the data you are more interested in. Freitag
and Al-Onaizan (2016) showed that, while being quite a straightforward

Contributor

maybe add a "# references" section at the bottom with the full citation.

use them as initialization of a new training run only on the in-domain data.
Thus you "continue training" on the data you are more interested in. Freitag
and Al-Onaizan (2016) showed that, while being quite a straightforward
technique, this method can achieve good results.

Contributor

'showed that this straightforward technique can achieve good results.'


## Leaning Hidden Unit Contribution

Leaning Hidden Unit Contribution (LHUC) is a method proposed by Vilar (2018),

Contributor

typo: leaning -> learning

Contributor

full citation at the bottom maybe

fhieber (Contributor) left a comment

lgtm

@fhieber fhieber merged commit af59303 into master May 24, 2018
@fhieber fhieber deleted the adapt-tutorial branch May 24, 2018 06:24