Update training commands #134

Merged
merged 5 commits into from Mar 28, 2018

hitvoice
Contributor

Update training commands in data/README to match the latest version of this project according to #132.

  • Motivation: the commands in the previous data/README are obsolete and fail with the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50".
  • What's changed: added the arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands for all 3 datasets.
  • Result: the new commands run without error on all 3 datasets.
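
To illustrate why the old commands fail, here is a minimal sketch of a two-pass argparse setup in which a flag such as "--label-smoothing" is only registered once the matching "--criterion" has been selected. This is not the project's actual code: the `CRITERION_ARGS` and `LR_SCHEDULER_ARGS` tables, the default values, and the `parse` helper are hypothetical and exist only for this example.

```python
import argparse

# Hypothetical tables: extra flags contributed by each criterion / LR scheduler.
# The default choices below are assumptions made for this sketch.
CRITERION_ARGS = {
    "cross_entropy": [],
    "label_smoothed_cross_entropy": [("--label-smoothing", float)],
}
LR_SCHEDULER_ARGS = {
    "reduce_lr_on_plateau": [],
    "fixed": [("--force-anneal", int)],
}


def parse(argv):
    """Parse argv in two passes, registering component-specific flags in between."""
    parser = argparse.ArgumentParser(prog="train.py")
    parser.add_argument("data")
    parser.add_argument("--criterion", default="cross_entropy",
                        choices=sorted(CRITERION_ARGS))
    parser.add_argument("--lr-scheduler", default="reduce_lr_on_plateau",
                        choices=sorted(LR_SCHEDULER_ARGS))

    # First pass: only find out which criterion and scheduler were requested.
    # Flags the parser does not know yet are tolerated here.
    known, _ = parser.parse_known_args(argv)

    # Register the flags that belong to the selected components.
    extra = CRITERION_ARGS[known.criterion] + LR_SCHEDULER_ARGS[known.lr_scheduler]
    for flag, flag_type in extra:
        parser.add_argument(flag, type=flag_type)

    # Second pass: strict parsing; any flag still unknown is an error.
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse([
        "data-bin/iwslt14.tokenized.de-en",
        "--criterion", "label_smoothed_cross_entropy", "--label-smoothing", "0.1",
        "--lr-scheduler", "fixed", "--force-anneal", "50",
    ])
    print(args.label_smoothing, args.force_anneal)
```

In this sketch, leaving out "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" means the second pass never learns about "--label-smoothing" or "--force-anneal", so argparse exits with an "unrecognized arguments" error like the one quoted above.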

Change "awailable" to "available".
Update training commands in data/README to match the latest version of this project according to facebookresearch#132.

Continue from 3c07295: add omitted "\".
Contributor

@edunov edunov left a comment

Thanks for making this change! Only one little comment and we should be able to land it.

data/README.md Outdated
@@ -20,9 +20,10 @@ $ python preprocess.py --source-lang de --target-lang en \

# Train the model:
$ mkdir -p checkpoints/fconv
-$ CUDA_VISIBLE_DEVICES=0 python train.py data-bin/iwslt14.tokenized.de-en \
+$ python train.py data-bin/iwslt14.tokenized.de-en \

This comment was marked as off-topic.

specify a single GPU setup for IWSLT14
@edunov edunov merged commit 4883652 into facebookresearch:master Mar 28, 2018
myleott added a commit that referenced this pull request Jun 26, 2018
noisychannel pushed a commit to noisychannel/fairseq that referenced this pull request Jan 3, 2020
moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this pull request Sep 29, 2020
yfyeung pushed a commit to yfyeung/fairseq that referenced this pull request Dec 6, 2023
3 participants