Skip to content
This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

changes to make tutorial code simpler #1002

Closed

Conversation

rutyrinott
Copy link
Contributor

Summary:
create lang2id maps for 15,20,and 43 that can be imported

define the read_vocab helper functions outside the tensorize to be used in notebook tutorial

Reviewed By: kartikayk

Differential Revision: D17555133

Michael Wu and others added 2 commits September 25, 2019 09:25
Summary:
Pull Request resolved: facebookresearch#979

D17241503 selects only masked tokens for the final logits / targets during MLM. This fails when there are no masked tokens (e.g. at the end of a file there can be a very short batch). In this case, select just the first token in the first batch.

BTW, the new masking strategy is faster (20%) than the old one - f138005051 vs f138005044.

Differential Revision: D17370855

fbshipit-source-id: 453ccc4797c0d976281aa14212eba43b6f9b5d6f
Summary:
create lang2id maps for 15,20,and 43 that can be imported

define the read_vocab helper functions outside the tensorize to be used in notebook tutorial

Reviewed By: kartikayk

Differential Revision: D17555133

fbshipit-source-id: 2d00fa2357959d58a2511302377feeb7f376cf01
@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Sep 25, 2019
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in d6e9b14.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
CLA Signed Do not delete this pull request or issue due to inactivity. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants