This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

Fix RoBERTa Q&A Training Bug with multiple BoS tokens. #1343

Closed
wants to merge 1 commit into from


@debowin debowin commented Apr 29, 2020

Summary:
SquadForRoBERTaTensorizer adds two BoS tokens to the input sequence during training and evaluation, which differs from inference-time behavior.
This diff fixes that bug.
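
To illustrate the kind of mismatch described above, here is a minimal sketch in plain Python. It is not the PyText code and not the actual diff: the helper names (`tokenize_with_bos`, `build_input_buggy`, `build_input_fixed`, `count_leading_bos`) are hypothetical, the whitespace tokenizer and the simplified question/passage layout are assumptions, and the source does not say where the second BoS token came from. The sketch only shows how a sequence can end up with two BoS tokens when two layers each add one.

```python
from typing import List

# RoBERTa-style special tokens; the exact strings are an assumption for this sketch.
BOS, EOS = "<s>", "</s>"


def tokenize_with_bos(text: str) -> List[str]:
    """Hypothetical tokenizer that already prepends BoS (whitespace split for brevity)."""
    return [BOS] + text.split()


def build_input_buggy(question: str, passage: str) -> List[str]:
    # Bug pattern: the caller prepends BoS again on top of the tokenizer's BoS.
    return [BOS] + tokenize_with_bos(question) + [EOS] + passage.split() + [EOS]


def build_input_fixed(question: str, passage: str) -> List[str]:
    # Fix pattern: exactly one component is responsible for adding BoS.
    return tokenize_with_bos(question) + [EOS] + passage.split() + [EOS]


def count_leading_bos(tokens: List[str]) -> int:
    """Count how many BoS tokens appear consecutively at the start of the sequence."""
    n = 0
    for tok in tokens:
        if tok != BOS:
            break
        n += 1
    return n


if __name__ == "__main__":
    question, passage = "who wrote it", "Ann wrote it"
    print(count_leading_bos(build_input_buggy(question, passage)))
    print(count_leading_bos(build_input_fixed(question, passage)))
```

A check like `count_leading_bos` could be used in a unit test to assert that the training, evaluation, and inference paths all produce the same number of leading BoS tokens.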

Differential Revision: D21287576

@facebook-github-bot added the "CLA Signed" (Do not delete this pull request or issue due to inactivity.) and "fb-exported" labels on Apr 29, 2020
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D21287576

…rch#1343)

Summary:
Pull Request resolved: facebookresearch#1343

SquadForRoBERTaTensorizer adds two BoS tokens to the input sequence during training and evaluation, which differs from inference-time behavior.
This diff fixes that bug.

Reviewed By: hikushalhere

Differential Revision: D21287576

fbshipit-source-id: e1a5ad3b3c621a99e1bbc85035879b2ae31caa28
@facebook-github-bot
Contributor

This pull request has been merged in 77a32ac.

alexeib added a commit to alexeib/pytext that referenced this pull request Oct 19, 2020
…arch#1343)

Summary:
This is the main PR that switches on Hydra functionality in fairseq.

We migrate the "args" object into an OmegaConf "DictConfig" at all legacy entry points.

In addition, this migrates various components from secondary registries (like BPE encoders and tokenizers) to make the migration smoother.

I am going through code that references migrated fairseq components and changing it to inherit from "Legacy*" components instead. Hopefully tests will catch most of this.

Pull Request resolved: fairinternal/fairseq-py#1343

Differential Revision: D23973928

Pulled By: alexeib

fbshipit-source-id: e1eb1b59160dacd87640a5d230d5fba027cdb502
facebook-github-bot pushed a commit that referenced this pull request Oct 20, 2020
Summary:
Pull Request resolved: #1510

This is the main PR that switches on Hydra functionality in fairseq.

We migrate the "args" object into an OmegaConf "DictConfig" at all legacy entry points.

In addition, this migrates various components from secondary registries (like BPE encoders and tokenizers) to make the migration smoother.

I am going through code that references migrated fairseq components and changing it to inherit from "Legacy*" components instead. Hopefully tests will catch most of this.

Pull Request resolved: fairinternal/fairseq-py#1343

Reviewed By: myleott

Differential Revision: D23973928

Pulled By: alexeib

fbshipit-source-id: dd9554981fff51ea75c1ff343874d1d6e61793c9