Transfoxl seq classification #8868

spatil6 · 2020-12-01T10:09:34Z

This PR implements Sequence classification for Transformer XL model
TransfoxlForSequenceClassification uses the last token in order to do the classification, as other causal models (e.g. GPT-1,GPT-2) do.

Fixes #7623 (Partially)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@LysandreJik

sync up

…til6/transformers into transfoxl-seq-classification

LysandreJik

LGTM, thanks a lot @spatil6!

Same as GPT-2, this would benefit from also handling padding on the left; I'll work on this in another PR.

sgugger

Thanks a lot for adding this!

sgugger · 2020-12-01T17:42:48Z

src/transformers/models/transfo_xl/modeling_transfo_xl.py

@@ -632,6 +633,40 @@ class TransfoXLModelOutput(ModelOutput):
    attentions: Optional[Tuple[torch.FloatTensor]] = None


+@dataclass
+class TransfoXLSequenceClassifierOutputWithPast(ModelOutput):


This new class needs to be documented in the .rst with the other TransfoXL-specific outputs.

patrickvonplaten · 2020-12-01T22:29:10Z

src/transformers/models/transfo_xl/modeling_transfo_xl.py

+            batch_size, sequence_length = inputs_embeds.shape[:2]
+
+        assert (
+            self.config.pad_token_id is not None or batch_size == 1


great assert!

patrickvonplaten

Good to merge for me

spatil6 added 4 commits December 1, 2020 15:02

Merge pull request #6 from huggingface/master

09768e6

sync up

Transfoxl sequence classification

8a23579

Transfoxl sequence classification

3a29571

Merge branch 'transfoxl-seq-classification' of https://github.com/spa…

49729fc

…til6/transformers into transfoxl-seq-classification

LysandreJik approved these changes Dec 1, 2020

View reviewed changes

LysandreJik requested review from sgugger and patrickvonplaten December 1, 2020 17:37

sgugger approved these changes Dec 1, 2020

View reviewed changes

patrickvonplaten reviewed Dec 1, 2020

View reviewed changes

patrickvonplaten approved these changes Dec 1, 2020

View reviewed changes

LysandreJik approved these changes Dec 2, 2020

View reviewed changes

LysandreJik merged commit f6b44e6 into huggingface:master Dec 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transfoxl seq classification #8868

Transfoxl seq classification #8868

spatil6 commented Dec 1, 2020

LysandreJik left a comment

sgugger left a comment

sgugger Dec 1, 2020

patrickvonplaten Dec 1, 2020

LysandreJik Dec 2, 2020

patrickvonplaten left a comment

Transfoxl seq classification #8868

Transfoxl seq classification #8868

Conversation

spatil6 commented Dec 1, 2020

Before submitting

Who can review?

LysandreJik left a comment

Choose a reason for hiding this comment

sgugger left a comment

Choose a reason for hiding this comment

sgugger Dec 1, 2020

Choose a reason for hiding this comment

patrickvonplaten Dec 1, 2020

Choose a reason for hiding this comment

LysandreJik Dec 2, 2020

Choose a reason for hiding this comment

patrickvonplaten left a comment

Choose a reason for hiding this comment