-
Notifications
You must be signed in to change notification settings - Fork 26.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[TF Longformer] Add Multiple Choice, Seq Classification Model #6401
Comments
Hi ! |
Awesome, feel free to open an issue :-) |
Hello ! I'm a bit lost here. I've looked at Here is a link to what I've done so far: Groskilled@461ee62 Am I on the right track ? And what am I missing on the tests ? Sorry to ask such simple questions, it's my first time participating in an open source project. |
No worries ;-). This looks alright! Could you open a PR so that we can see your changes directly on the PR? You can checkout this doc to understand how to do PRs: https://github.com/huggingface/transformers/blob/master/CONTRIBUTING.md. Would be great if you can ping me on the PR and then we look together! |
HI @Groskilled and @patrickvonplaten, I have been playing a bit around this issue, as I have some familiarity with Keras/TF2 but no previous experience with transformers, and I was figuring out a way to start familiarising with them. As I am interested in classifying long documents Longformer is of interest to me. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Issue is still open! If stuck, feel free to take a look at the unfinished PR. |
🚀 Feature request
modeling_longformer.py
has the classesLongformerForSequenceClassification
,LongformerForMultipleChoice
andLongformerForTokenClassification
which are not present inmodeling_tf_longformer.py
at the moment.Those classes should be equally added to
modeling_tf_longformer.py
.Motivation
The pretrained weights for TFLongformer are available so that these classes could be used for finetuning.
Your contribution
This issue is a good first issue because it is not too complicated to add these models. One should take a look at
modeling_tf_roberta.py
to see how these models are implemented forTFRoberta
and implement them analogous forTFLongformer
. Please make sure that the docstring is correct and that test are added for each class (again Roberta can serve as an example here, check outtest_modeling_tf_roberta.py
).I am happy to guide interested community contributors through the PR and help them get it merged.
The text was updated successfully, but these errors were encountered: