
Community contribution - BetterTransformer integration for more models! #20372

Closed
6 of 14 tasks
younesbelkada opened this issue Nov 22, 2022 · 80 comments · Fixed by huggingface/optimum#542 · May be fixed by huggingface/optimum#548, huggingface/optimum#907 or huggingface/optimum#1065

Comments

@younesbelkada
Contributor

younesbelkada commented Nov 22, 2022

BetterTransformer integration for more models!

The BetterTransformer API provides faster inference on CPU & GPU through a simple interface!

Models can benefit from very interesting speedups with a one-liner, provided the latest version of PyTorch is installed. A complete guide on how to convert a new model is available in the BetterTransformer documentation!
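For reference, the conversion itself looks like this with optimum (the checkpoint name here is only an example):

```python
from transformers import AutoModel
from optimum.bettertransformer import BetterTransformer

model = AutoModel.from_pretrained("bert-base-uncased")
# the one-liner: swaps supported layers for their PyTorch fastpath equivalents
model = BetterTransformer.transform(model)
```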

Here is a list of models that could potentially be supported; pick one of the architectures below and let's discuss the conversion!

Text models 🖊️ :

Vision models 📷 :

Audio models 🔉 :

Let us also know if there are architectures we missed that could be supported. Note that for the encoder-decoder based models below, we expect to convert the encoder only.

Support for decoder-based models coming soon!

cc @michaelbenayoun @fxmarty

huggingface/optimum#488

@hamishdickson

NotImplementedError: The Better Transformers implementation for the model DebertaV2Model has not been implemented yet. Please open an issue requesting the addition of this model with its BetterTransformer implementation.
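For context, the error above comes from a call along these lines (the checkpoint name is illustrative):

```python
from transformers import AutoModel
from optimum.bettertransformer import BetterTransformer

model = AutoModel.from_pretrained("microsoft/deberta-v2-xlarge")
model = BetterTransformer.transform(model)  # raises the NotImplementedError above
```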

It's not on your list, but would you complain if I did this for DebertaV2Model?

@michaelbenayoun
Member

It is not in the list because DebertaV2 does not have a regular attention mechanism, so it is not possible to use it with BetterTransformer.
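For the curious, here is a very rough, simplified view of why (illustrative tensors, not the actual modeling code):

```python
import torch

# DeBERTa's "disentangled attention" mixes content (c) and relative-position (p)
# projections: content->content, content->position and position->content terms
# are summed, so the score cannot be expressed as the single q @ k^T that the
# BetterTransformer fastpath kernel computes.
q_c, k_c = torch.randn(128, 64), torch.randn(128, 64)  # content projections
q_p, k_p = torch.randn(128, 64), torch.randn(128, 64)  # position projections
scores = q_c @ k_c.T + q_c @ k_p.T + q_p @ k_c.T       # c2c + c2p + p2c
```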

@younesbelkada
Contributor Author

younesbelkada commented Nov 22, 2022

Yes, I second what @michaelbenayoun said; please see the related issue: huggingface/optimum#487

@hamishdickson

Makes a lot of sense - sorry, I should have thought about that a bit harder before posting!

@GenVr

GenVr commented Nov 23, 2022

I noticed that BetterTransformer support for the T5 model has not been implemented yet. Will it be implemented in the future (if possible)? Thanks.

@younesbelkada
Contributor Author

Hi @GenVr
Thanks a lot for your question! Unfortunately, T5 cannot be supported because of the nature of its attention mechanism: T5 adds a relative attention bias to its attention scores, and this is not supported by BetterTransformer.
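A minimal sketch of what T5's attention does differently (simplified, not the actual modeling code):

```python
import torch

# Simplified view of T5 attention: a learned relative-position bias is added
# to the raw attention scores before the softmax. The fused BetterTransformer
# kernel has no input for such a bias, so T5 cannot take the fastpath.
def t5_style_attention(q, k, v, position_bias):
    scores = torch.matmul(q, k.transpose(-1, -2))  # note: T5 also skips 1/sqrt(d)
    scores = scores + position_bias                # <- the unsupported extra term
    weights = torch.softmax(scores, dim=-1)
    return torch.matmul(weights, v)
```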
Thanks!

@RJZauner

Hi :) I would like to work on the implementation for RemBertLayer.

What are the next steps in getting started?

Thank you!

@younesbelkada
Contributor Author

Hey @RJZauner !
Thanks so much for your interest in helping us integrate more models with BetterTransformer!
RemBERT seems to use the same attention mechanism as BERT; the only difference should be in the embedding layer, which is fine for us! So I would say you can go ahead: fork the optimum library, create a new branch, and open a draft PR. Feel free to take inspiration from what has been done in huggingface/optimum#494 and huggingface/optimum#508 to see exactly what needs to be done ;) Ping us (myself, @michaelbenayoun & @fxmarty) whenever you feel you need help!
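If it helps, the general shape of a conversion is roughly this (class and attribute names are an illustrative sketch, not the exact optimum API; see the linked PRs for the real pattern):

```python
import torch

# Illustrative sketch only: a BetterTransformer layer concatenates the original
# layer's separate Q/K/V projections into the single in_proj tensors that the
# PyTorch fastpath (torch.nn.TransformerEncoderLayer) expects.
class RemBertLayerBetterTransformer(torch.nn.Module):
    def __init__(self, layer, config):
        super().__init__()
        attn = layer.attention.self  # BERT-style self-attention module
        self.in_proj_weight = torch.nn.Parameter(
            torch.cat([attn.query.weight, attn.key.weight, attn.value.weight])
        )
        self.in_proj_bias = torch.nn.Parameter(
            torch.cat([attn.query.bias, attn.key.bias, attn.value.bias])
        )
        # ... the out-projection, layer norms and feed-forward weights are
        # collected the same way, and forward() then calls the fused kernel.
```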

@shogohida
Contributor

Hi @younesbelkada, I would like to work on the easiest of the models mentioned above. Which one do you recommend? This might sound a bit weird, but I want to tackle a simple one since I'm not very familiar with these models 🙏

@JuheonChu
Contributor

Hello, I would like to tackle the implementation for TapasLayer.

May I ask what the next steps are to get started?

Thank you for your time.

@michaelbenayoun
Member

Hi @shogohida and @JuheonChu ,

You can read this page to learn how to contribute. You can then open a PR with your code and ask questions there; we will be glad to help!

Also @shogohida, I think they are all similar in terms of difficulty, so do not get blocked on that; maybe choose a model in the modality most familiar to you.

@younesbelkada
Contributor Author

Seconding what @michaelbenayoun said, feel free to check some example PRs, huggingface/optimum#508 or huggingface/optimum#494, for reference!
@shogohida, you can take RocBERT; it actually copies from BERT, so the conversion will be very easy :)

@shogohida
Contributor

Thanks guys for your replies! I will take RocBERT then!

@JuheonChu
Contributor

Thanks @michaelbenayoun ! I will take TapasLayer !

@ravenouse

ravenouse commented Nov 26, 2022

Hi! Thank you so much for opening this issue.

  1. I was implementing RemBERT and had some questions, but then I noticed that @RJZauner is already working on it. I will hold off on that and look forward to seeing RJZauner's implementation!
  2. I will work on mBART.
  3. I also found some dead links and some unclear points on this page. How should I report the problems I found and help solve them?

@blakechi

Hello @younesbelkada,

I would like to take DetrLayer. Nice tutorial btw 😀

@younesbelkada
Contributor Author

Hi @blakechi !
Sure you can take it ;) let me know if you need help opening a PR!

@younesbelkada
Contributor Author

Hi @ravenouse !
Thanks for your help! Yes, you can take MBart ;)
Regarding the dead links, could you open an issue on optimum?
Thanks!

@RJZauner

> Hey @RJZauner !
> Thanks so much for your interest in helping us integrate more models with BetterTransformer! RemBERT seems to use the same attention mechanism as BERT; the only difference should be in the embedding layer [...] Ping us (myself, @michaelbenayoun & @fxmarty) whenever you feel you need help!

Thank you for the info!

@lucaspct

Hello @michaelbenayoun and @younesbelkada !

First time contributing for me :)

I would like to handle the implementation for Speech2Text

What are the first steps? Create a PR?

Thanks in advance.

@JuheonChu
Contributor

> Hello @michaelbenayoun and @younesbelkada !
> First time contributing for me :)
> I would like to handle the implementation for Speech2Text. What are the first steps? Create a PR?

Hello, I am absolutely sure they will give you a better suggestion than mine, but I would like to share that it is good to read CONTRIBUTING.md in the transformers repository.
I read through all of it very carefully and made my first contribution!

@lucaspct

> Hello, I am absolutely sure they will give you a better suggestion than mine, but I would like to share that it is good to read CONTRIBUTING.md in the transformers repository. I read through all of it very carefully and made my first contribution!

Hello @JuheonChu :)

I will definitely have a look at it! Thanks.

@michaelbenayoun
Member

Hi @lucaspct,

Yes, the first step would be to read the guide explaining how to contribute to optimum.bettertransformer, and then to open a PR on Optimum; we will support you there!

@miyu386
Contributor

miyu386 commented Nov 30, 2022

Hi @younesbelkada @michaelbenayoun I'd love to take on the RoFormer model if it isn't claimed yet. Will open a PR after I read through the guide!

@adit299
Contributor

adit299 commented Dec 1, 2022

I would like to take a crack at the ProphetNet encoder if it has not been claimed yet

@younesbelkada
Contributor Author

Thank you very much @miyu386 & @adit299!
Of course, you can give that a try ;) feel free to open a PR on optimum and we'll guide you from there 💪

@younesbelkada
Contributor Author

Hi @HVjay,
Thanks for your interest! I think Detr can be supported, as well as ConditionalDetr, since both seem to use a classic attention mechanism - this is also confirmed by the paper, which states that the method uses classic transformer-based models. Note, however, that only the encoder part can be converted.

Hi @mszsorondo,
Thank you for your message! BLIP has recently been added; the model should support the BetterTransformer integration (vision + text).

@younesbelkada
Contributor Author

Hi @HVjay ,

Actually there is already someone working on Detr, check: huggingface/optimum#684

@JanFidor

JanFidor commented Feb 12, 2023

Hi @younesbelkada, could I pick up RoFormer?

@dewasahu2003
Contributor

@sushmanthreddy, are you still working on Detr? If so, please let us know.

@dewasahu2003
Contributor

@younesbelkada Hi 👋 could I take Speech2Text? 🙂

@y3sar
Contributor

y3sar commented May 4, 2023

@younesbelkada Hello, I would love to contribute to this issue. I am new to contributing to transformers. Can you please tell me which of the model layers are still unclaimed? I would like to take one up :)

@awinml
Contributor

awinml commented May 4, 2023

@younesbelkada I would like to work on Detr.

@mszsorondo Are you still working on it? There has not been any activity on your PR since Jan 8. I can pull from your PR and fix the failing tests.

@dewasahu2003
Contributor

dewasahu2003 commented May 4, 2023

@awinml I actually submitted the PR for the Detr model.

  • I forgot to mention it earlier, sorry buddy
  • you can look for another available model
  • here is the PR

@awinml
Contributor

awinml commented May 4, 2023

@dewasahu2003 No problem.

It's always better to inform the original author and pull from their PR so they get due credit. Hence the question was aimed at @mszsorondo.

@dewasahu2003
Contributor

@younesbelkada Hey 👋

I have submitted the PR adding BetterTransformer support for Detr.
I mentioned you in the PR.
Next time I will keep in mind to ask the PR authors first.

@mobley-trent

Hi, @younesbelkada I'd like to work on ProphetNet 😀

@mszsorondo

> @younesbelkada I would like to work on Detr.
>
> @mszsorondo Are you still working on it? There has not been any activity on your PR since Jan 8. I can pull from your PR and fix the failing tests.

Go for it! Sorry for the delay.

@Jack-Chuang

Jack-Chuang commented May 27, 2023

Hi @younesbelkada, @michaelbenayoun, and @fxmarty,

I would like to work on Speech2TextLayer.

What are the next steps in getting started?

Thank you!

@jucamohedano

Hi! @younesbelkada @michaelbenayoun @fxmarty
I'm interested in adding support for one of the models in the list, though I believe the only model left might be Speech2TextLayer, which has already been claimed by @Jack-Chuang.

@mobley-trent

mobley-trent commented May 28, 2023

Hello @younesbelkada @fxmarty and @michaelbenayoun
I would like to work on the RoFormer layer, since I saw that someone has already worked on ProphetNet. Has the model been claimed?

@RoboTuan

Hello @younesbelkada @fxmarty and @michaelbenayoun
I would love to help you with the integration of more models for BetterTransformer! I'm happy to take whatever is left, since a lot of developers are already contributing to most of the models. Let me know if I can still help with something!

@mohammedElfatihSalah

@younesbelkada is there anything I can help with in this issue?

@deepwilson

@younesbelkada could you please update the original list of pending items?
Or has this project stalled?

@sam-h-bean
Contributor

Is SPLADE possible?

@ghost

ghost commented Oct 6, 2023

Hi @younesbelkada! I'm new to the open-source community but have good experience with torch, transformers, numpy, etc. Can I be assigned the RoFormer task? I'd like to give it a shot!

@adeepbiswas

Hi @younesbelkada,
Can I take up the ProphetNet task? I'm new to open source and might take some time, but I'm eager to try my hand at this.

@younesbelkada
Contributor Author

Hi everyone,
Sorry for the delay in replying to this issue and the community contribution - after some internal discussion, we decided to migrate the BetterTransformer API into transformers core by directly supporting torch.scaled_dot_product_attention in the modeling files. Check out this issue: #26557 for more details and this PR for the PoC: #26572
We may open a community contribution to extend the support to all architectures, but that is not decided yet. I will keep you all posted!
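For the curious, a minimal sketch of what this change looks like inside a modeling file (shapes are illustrative; requires PyTorch >= 2.0):

```python
import torch
import torch.nn.functional as F

# Instead of a hand-written matmul + softmax attention, the modeling code can
# call the fused kernel directly:
q = torch.randn(2, 12, 128, 64)  # (batch, num_heads, seq_len, head_dim)
k = torch.randn(2, 12, 128, 64)
v = torch.randn(2, 12, 128, 64)
out = F.scaled_dot_product_attention(q, k, v)  # dispatches to flash / memory-efficient kernels when available
```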
Thanks again for all your effort and amazing contribution! 🎉

@vu0607

vu0607 commented Jan 24, 2024

Hi @younesbelkada, @michaelbenayoun, and @fxmarty
The model type vision-encoder-decoder is not yet supported with BetterTransformer!
Hope you can support it soon <3
