🌟 New model addition: FNet #12411

cccntu · 2021-06-29T11:53:40Z

🌟 New model addition: FNet

FNet is a highly efficient Transformer-like encoder architecture, wherein the self-attention sublayers have been wholly replaced by standard, unparameterized Fourier Transforms.

I would like to help adding this!

Open source status

the model implementation is available: https://github.com/google-research/google-research/tree/master/f_net
the model weights are available: https://github.com/google-research/google-research/tree/master/f_net
who are the authors: (@ilyaeck @santiontanon) (Not sure, googled the authors' name + github, sorry if it's incorrect)

NielsRogge · 2021-06-29T12:49:23Z

Somebody is already working on this, see #12335

cccntu · 2021-06-29T13:17:28Z

Thanks @NielsRogge , weird that I didn't see it when I searched.

gchhablani · 2021-06-29T13:40:55Z

@cccntu I believe what you want for the JAX/Flax community week is a Flax model. It seems unlikely that I will finish the PR in the next week. Maybe, you can start working on the Flax model parallely?

Or, we can discuss over slack and then try to finish both.

@patil-suraj @patrickvonplaten wdyt? Is it easier to go from PyTorch to Flax? Or it doesn't matter at all? In case PT is needed, I am willing to spend my time next week on this and try to finish it.

cccntu · 2021-06-29T14:19:27Z

@gchhablani Yes! I would love to add the Flax part.
@patil-suraj @patrickvonplaten I have a few questions before I proceed:

There is no license in the original repo, should I email the authors for permission for code and weights?
How much of the original model code should I modify, other than wrapping it in huggingface/transformers classes?
Should we refactor it for better weight alignment with pytorch code e.t.c?

Thanks!

gchhablani · 2021-06-29T14:20:35Z

Great @cccntu! Let's discuss over Slack.

cccntu added the New model label Jun 29, 2021

This was referenced Jun 30, 2021

cookiecutter template for adding flax model #12440

Closed

(WIP) Add FNet with flax template #12454

Closed

gchhablani mentioned this issue Aug 8, 2021

Add FNet #13045

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🌟 New model addition: FNet #12411

🌟 New model addition: FNet #12411

cccntu commented Jun 29, 2021

NielsRogge commented Jun 29, 2021

cccntu commented Jun 29, 2021

gchhablani commented Jun 29, 2021 •

edited

Loading

cccntu commented Jun 29, 2021

gchhablani commented Jun 29, 2021

🌟 New model addition: FNet #12411

🌟 New model addition: FNet #12411

Comments

cccntu commented Jun 29, 2021

🌟 New model addition: FNet

Open source status

NielsRogge commented Jun 29, 2021

cccntu commented Jun 29, 2021

gchhablani commented Jun 29, 2021 • edited Loading

cccntu commented Jun 29, 2021

gchhablani commented Jun 29, 2021

gchhablani commented Jun 29, 2021 •

edited

Loading