[Flax] Addition of FlaxPegasus #13420
Conversation
Hi @patil-suraj and @patrickvonplaten,
You could find the flax version of
Also, Pegasus isn't really intended for QA and classification, so it's okay not to add those heads yet.
Thanks a lot for adding this @bhadreshpsavani , great work!
The PR looks good overall, I've left a few comments below.
Mainly:
- Let's try to use as many `Copied from ...` statements as possible.
- The order of `layer_norm` in `FlaxPegasusEncoder` and `FlaxPegasusDecoder` should be changed. I've left details in the comment.

Let me know if it's not clear or if you need any other help :)
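For context on the `layer_norm` ordering point: Pegasus-style transformer blocks normalize *before* each sub-layer (pre-LN), while BART-style blocks normalize *after* the residual connection (post-LN). The sketch below is a hedged illustration of the two orderings in plain NumPy, not the actual FlaxPegasus code; `sublayer` is a hypothetical stand-in for self-attention or the feed-forward block.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalize over the last (hidden) dimension.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def sublayer(x):
    # Hypothetical stand-in for self-attention or the feed-forward block.
    return x * 0.5

def pre_ln_block(x):
    # Pegasus-style ordering: normalize first, then sub-layer, then residual.
    return x + sublayer(layer_norm(x))

def post_ln_block(x):
    # BART-style ordering: sub-layer and residual first, then normalize.
    return layer_norm(x + sublayer(x))

x = np.random.randn(2, 4, 8)  # (batch, sequence, hidden)
print(pre_ln_block(x).shape)
```

Swapping the two orderings keeps tensor shapes identical, which is why a misplaced `layer_norm` only shows up as a numerical mismatch rather than a shape error.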
Thanks a lot for more or less completing the PR, great job @bhadreshpsavani! It seems like there are some small differences between the PyTorch and Flax models. This could be due to slightly different activation functions or small differences in the position ids. It would be awesome if you could try to debug layer by layer what might be the problem there, @bhadreshpsavani. Another possibility is that there is no real difference and it's just the framework that causes the discrepancy; in that case, we'll just have to accept it and change the tolerance.
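One way to approach this kind of layer-by-layer debugging (a sketch under assumptions, not the project's actual test code) is to run both models with hidden-state outputs enabled, convert each layer's hidden states to NumPy arrays, and report the first layer whose maximum absolute difference exceeds the tolerance. `compare_hidden_states` below is a hypothetical helper operating on such exported arrays:

```python
import numpy as np

def compare_hidden_states(pt_states, flax_states, tol=1e-3):
    """Return per-layer max abs differences between two lists of arrays.

    pt_states / flax_states: hidden states exported from the PyTorch and
    Flax models, one array per layer, in matching order.
    """
    assert len(pt_states) == len(flax_states), "layer counts must match"
    diffs = []
    for i, (pt, fx) in enumerate(zip(pt_states, flax_states)):
        diff = float(np.max(np.abs(np.asarray(pt) - np.asarray(fx))))
        diffs.append(diff)
        status = "OK" if diff <= tol else "MISMATCH"
        print(f"layer {i}: max abs diff = {diff:.2e} [{status}]")
    return diffs

# Toy example: identical states everywhere except the last layer.
a = [np.zeros((1, 4, 8)) for _ in range(3)]
b = [np.zeros((1, 4, 8)) for _ in range(3)]
b[2] = b[2] + 5e-3
diffs = compare_hidden_states(a, b, tol=1e-3)
```

The first layer flagged as MISMATCH localizes where the two implementations start to diverge (e.g. at an activation or the position embeddings); if every layer's difference is small but nonzero, raising the test tolerance is the reasonable fix.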
Sure @patrickvonplaten,
Hi @patil-suraj and @patrickvonplaten,
Thanks a lot for fixing the issues, looks good now. If you could give me access to this branch, I would like to update the slow tests.
Done!
Thanks a lot for adding this model, great work @bhadreshpsavani!
I updated the slow tests and also pushed a couple of Flax checkpoints (`pegasus-large`, `pegasus-xsum`) to the hub. Will also push the remaining official weights later.
@patrickvonplaten do you wanna give it another look?
Awesome!
* added initial files
* fixes pipeline
* fixes style and quality
* fixes doc issue and positional encoding
* fixes layer norm and test
* fixes quality issue
* fixes code quality
* removed extra layer norm
* added layer norm back in encoder and decoder
* added more code copy quality checks
* update tests
* Apply suggestions from code review
* fix import
* fix test

Co-authored-by: patil-suraj <surajp815@gmail.com>
What does this PR do?
This PR adds the Flax implementation of Pegasus (FlaxPegasus).

Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case. link of PR
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
@patrickvonplaten
@patil-suraj