Bug fix in original google-research implementation #50

gulnazaki · 2020-12-22T18:42:17Z

Hey there,

I noticed that a significant bug involving the data_normalizer was recently fixed in the original implementation, in case you haven't seen it yet. It looks like the bug exists here too, since you ported the code.

google-research/google-research@b09ac83

lucidrains · 2020-12-22T19:08:05Z

@gulnazaki haha, I ported over their JAX code, so it should be fine :) That commit is for their new TensorFlow implementation

lucidrains · 2020-12-22T19:09:02Z

@gulnazaki thanks for letting me know!

btw, new follow-up paper for Performer! https://arxiv.org/abs/2012.11346

tldr: sorta-gradient checkpointing along the sequence dimension

gulnazaki · 2020-12-23T21:22:11Z

Oh yes, I'm sorry, I only had a quick look at it. I understand they fixed the TF implementation to match the one in JAX.

Cool paper also, now the sky is the limit 😄

gulnazaki closed this as completed Dec 23, 2020
