Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Routing Transformers / Add Google PG-19 Models #11686

Open
GenTxt opened this issue May 11, 2021 · 2 comments
Open

Routing Transformers / Add Google PG-19 Models #11686

GenTxt opened this issue May 11, 2021 · 2 comments

Comments

@GenTxt
Copy link

GenTxt commented May 11, 2021

🌟 New model addition - Google PG-19 Models

Model description

Model checkpoints finally released as discussed in "Efficient Content-Based Sparse Attention with Routing Transformers'
Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier (https://arxiv.org/abs/2003.05997)

Open source status

Note: These tf2 models require proper conversion to pytorch versions and modifications to scripts to enable training and inference.

@vblagoje
Copy link
Contributor

There is an open-source pytorch implementation already - https://github.com/lucidrains/routing-transformer
Can't we adapt RT @lucidrains wrote to HF?

@GenTxt
Copy link
Author

GenTxt commented Jul 15, 2021 via email

@LysandreJik LysandreJik changed the title Add Google PG-19 Models Routing Transformers / Add Google PG-19 Models Sep 17, 2021
@LysandreJik LysandreJik added this to Code & weights available in New model additions Sep 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
New model additions
Code & weights available
Development

No branches or pull requests

2 participants