You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Model checkpoints finally released as discussed in "Efficient Content-Based Sparse Attention with Routing Transformers'
Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier (https://arxiv.org/abs/2003.05997)
Open source status
[X ] the model implementation is available: (same link as below)
I've checked the repo before and was hoping with the release of the models
this would be possible.
The original models may be tf1 and not tf2 format. This requires a custom
conversion script to pytorch.
Perhaps coders with advanced python skills will show interest in solving
the above issues.
🌟 New model addition - Google PG-19 Models
Model description
Model checkpoints finally released as discussed in "Efficient Content-Based Sparse Attention with Routing Transformers'
Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier (https://arxiv.org/abs/2003.05997)
Open source status
Note: These tf2 models require proper conversion to pytorch versions and modifications to scripts to enable training and inference.
The text was updated successfully, but these errors were encountered: