transformergallery

Implementations of transformer papers written from scratch, without relying on prebuilt, transformer-specific library functions. The purpose of this repository is to understand and benchmark different transformer variants.
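To illustrate what "from scratch" can look like, here is a minimal sketch of single-head scaled dot-product attention in pure Python, using only the standard `math` module. It is not code from this repository: the helper names (`softmax`, `matmul`, `transpose`, `scaled_dot_product_attention`) are hypothetical, and masking, batching, and multiple heads are left out for brevity.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    b_cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]


def scaled_dot_product_attention(q, k, v):
    """Compute softmax(Q K^T / sqrt(d_k)) V for one head, without masking.

    q: (n_queries, d_k), k: (n_keys, d_k), v: (n_keys, d_v).
    Returns the attended values and the attention weights.
    """
    d_k = len(k[0])
    scale = math.sqrt(d_k)
    scores = matmul(q, transpose(k))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


if __name__ == "__main__":
    # Illustrative toy inputs, not benchmark data.
    q = [[1.0, 0.0], [0.0, 1.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[1.0, 2.0], [3.0, 4.0]]
    out, weights = scaled_dot_product_attention(q, k, v)
    print("weights:", weights)
    print("output:", out)
```

Writing the matrix operations by hand is slow compared with a tensor library, but it keeps every step of the computation visible, which suits the goal of understanding a variant before benchmarking it.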