Skip to content

Qingzheng-Wang/ShallowMLA

Repository files navigation

ShallowMLA

The PyTorch implementation of Multi-head Latent Attention.

About

The PyTorch implementation of Multi-head Latent Attention.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages