This repository is based on FasterTransformer adapted to GLM-130B, for FasterTransformer, please read the original project.
forked from NVIDIA/FasterTransformer
-
Notifications
You must be signed in to change notification settings - Fork 13
Transformer related optimization, including BERT, GPT
License
THUDM/FasterTransformer
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Transformer related optimization, including BERT, GPT
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- C++ 64.3%
- Cuda 30.9%
- CMake 2.5%
- Python 1.4%
- Shell 0.8%
- C 0.1%