forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 4
A high-throughput and memory-efficient inference and serving engine for LLMs
License
drikster80/vllm
ErrorLooks like something went wrong!
About
A high-throughput and memory-efficient inference and serving engine for LLMs
Resources
License
Code of conduct
Security policy
Stars
Watchers
Forks
Packages 0
No packages published
Languages
- Python 84.4%
- Cuda 10.3%
- C++ 3.8%
- C 0.6%
- Shell 0.5%
- CMake 0.3%
- Dockerfile 0.1%