Skip to content

analytics-zoo/vllm

Error
Looks like something went wrong!

About

A high-throughput and memory-efficient inference and serving engine for LLMs

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 83.6%
  • Cuda 9.8%
  • C++ 5.0%
  • C 0.6%
  • Shell 0.5%
  • CMake 0.4%
  • Dockerfile 0.1%