RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Updated May 22, 2024 - C++