C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
-
Updated
Jun 8, 2024 - C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Add a description, image, and links to the internlm topic page so that developers can more easily learn about it.
To associate your repository with the internlm topic, visit your repo's landing page and select "manage topics."