C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
-
Updated
Jun 8, 2024 - C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
This project accelerates local deployment of chatglm and vector inference using PyTorch compiled in C++, and includes an OpenAI API Mock script for quick setup of local speed testing services. This setup enhances performance and efficiency, ideal for high-performance applications and development testing.
An spoken English chatbot runs in realtime and offline based on LLM.
Add a description, image, and links to the chatglm3 topic page so that developers can more easily learn about it.
To associate your repository with the chatglm3 topic, visit your repo's landing page and select "manage topics."