An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Updated Jun 4, 2024 - Python
A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF (Reinforcement Learning from Human Feedback) on consumer hardware. Essentially ChatGPT, but built on Vicuna.
Integrates FastChat with LangChain to create a private knowledge base
A speech-to-speech talking bot (in development)