vicuna-7b

Here are 4 public repositories matching this topic...

jackaduma / Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

pytorch llama gpt lora finetune ppo peft vicuna llm chatgpt rlhf reward-models vicuna-7b

Updated May 20, 2024
Python

ZJLAB-AMMI / LLM4RL

Star

A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM

reinforcement-learning interaction ppo llm vicuna-7b vicuna-13b

Updated Aug 22, 2024
Python

AliaXueting / fastchat-Vicuna-Langchain-Modif_KnowledgeBase

Star

fastchat/Integrate Langchain/Create Private Knowledge Base

knowledge-base langchain vicuna-7b fastchat

Updated Aug 18, 2023
Python

Zsbyqx20 / VicunaTalk

Star

A speech-to-speech talking bot (in development)

huggingface-transformers vicuna-7b fastchat

Updated May 13, 2023
Python

Improve this page

Add a description, image, and links to the vicuna-7b topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vicuna-7b topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vicuna-7b

Here are 4 public repositories matching this topic...

jackaduma / Vicuna-LoRA-RLHF-PyTorch

ZJLAB-AMMI / LLM4RL

AliaXueting / fastchat-Vicuna-Langchain-Modif_KnowledgeBase

Zsbyqx20 / VicunaTalk

Improve this page

Add this topic to your repo