sft
Here are 15 public repositories matching this topic...
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
-
Updated
Jun 12, 2024 - Python
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
-
Updated
Dec 12, 2023 - Python
Finetune baichuan pretrained model with QLora method
-
Updated
Jul 13, 2023 - Python
Train expert conversational role-play LLMs with synthetic data
-
Updated
Nov 24, 2023 - Python
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
-
Updated
Feb 28, 2024 - Python
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
-
Updated
Jan 17, 2024 - Python
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
-
Updated
Jun 30, 2023 - Python
chatglm 6b finetuning and alpaca finetuning
-
Updated
Apr 21, 2024 - Python
Improve this page
Add a description, image, and links to the sft topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sft topic, visit your repo's landing page and select "manage topics."