dpo
Here are 41 public repositories matching this topic...
A Deep Learning NLP repository built with TensorFlow, covering everything from text preprocessing to downstream tasks for Topic Models and recent models such as BERT, GPT, and LLMs.
Updated Feb 22, 2024 - Jupyter Notebook
Notus is a collection of LLMs fine-tuned with SFT, DPO, SFT+DPO, or other RLHF techniques, always with a data-first approach
Updated Jan 15, 2024 - Python
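Several of the repositories above use Direct Preference Optimization (DPO) for alignment. As background, the per-pair DPO loss can be sketched in plain Python; the function and argument names below are illustrative, not taken from any listed repository:

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """DPO loss for a single preference pair (illustrative sketch).

    Inputs are summed log-probabilities of the chosen/rejected responses
    under the policy being trained (pi_*) and under the frozen reference
    model (ref_*); beta scales the implicit KL constraint.
    """
    # Implicit reward margin: how much more the policy favors the chosen
    # response over the rejected one, relative to the reference model.
    margin = (pi_chosen - ref_chosen) - (pi_rejected - ref_rejected)
    # Negative log-sigmoid of the scaled margin (a binary logistic loss).
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))
```

The loss decreases as the policy widens the log-probability margin on the chosen response beyond what the reference model assigns, which is the core idea DPO shares with the RLHF objectives mentioned above.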
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Updated Apr 25, 2024 - Python
CodeUltraFeedback for aligning large language models to coding preferences
Updated Mar 17, 2024 - Python
Various training, inference, and validation code and results for open LLMs that were fully or partially pretrained on the Dutch language.
Updated Apr 9, 2024 - Jupyter Notebook
A Laravel package to simplify using the DPO Payment API in your application. https://dpogroup.com
Updated Sep 8, 2023 - PHP
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
Updated Apr 23, 2024 - Python
An open-source framework designed to adapt pre-trained Large Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.
Updated Apr 25, 2024 - Python
This is the DPO Pay plugin for WooCommerce.
Updated Jan 4, 2024 - PHP
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
Updated Dec 12, 2023 - Python
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
Updated Apr 23, 2024 - Python