human-feedback

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

machine-translation llama lora contrastive gpt-4 chatgpt human-feedback instruction-tuning bloomz error-guided

Updated Dec 31, 2024
Python

xrsrke / instructGOOSE

Star

Implementation of Reinforcement Learning from Human Feedback (RLHF)

reinforcement-learning chatgpt human-feedback rlhf instructgpt

Updated Apr 7, 2023
Jupyter Notebook

trubrics / trubrics-python

Star

Product analytics for AI Assistants

machine-learning mlops streamlit ml-monitoring llm human-feedback llmops model-feedback

Updated Mar 12, 2025
Python

PKU-Alignment / beavertails

Star

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

safety llama gpt datasets language-model beaver ai-safety human-feedback-data llm llms human-feedback rlhf large-language-model safe-rlhf

Updated Oct 27, 2023
Makefile

HannahKirk / prism-alignment

Star

The Prism Alignment Project

dataset alignment multicultural sociotechnical human-feedback-data human-feedback

Updated Apr 25, 2024
Jupyter Notebook

davidberenstein1957 / dataset-viber

Star

Dataset Viber is your chill repo for data collection, annotation and vibe checks.

evaluation data-collection data-quality human-feedback

Updated Sep 5, 2024
Python

ZhenbangDu / Reliable_AD

Star

[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback

image-generation advertising datasets diffusion diffusion-models diffusers human-feedback rlhf eccv2024

Updated Nov 8, 2024
Python

gao-g / prelude

Star

Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".

transformers alignment user-feedback edits interpretability preference-learning gpt4 llm llms human-feedback

Updated Nov 23, 2024
Python

ZiyiZhang27 / tdpo

Star

[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"

reinforcement-learning alignment text-to-image diffusion-models stable-diffusion human-feedback rlhf

Updated Jul 12, 2024
Python

AlaaLab / pathologist-in-the-loop

Star

[ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"

synthetic-data human-feedback rlhf pathology-images

Updated Oct 19, 2023
Python

victor-iyi / rlhf-trl

Star

Reinforcement Learning from Human Feedback with 🤗 TRL

reinforcment-learning human-feedback rlhf

Updated Jun 14, 2023
Python

wang8740 / MAP

Star

Documentation at

finetuning llm human-feedback rlhf human-value-alignment multi-objective-alignment

Updated Oct 30, 2024
Python

JacqueWill / SEO_HIF_JS

Star

Search Engine Optimization using Human Implicit Feedback

machine-learning seo-optimization data-privacy implicit-feedback edge-computing search-ranking human-feedback

Updated Apr 7, 2023
JavaScript

cluebbers / dpo-rlhf-paraphrase-types

Star

Enhancing paraphrase-type generation using Direct Preference Optimization (DPO) and Reinforcement Learning from Human Feedback (RLHF), with large-scale HPC support. This project aligns model outputs to human-ranked data for robust, safety-focused NLP.

reinforcement-learning deep-learning transformers alignment paraphrase-generation human-feedback direct-preference-optimization paraphrase-type-generation

Updated Feb 3, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the human-feedback topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the human-feedback topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

human-feedback

Here are 19 public repositories matching this topic...

lucidrains / PaLM-rlhf-pytorch

opendilab / awesome-RLHF

conceptofmind / LaMDA-rlhf-pytorch

huggingface / data-is-better-together

yk7333 / d3po

wxjiao / ParroT

xrsrke / instructGOOSE

trubrics / trubrics-python

PKU-Alignment / beavertails

HannahKirk / prism-alignment

davidberenstein1957 / dataset-viber

ZhenbangDu / Reliable_AD

gao-g / prelude

ZiyiZhang27 / tdpo

AlaaLab / pathologist-in-the-loop

victor-iyi / rlhf-trl

wang8740 / MAP

JacqueWill / SEO_HIF_JS

cluebbers / dpo-rlhf-paraphrase-types

Improve this page

Add this topic to your repo