[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Dataset collection and preprocessing framework for NLP extreme multitask learning
Efficient LLM inference on Slurm clusters using vLLM.
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
A comprehensive collection of work on learning from rewards in the post-training and test-time scaling of LLMs, covering both reward models and learning strategies across the training, inference, and post-inference stages.
An easy Python package for running quick, basic QA evaluations. It includes standardized QA and semantic evaluation metrics: black-box and open-source large language model prompting and evaluation, exact match, F1 score, PEDANT semantic match, and transformer match. The package also supports prompting the OpenAI and Anthropic APIs.
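As a rough illustration of the first two metrics this kind of package exposes, the sketch below implements exact match and token-level F1 in the style of SQuAD-era evaluation scripts. It is a generic example, not the package's actual API; the function names here are hypothetical.

```python
# Minimal sketch of two standard QA metrics: exact match and token-level F1.
# Generic illustration only; a real package would wrap these behind its own API.
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in set(string.punctuation))
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))

def f1_score(prediction: str, reference: str) -> float:
    """Token-overlap F1 between normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    common = Counter(pred_tokens) & Counter(ref_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("The Eiffel Tower", "eiffel tower"))   # 1.0
print(round(f1_score("in Paris, France", "Paris"), 2))   # 0.5
```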
Learning to route instances for Human vs AI Feedback (ACL 2025 Main)
[ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
RewardAnything: Generalizable Principle-Following Reward Models
Source code of our paper "Transferring Textual Preferences to Vision-Language Understanding through Model Merging", ACL 2025
Code for SFT and RL
Building an LLM with RLHF: fine-tuning on human-labeled preferences. Based on "Learning to Summarize from Human Feedback", it combines supervised fine-tuning, reward modeling, and PPO to improve response quality and alignment.
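As a rough illustration of the reward-modeling step in such an RLHF pipeline, the sketch below trains a tiny pairwise (Bradley-Terry style) reward model on preference pairs. Everything here (the embedding size, random "response features", and the `RewardHead` module) is a toy placeholder, not the repository's actual setup; a real reward model would score tokenized responses with a language-model backbone.

```python
# Toy sketch of pairwise (Bradley-Terry) reward-model training as used in RLHF.
# All shapes and data are placeholders; real setups score LM responses instead.
import torch
import torch.nn as nn

class RewardHead(nn.Module):
    """Maps a fixed-size response representation to a scalar reward."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, dim), nn.Tanh(), nn.Linear(dim, 1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).squeeze(-1)

model = RewardHead()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Fake preference data: feature vectors for the chosen and rejected responses.
chosen = torch.randn(256, 64)
rejected = torch.randn(256, 64)

for step in range(100):
    r_chosen, r_rejected = model(chosen), model(rejected)
    # Pairwise logistic loss: push the chosen reward above the rejected one.
    loss = -torch.nn.functional.logsigmoid(r_chosen - r_rejected).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

print(f"final pairwise loss: {loss.item():.3f}")
```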
Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"
Official Repository for RELIC: Enhancing Reward Model Generalization for Low-Resource Indic Languages with Few-Shot Examples