Clean PyTorch implementations of imitation and reward learning algorithms
-
Updated
Jan 7, 2025 - Python
Clean PyTorch implementations of imitation and reward learning algorithms
A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-inference stages.
Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)
Experiments in applying interpretability techniques to learned reward functions.
A repo for Implemented online preference-based reward learning under human irrationality & delayed feedback
Version of the PST for DIVA, implemented in E-Prime.
Add a description, image, and links to the reward-learning topic page so that developers can more easily learn about it.
To associate your repository with the reward-learning topic, visit your repo's landing page and select "manage topics."