srzer

Follow

Ruizhe Shi srzer

Follow

PhD @ UW-CSE. Undergraduate @ THU-IIIS. 不停留的岁月中找到满足

48 followers · 37 following

University of Washington
Seattle
16:59 (UTC -07:00)
https://srzer.github.io

Achievements

Achievements

Highlights

Pro

Pinned Loading

Gap-in-Preference-Learning Gap-in-Preference-Learning Public

Official code for "Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO".

Python 3
Samplers-in-Online-DPO Samplers-in-Online-DPO Public

Official code for "The Crucial Role of Samplers in Online Direct Preference Optimization".

Python 7
MOD MOD Public

Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".

Python 25 3
LaMo-2023 LaMo-2023 Public

Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".

Python 53 9