yihedeng9

Follow

Yihe Deng yihedeng9

Follow

Personal website: https://yihe-deng.notion.site/Yihe-Deng-167ab2d2c1fb80b3a76dfb120f716c84

40 followers · 5 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

rlhf-summary-notes Public

A brief and partial summary of RLHF algorithms.

127 3
OpenVLThinker Public

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

Python 32 1
DuoGuard Public

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Python 17 2
STIC Public

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 65 4
uclaml/SPIN Public

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1.1k 99
uclaml/PDE Public

Official repo of Progressive Data Expansion: data, code and evaluation

Jupyter Notebook 28 1