lkevinzc

Follow

🎯

Learning

zclzc lkevinzc

🎯

Learning

Follow

NUS PhD student working on RL @sail-sg

75 followers · 162 following

Achievements

Achievements

Organizations

Pinned Loading

mosecorg/mosec Public

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Python 834 63
sail-sg/oat Public

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 299 16
sail-sg/understand-r1-zero Public

Understanding R1-Zero-Like Training: A Critical Perspective

Python 733 31
sail-sg/oat-zero Public

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 217 10
sail-sg/dice Public

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

Python 43 3
sail-sg/rosmo Public

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

Python 28