My primary work and research focus on advancing models toward achieving AGI. Thus, I concentrate on the following key areas: (1) scaling pre-training and post-training frameworks, (2) improving efficiency, and (3) optimizing inference/agent engines to facilitate the rapid iterative development of foundation models. (Personal Website, Google Scholar).
- Support RL Framework: slime-ROCm version, verl-ROCm version
- Agent Framework: AgentLaboratory, AgentVerse, ChatDev
- Efficient Training: Prompt-Transferability
- 💬 Personal Website: https://yushengsu-thu.github.io/
- 📫 E-mail: yushengsu.thu@gmail.com