Focusing
CS PhD Student @ UCAS | Reinforcement Learning | Research Intern @zai-org | Ex-Intern @ LiAuto @SenseTime @ ZeronTruck.com
- University of Chinese Academy of Sciences
- Beijing, China
- UTC +08:00
- sdpkjc.me
- https://orcid.org/0000-0001-9842-4706
- @sdpkjc_adam
Pinned
- vwxyzjn/cleanrl (Public): High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
- xlang-ai/OSWorld (Public): [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
- sgl-project/sglang (Public): SGLang is a fast serving framework for large language models and vision language models.