HaoshengZou

Follow

Haosheng Zou (邹昊晟) HaoshengZou

Follow

LLM alignment@360, prev.@miHoYo & 4Paradigm. PhD@THU, advised by Prof. Jun Zhu.

36 followers · 5 following

360
Beijing
https://scholar.google.com/citations?user=zrqzQswAAAAJ

Achievements

Achievements

Block or Report

Block or report HaoshengZou

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

minGPT24 minGPT24 Public

Python
reversi-alpha-zero reversi-alpha-zero Public

Forked from mokemokechicken/reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

Python
tianshou tianshou Public

Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning platform.

Python
trl trl Public

Python
schroederdewitt/multiagent_mujoco schroederdewitt/multiagent_mujoco Public

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Python 321 34
taufikxu/youtube taufikxu/youtube Public archive

Youtube-8M challenge on Kaggle

Python 3 2