Skip to content
View sylvain-wei's full-sized avatar
😎
Hahaha
😎
Hahaha
  • Peking University
  • Beijing
  • 14:51 (UTC +08:00)

Highlights

  • Pro

Block or report sylvain-wei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sylvain-wei/README.md

Hi Bro👋

🧑🏻‍💻 Brief Intro

I am Shaohang WEI, insterested in Reasoning in LLMs, Post-training, and Interpretability of LLMs. Strive to seek ways to make LLMs generalize in reasoning ability.

🤝🏻  Contact Me

If you are interested in any aspect of me, please feel free to reach out to me!

Email: shaohang[at]stu.pku.edu.cn

Pinned Loading

  1. TIME TIME Public

    TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario

    Python 7

  2. 24-Game-Reasoning 24-Game-Reasoning Public

    超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of DeepSeek R1-Zero, DeepSeek R1

    Python 21 1

  3. Self-training-Paper-List Self-training-Paper-List Public

    Self-training relies on the model's internal features, output layer probability distributions, and other data to construct supervisory signals. It selects essential key data and designs more reason…

    2

  4. NightingaleCen/LeafyLingo NightingaleCen/LeafyLingo Public

    Python

  5. NightingaleCen/RL-MalmoPlayground NightingaleCen/RL-MalmoPlayground Public

    Python 2

  6. Natural-Language-Processing-2022Fall Natural-Language-Processing-2022Fall Public

    This repository is about the course projects on Natural Language Processing, 2022 Fall, which is provided by the Institute of Artificial Intelligence, Beihang University.

    Python