WhenWen/README.md

Hello! I am Kaiyue Wen, a senior undergraduate student in Yao's pilot class, studying computer science and engineering at Tsinghua University. Here are my CV and Publications.

My research interests span machine learning broadly, covering both theory and applications. I focus on language models, studying both their macroscopic and microscopic attributes.

  1. Macroscopic Level. I am interested in making better use of large language models, including but not limited to improving their interpretability, controllability, and reasoning ability, by building systems around LLMs guided by first-principles analysis and theoretical thinking.
  2. Microscopic Level. I am interested in understanding the training dynamics of large language models, including but not limited to the generalization ability, implicit bias, and optimization dynamics of pretraining, through theoretical analysis and empirical study.

I believe that the two levels are closely related and mutually beneficial. I am applying for a PhD position starting in 2024. Please contact me by email if you are interested in my research!

Recent News

One More Thing

I firmly believe in analytical thinking, hard work, and consistent self-improvement. Any advice or feedback is welcome. You can use this Anonymous Form or talk with me in person.

Pinned

  1. WhenWen.github.io

    JavaScript 2

  2. THU-KEG/Skill-Neuron

    Source code for the EMNLP 2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".

    Python 16 4

  3. Solving-LPN-using-Neural-Networks

    This repository is the official codebase for the paper "Practically Solving LPN in High Noise Regimes Faster Using Neural Networks".

    Python 4