Linear95/README.md

Hi there 👋

I am Pengyu Cheng, a researcher in NLP and ML. Here are some facts about me:

  • I am currently at Tencent AI Lab, primarily working on LLM training, AI agents, and dialogue systems.
  • I have experience in research and projects on controllable generation, interpretability, and fairness in NLP.
  • I am also interested in probabilistic and information-theoretic machine learning methods.
  • I received my Ph.D. degree from Duke University in 2021, advised by Dr. Lawrence Carin.
  • I graduated from the Department of Mathematical Sciences at Tsinghua University in 2017, advised by Dr. Jiwen Lu.

Pinned

  1. CLUB Public

    Code for the ICML 2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

    Jupyter Notebook 281 38

  2. SPAG Public

    Self-playing Adversarial Language Game Enhances LLM Reasoning

    Python 50 3

  3. APO Public

    Implementation of Adversarial Preference Optimization (APO)

    Python 39 2

  4. DSP Public

    Domain-specific preference (DSP) data and customized RM fine-tuning.

    Python 25 3

  5. RLM Public

    Code for the paper - Replacing Language Model for Style Transfer

    Python 3

  6. bert-intent-slot-detector Public

    BERT-based intent and slot detector for chatbots.

    Python 71 8