Liang-Qiu

Pinned

  1. ScienceQA Public

    Forked from lupantech/ScienceQA

    Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

    Python

  2. PromptPG Public

    Forked from lupantech/PromptPG

    Data and code for the paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".

    Python

  3. CHATS-lab/KokoMind Public

    KokoMind: Can LLMs Understand Social Interactions?

    JavaScript, 104 stars, 8 forks

  4. WebAgent-R1 Public

    Forked from weizhepei/WebAgent-R1

    WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

  5. ValueNet Public

    HTML 2

  6. DFT Public

    Forked from Optimization-AI/DFT

    Discriminative Fine-tuning of LLMs without reward models and human preference data

    Python