Wuhan University
- Wuhan, China
- https://gknl.github.io
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
Chain of Thought (CoT) is so hot! And so long! We need shorter reasoning processes!
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.
📜 Paper list on decoding methods for LLMs and LVLMs
Training Large Language Model to Reason in a Continuous Latent Space
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework
Official Repo for Open-Reasoner-Zero
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)
An Open Large Reasoning Model for Real-World Solutions
Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.
The related works and background techniques about Openai o1
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
Paper reproduction of Google's SCoRe (Training Language Models to Self-Correct via Reinforcement Learning)
This is my attempt to create a self-correcting LLM, based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by Google
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…
Code for paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"
Stanford NLP Python library for Representation Finetuning (ReFT)
Enhancing contextual understanding in large language models through contrastive decoding
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misinformation", accepted by AI Magazine 2024