Skip to content
View GKNL's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report GKNL

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 16 1 Updated Nov 3, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,856 584 Updated Mar 29, 2025

[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy

Python 56 3 Updated Dec 15, 2024

Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!

43 1 Updated Mar 12, 2025

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

171 4 Updated Mar 26, 2025

📜 Paper list on decoding methods for LLMs and LVLMs

33 Updated Jan 3, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,015 91 Updated Jan 24, 2025

✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork

Python 177 10 Updated Mar 24, 2025

Official Repo for Open-Reasoner-Zero

Python 1,687 80 Updated Mar 5, 2025
Python 1 1 Updated Dec 6, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

540 54 Updated Mar 18, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,966 586 Updated Mar 27, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,737 132 Updated Jan 17, 2025

Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)

Python 53 4 Updated Aug 8, 2024

An Open Large Reasoning Model for Real-World Solutions

Python 1,477 76 Updated Mar 4, 2025

Code for Quiet-STaR

Python 721 89 Updated Aug 21, 2024

Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.

230 11 Updated Dec 8, 2024

The related works and background techniques about Openai o1

217 9 Updated Jan 7, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 357 14 Updated Jan 19, 2025

The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

162 2 Updated Oct 28, 2024

Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 137 22 Updated Sep 21, 2024

This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by google

Python 32 6 Updated Dec 29, 2024

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,105 172 Updated Mar 28, 2025

Code for paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"

Python 12 2 Updated Oct 14, 2024

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,451 125 Updated Feb 6, 2025

Enhancing contextual understanding in large language models through contrastive decoding

Python 17 1 Updated May 3, 2024

[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"

Python 67 3 Updated Apr 12, 2024

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"

110 5 Updated Sep 21, 2024

Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misinformation", accepted by AI Magazine 2024

98 9 Updated Nov 9, 2024
Next
Showing results