reasoning-models

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Apr 1, 2025
Python

czg1225 / VeriThinker

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

efficiency fine-tuning large-language-models reasoning-models deepseek-r1-distill-llama deepseek-r1-distill-qwen

Updated May 29, 2025
Python

DolbyUUU / DeepEnlighten

Pure RL post-training of base models for social reasoning. Lightweight replication of DeepSeek-R1-Zero on the Social IQa dataset.

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Mar 16, 2025
Python

fscdc / ReasonMap

[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

reasoning multimodal-large-language-models reasoning-models efficient-reasoning

Updated May 16, 2025
Python

UKPLab / acl2025-diverse-cot

Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

cot lrm chain-of-thought large-reasoning-models reasoning-models

Updated Jun 20, 2025
Python

microsoft / BUILD25-LAB333

This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions

python openai model-catalog azure-ai-foundry reasoning-models

Updated May 21, 2025
Jupyter Notebook

intellectronica / generative-learning

Using a reasoning LLM to learn a prompt from data

ai ml gemini-api llm reasoning-models

Updated May 5, 2025
Jupyter Notebook

yongchao98 / R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning

reinforcement-learning planning large-language-models code-interpreter reasoning-models

Updated Jun 3, 2025
Python

mrorigo / agentic-deep-graph-reasoning

Agentic Deep Graph Reasoning Implementation

knowledge-graph knowledge-distillation entity-extraction ai-learning reasoning-models

Updated Mar 4, 2025
Python

AbhaySingh71 / AI-Lawyer-RAG-with-Deepseek

AI Lawyer is a reasoning-capable legal assistant powered by DeepSeek, Ollama RAG, and LangChain, designed to streamline legal research and document analysis. Using retrieval-augmented generation (RAG), it provides precise legal insights and contract summarization. A Streamlit-based UI lets users upload and analyze legal documents.

chatbot huggingface streamlit vector-database legal-analytics-and-data-science generative-ai langchain llm-agent retrieval-augmented-generation ollama faiss-vector-database groqapi ollamaembeddings reasoning-models deepseek-r1

Updated May 4, 2025
Python

sinanuozdemir / oreilly-agi

Explore the evolution of AGI through historical context, reasoning models, and agent systems, while gaining hands-on experience with cutting-edge models like Claude 4, DeepSeek-R1, and OpenAI's o3. Learn to critically evaluate AGI benchmarks, understand their limitations, and identify where current models excel or struggle in reasoning tasks.

agi agents ai-agents artifical-general-inteligence reasoning-models

Updated Jun 20, 2025
Jupyter Notebook

sshh12 / state-sandbox

State Sandbox is an experimental game for socioeconomic simulation. It uses Large Language Models (o3-mini) to simulate the world and complex policy impacts.

civilization ai-games o1 socioeconomics nation-states reasoning-models o3-mini

Updated Feb 4, 2025
JavaScript

reasoning-models

Here are 35 public repositories matching this topic...

zilliztech / deep-searcher

MiniMax-AI / MiniMax-M1

Zefan-Cai / R-KV

UCSC-VLAA / MedReason

eric-ai-lab / Soft-Thinking

hao-ai-lab / Dynasor

IAAR-Shanghai / xVerify

codelion / pts

DolbyUUU / Logic-RL-Lite

czg1225 / VeriThinker

DolbyUUU / DeepEnlighten

fscdc / ReasonMap

UKPLab / acl2025-diverse-cot

microsoft / BUILD25-LAB333

intellectronica / generative-learning

yongchao98 / R1-Code-Interpreter

mrorigo / agentic-deep-graph-reasoning

AbhaySingh71 / AI-Lawyer-RAG-with-Deepseek

sinanuozdemir / oreilly-agi

sshh12 / state-sandbox
