reasoning-models

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Apr 1, 2025
Python

czg1225 / VeriThinker

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

efficiency fine-tuning large-language-models reasoning-models deepseek-r1-distill-llama deepseek-r1-distill-qwen

Updated May 29, 2025
Python

DolbyUUU / DeepEnlighten

Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Mar 16, 2025
Python

fscdc / ReasonMap

[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

reasoning multimodal-large-language-models reasoning-models efficient-reasoning

Updated May 16, 2025
Python

UKPLab / acl2025-diverse-cot

Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

cot lrm chain-of-thought large-reasoning-models reasoning-models

Updated Jun 2, 2025
Python

microsoft / BUILD25-LAB333

This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions

python openai model-catalog azure-ai-foundry reasoning-models

Updated May 21, 2025
Jupyter Notebook

intellectronica / generative-learning

Using a reasoning LLM to learn a prompt from data

ai ml gemini-api llm reasoning-models

Updated May 5, 2025
Jupyter Notebook

yongchao98 / R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning

reinforcement-learning planning large-language-models code-interpreter reasoning-models

Updated Jun 3, 2025
Python

mrorigo / agentic-deep-graph-reasoning

Agentic Deep Graph Reasoning Implementation

knowledge-graph knowledge-distillation entity-extraction ai-learning reasoning-models

Updated Mar 4, 2025
Python

AbhaySingh71 / AI-Lawyer-RAG-with-Deepseek

AI Lawyer is a reasoning legal assistant powered by DeepSeek, Ollama RAG, and LangChain, designed to streamline legal research and document analysis. Using retrieval-augmented generation (RAG), it provides legal insights and contract summarization. Its Streamlit-based UI lets users analyze legal documents.

chatbot huggingface streamlit vector-database legal-analytics-and-data-science generative-ai langchain llm-agent retrieval-augmented-generation ollama faiss-vector-database groqapi ollamaembeddings reasoning-models deepseek-r1

Updated May 4, 2025
Python

sshh12 / state-sandbox

State Sandbox is an experimental game for socioeconomic simulation. It uses Large Language Models (o3-mini) to simulate the world and complex policy impacts.

civilization ai-games o1 socioeconomics nation-states reasoning-models o3-mini

Updated Feb 4, 2025
JavaScript

dialexity / dialectical-framework

Turn stories, strategies, or systems into insight. Auto-generate Dialectical Wheels (DWs) from any text to reveal blind spots, surface polarities, and trace dynamic paths toward synthesis. DWs are semantic maps that expose tension, transformation, and coherence within a system—whether narrative, ethical, organizational, or technological.

ai agi semantic-analysis reasoning dialectics dialectic reasoning-agent reasoning-on-graph reasoning-algoritm reasoning-engine reasoning-language-models reasoning-models

Updated Jun 17, 2025
Python

reasoning-models

Here are 33 public repositories matching this topic...
