Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
-
Updated
Jun 18, 2025 - Python
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
Simple extension on vLLM to help you speed up reasoning model without training.
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Pivotal Token Search
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps
Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions
Using a reasoning LLM to learn a prompt from data
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
Agentic Deep Graph Reasoning Implementation
AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek , Ollama RAG and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented generation (RAG), it provides precise legal insights, and contract summarization. With an intuitive Streamlit-based UI, analyze legal documents.
Explore the evolution of AGI through historical context, reasoning models, and agent systems, while gaining hands-on experience with cutting-edge models like Claude 4, DeepSeek-R1, and OpenAI's o3. Learn to critically evaluate AGI benchmarks, understand their limitations, and identify where current models excel or struggle in reasoning tasks.
State Sandbox is an experimental game for socioeconomic simulation. It uses Large Language Models (o3-mini) to simulate the world and complex policy impacts.
Add a description, image, and links to the reasoning-models topic page so that developers can more easily learn about it.
To associate your repository with the reasoning-models topic, visit your repo's landing page and select "manage topics."