Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
-
Updated
Jun 18, 2025 - Python
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
Simple extension on vLLM to help you speed up reasoning model without training.
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Pivotal Token Search
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps
Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions
Using a reasoning LLM to learn a prompt from data
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
Agentic Deep Graph Reasoning Implementation
AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek , Ollama RAG and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented generation (RAG), it provides precise legal insights, and contract summarization. With an intuitive Streamlit-based UI, analyze legal documents.
State Sandbox is an experimental game for socioeconomic simulation. It uses Large Language Models (o3-mini) to simulate the world and complex policy impacts.
Turn stories, strategies, or systems into insight. Auto-generate Dialectical Wheels (DWs) from any text to reveal blind spots, surface polarities, and trace dynamic paths toward synthesis. DWs are semantic maps that expose tension, transformation, and coherence within a system—whether narrative, ethical, organizational, or technological.
Add a description, image, and links to the reasoning-models topic page so that developers can more easily learn about it.
To associate your repository with the reasoning-models topic, visit your repo's landing page and select "manage topics."