multi-step-reasoning

Here are 14 public repositories matching this topic...

StonyBrookNLP / ircot

Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23

question-answering multi-step-reasoning large-language-models chain-of-thought multi-step-retrieval retrieval-augmented-qa

Updated Jun 12, 2024
Jsonnet

mukhal / GRACE

Star

Discriminator-Guided Chain-of-Thought Reasoning

decoding text-generation language-model reasoning symbolic-reasoning mathematical-reasoning multi-step-reasoning llm chain-of-thought

Updated Oct 11, 2024
Python

TianduoWang / MsAT

Star

[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707

multi-step-reasoning math-word-problem acl2023

Updated Jun 7, 2023
Python

versionHQ / multi-agent-system

Sponsor

Star

Autonomous agent networks for task automation that requires multi-step reasoning

orchestration-framework python3 networkx graph-theory matplotlib multi-agent-systems autonomous-agents pygraphviz self-directed-learning rag pydantic multi-step-reasoning langchain litellm agentic-ai composiotool mem0ai docling

Updated Jun 1, 2025
Python

Strong-AI-Lab / A-Neural-Symbolic-Paradigm

Star

From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm

natural-language-processing deep-learning transformer deductive-reasoning soft-reasoning symbolic-logic-reasoning neural-symbolic-paradigm multi-step-reasoning gate-attention

Updated Jul 18, 2022
Python

wzy6642 / PRP

Star

Official implementation for "Get an A in Math: Progressive Rectification Prompting" (AAAI 2024)

verification rectification iterative multi-step-reasoning gpt-35-turbo math-word-problem-solving zero-shot-prompting

Updated Mar 18, 2024
Python

HarshTrivedi / DecomP-ODQA

Star

Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23

question-answering multi-step-reasoning large-language-models chain-of-thought retrieval-augmented-qa

Updated Jul 28, 2023
Jsonnet

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀

ai artificial-intelligence llama thinking mlx mathematical-reasoning apple-silicon multi-step-reasoning llm chain-of-thought rlhf deepseek-r1 grpo reasoning-ai

Updated Jun 19, 2025
Python

Strong-AI-Lab / Multi-Step-Deductive-Reasoning-Over-Natural-Language

Star

Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

deductive-reasoning multi-step-reasoning gate-attention out-of-distribution-generalisation

Updated Sep 22, 2023
Python

LakshitaS / Agentic-RAG-implementation

Star

Implementation of "Building Agentic RAG with LlamaIndex" offered by DeepLearning.AI focusing on developing intelligent research agents using the Retrieval-Augmented Generation (RAG) framework.

rag multi-step-reasoning agentic-workflow tool-calling router-query-engine

Updated Jun 25, 2024
Jupyter Notebook

Strong-AI-Lab / PARARULE-Plus

Star

PARARULE Plus: A Larger Deep Multi-Step Reasoning Dataset over Natural Language

natural-language-generation reasoning natural-language-understanding symbolic-logic soft-reasoning multi-step-reasoning

Updated Sep 22, 2023
Python

pritamqu / VCRBench

Star

VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

benchmark video reasoning multi-step-reasoning causal-reasoning multimodal-large-language-models large-multimodal-models large-video-language-models

Updated May 14, 2025
Python

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

Star

The course teaches how to fine-tune LLMs using Group Relative Policy Optimization (GRPO)—a reinforcement learning method that improves model reasoning with minimal data. Learn RFT concepts, reward design, LLM-as-a-judge evaluation, and deploy jobs on the Predibase platform.

reinforcement-learning machine-learning-algorithms language-model reward-design rft ai-training deeplearning-ai-courses ai-optimization multi-step-reasoning ai-evaluation rlhf llm-fine-tuning opensource-ai llm-as-judge predibase grpo llm-development token-level-control

Updated Jun 13, 2025
Jupyter Notebook

ahmedmhussein111 / mlx-grpo

Star

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

ai llama thinking mlx mathematical-reasoning apple-silicon multi-step-reasoning llm chain-of-thought rlhf deepseek-r1 grpo reasoning-ai

Updated Jun 18, 2025
Python

Improve this page

Add a description, image, and links to the multi-step-reasoning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-step-reasoning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi-step-reasoning

Here are 14 public repositories matching this topic...

StonyBrookNLP / ircot

mukhal / GRACE

TianduoWang / MsAT

versionHQ / multi-agent-system

Strong-AI-Lab / A-Neural-Symbolic-Paradigm

wzy6642 / PRP

HarshTrivedi / DecomP-ODQA

adeelahmad / mlx-grpo

Strong-AI-Lab / Multi-Step-Deductive-Reasoning-Over-Natural-Language

LakshitaS / Agentic-RAG-implementation

Strong-AI-Lab / PARARULE-Plus

pritamqu / VCRBench

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

ahmedmhussein111 / mlx-grpo

Improve this page

Add this topic to your repo