GitHub topics: multi-step-reasoning
adeelahmad/mlx-grpo
🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀
Language: Python - Size: 85.9 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9 - Forks: 1

ahmedmhussein111/mlx-grpo
MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟
Language: Python - Size: 87.9 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

pritamqu/VCRBench
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

versionHQ/multi-agent-system
Autonomous agent networks for task automation that requires multi-step reasoning
Language: Python - Size: 3.58 MB - Last synced at: 25 days ago - Pushed at: 2 months ago - Stars: 18 - Forks: 3

TianduoWang/MsAT
[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707
Language: Python - Size: 3.78 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 0

LakshitaS/Agentic-RAG-implementation
Implementation of the "Building Agentic RAG with LlamaIndex" course offered by DeepLearning.AI, focused on building intelligent research agents with Retrieval-Augmented Generation (RAG) using LlamaIndex.
Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

StonyBrookNLP/ircot
Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23
Language: Jsonnet - Size: 2.01 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 126 - Forks: 16

mukhal/grace
[EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning
Language: Python - Size: 29.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 36 - Forks: 0

wzy6642/PRP
Official implementation for "Get an A in Math: Progressive Rectification Prompting" (AAAI 2024)
Language: Python - Size: 1.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Strong-AI-Lab/Multi-Step-Deductive-Reasoning-Over-Natural-Language
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Language: Python - Size: 10.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 2

Strong-AI-Lab/A-Neural-Symbolic-Paradigm
From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm
Language: Python - Size: 650 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 1

Strong-AI-Lab/PARARULE-Plus
PARARULE Plus: A Larger Deep Multi-Step Reasoning Dataset over Natural Language
Language: Python - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

HarshTrivedi/DecomP-ODQA
Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23
Language: Jsonnet - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
