GitHub topics: multi-step-reasoning

Repositories

adeelahmad/mlx-grpo

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀

Language: Python - Size: 85.9 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9 - Forks: 1

ahmedmhussein111/mlx-grpo

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

Language: Python - Size: 87.9 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

pritamqu/VCRBench

VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

versionHQ/multi-agent-system

Autonomous agent networks for task automation that requires multi-step reasoning

Language: Python - Size: 3.58 MB - Last synced at: 25 days ago - Pushed at: 2 months ago - Stars: 18 - Forks: 3

TianduoWang/MsAT

[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707

Language: Python - Size: 3.78 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 0

LakshitaS/Agentic-RAG-implementation

Implementation of "Building Agentic RAG with LlamaIndex" offered by DeepLearning.AI focusing on developing intelligent research agents using the Retrieval-Augmented Generation (RAG) framework, specifically utilizing LlamaIndex.

Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

StonyBrookNLP/ircot

Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23

Language: Jsonnet - Size: 2.01 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 126 - Forks: 16

mukhal/grace

[EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning

Language: Python - Size: 29.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 36 - Forks: 0

wzy6642/PRP

Official implementation for "Get an A in Math: Progressive Rectification Prompting" (AAAI 2024)

Language: Python - Size: 1.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Strong-AI-Lab/Multi-Step-Deductive-Reasoning-Over-Natural-Language

Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

Language: Python - Size: 10.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 2

Strong-AI-Lab/A-Neural-Symbolic-Paradigm

From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm

Language: Python - Size: 650 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 1

Strong-AI-Lab/PARARULE-Plus

PARARULE Plus: A Larger Deep Multi-Step Reasoning Dataset over Natural Language

Language: Python - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

HarshTrivedi/DecomP-ODQA

Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23

Language: Jsonnet - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Related Keywords

multi-step-reasoning 13 chain-of-thought 5 reasoning 3 llm 3 mathematical-reasoning 3 gate-attention 2 deductive-reasoning 2 rag 2 soft-reasoning 2 retrieval-augmented-qa 2 question-answering 2 large-language-models 2 thinking 2 rlhf 2 reasoning-ai 2 mlx 2 llama 2 grpo 2 deepseek-r1 2 apple-silicon 2 ai 2 iterative 1 math-word-problem-solving 1 gpt-35-turbo 1 text-generation 1 symbolic-reasoning 1 language-model 1 decoding 1 multi-step-retrieval 1 tool-calling 1 rectification 1 verification 1 zero-shot-prompting 1 out-of-distribution-generalisation 1 deep-learning 1 natural-language-processing 1 neural-symbolic-paradigm 1 symbolic-logic-reasoning 1 transformer 1 natural-language-generation 1 natural-language-understanding 1 symbolic-logic 1 artificial-intelligence 1 benchmark 1 causal-reasoning 1 large-multimodal-models 1 large-video-language-models 1 multimodal-large-language-models 1 video 1 agentic-ai 1 autonomous-agents 1 composiotool 1 docling 1 graph-theory 1 langchain 1 litellm 1 matplotlib 1 mem0ai 1 multi-agent-systems 1 networkx 1 orchestration-framework 1 pydantic 1 pygraphviz 1 python3 1 self-directed-learning 1 acl2023 1 math-word-problem 1 agentic-workflow 1 router-query-engine 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos