An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multi-step-reasoning

adeelahmad/mlx-grpo

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀

Language: Python - Size: 85.9 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9 - Forks: 1

ahmedmhussein111/mlx-grpo

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

Language: Python - Size: 87.9 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

pritamqu/VCRBench

VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

versionHQ/multi-agent-system

Autonomous agent networks for task automation that requires multi-step reasoning

Language: Python - Size: 3.58 MB - Last synced at: 25 days ago - Pushed at: 2 months ago - Stars: 18 - Forks: 3

TianduoWang/MsAT

[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707

Language: Python - Size: 3.78 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 0

LakshitaS/Agentic-RAG-implementation

Implementation of the "Building Agentic RAG with LlamaIndex" course offered by DeepLearning.AI, which focuses on developing intelligent research agents with Retrieval-Augmented Generation (RAG) using LlamaIndex.

Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

StonyBrookNLP/ircot

Repository for "Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions" (ACL 2023)

Language: Jsonnet - Size: 2.01 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 126 - Forks: 16

mukhal/grace

[EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning

Language: Python - Size: 29.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 36 - Forks: 0

wzy6642/PRP

Official implementation for "Get an A in Math: Progressive Rectification Prompting" (AAAI 2024)

Language: Python - Size: 1.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Strong-AI-Lab/Multi-Step-Deductive-Reasoning-Over-Natural-Language

Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

Language: Python - Size: 10.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 2

Strong-AI-Lab/A-Neural-Symbolic-Paradigm

From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm

Language: Python - Size: 650 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 1

Strong-AI-Lab/PARARULE-Plus

PARARULE Plus: A Larger Deep Multi-Step Reasoning Dataset over Natural Language

Language: Python - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

HarshTrivedi/DecomP-ODQA

Official repository for ODQA experiments from "Decomposed Prompting: A Modular Approach for Solving Complex Tasks" (ICLR 2023)

Language: Jsonnet - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0