GitHub topics: llm-reasoning | Ecosyste.ms: Repos

yinizhilian/ICLR2025-Papers-with-Code

历年ICLR论文和开源项目合集，包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

Size: 1.47 MB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 383 - Forks: 18

inclusionAI/AReaL

Distributed RL System for LLM Reasoning

Language: Python - Size: 17.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,067 - Forks: 125

cukurovaai/Benchmarking-LLMs-in-Indoor-Navigation

[SIU2025] Implementation of Benchmarking LLM Reasoning in Indoor Robot Navigation

Language: Python - Size: 46.7 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

bruno686/Awesome-RL-based-LLM-Reasoning

Awesome RL-based LLM Reasoning

Size: 67.4 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 561 - Forks: 30

IAAR-Shanghai/Awesome-Attention-Heads

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

Language: TeX - Size: 6.07 MB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 355 - Forks: 13

matdev83/llm-reasoning-framework

This project is designed for Answer-then-Think (AoT) processing with Large Language Models (LLMs). It provides a flexible framework to orchestrate complex reasoning tasks by breaking them down into iterative steps, managing LLM interactions, and dynamically adapting based on problem complexity and resource constraints.

Language: Python - Size: 1.68 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 1

mangopy/SearchLM

Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"

Language: Python - Size: 2.25 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 164 - Forks: 0

slowfastai/LLM-Tool-Integrated-Reasoning-TIR-Papers

A curated collection of research papers on LLM Tool-Integrated Reasoning (TIR), where LLMs enhance reasoning by interacting with external tools such as calculators, search engines, and code interpreters.

Size: 13.7 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

falonss703/Awesome-Uncertainty-based-Reinforcement-Learning

🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL

Size: 10.7 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 31 - Forks: 2

MozerWang/AMPO

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

Language: Python - Size: 9.54 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 35 - Forks: 5

Cristian-Curaba/CryptoFormalEval

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Language: Haskell - Size: 7.43 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 6 - Forks: 2

reasoning-survey/Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

Size: 7.42 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 600 - Forks: 56

dingo-actual/mindgraph

Think Faster. Think Deeper. Think MindGraph.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sbhambr1/Trace_Check_QA

Code for Invesitgating Trace-based Knowledge Distillation on Question-Answering

Language: Python - Size: 73.4 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

archersleeping72/CryptoFormalEval

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

BennyTMT/GAMETime

Language: Python - Size: 5.39 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

Graph-Reasoner/GraphPRM

[KDD 2025] Rewarding Graph Reasoning Process makes LLMs more Generalized Reasoners

Language: Python - Size: 39 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 0

Gen-Verse/MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Language: Python - Size: 129 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1,055 - Forks: 47

UKPLab/emnlp2024-code-prompting

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024

Language: Python - Size: 46.6 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 22 - Forks: 4

nl4opt/ORQA

[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.

Language: Python - Size: 2.49 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 36 - Forks: 0

inclusionAI/Ling

Ling is a MoE LLM provided and open-sourced by InclusionAI.

Language: Python - Size: 3.36 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 157 - Forks: 15

twittymatteoscott/CryptoFormalEval

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Trae1ounG/Neural_Incompatibility

Official code for ACL'25 Main: "Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models"

Language: Python - Size: 1.45 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 6 - Forks: 1

soulkeeperc5/CryptoFormalEval

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

lordlord0whitefox/CryptoFormalEval

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

leekerstopme/CryptoFormalEval-n6

We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.

Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0