GitHub topics: reasoning-language-models
mims-harvard/TxAgent
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Language: Python - Size: 55.9 MB - Last synced at: about 19 hours ago - Pushed at: about 19 hours ago - Stars: 420 - Forks: 63

tubiccelavi/Poker-COACH
Ai Vr Machine Learning Natural language Poker Coach
Language: JavaScript - Size: 42 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

krystalan/DRT
Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (arXiv preprint 2024)
Size: 2.16 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 213 - Forks: 9

Trustworthy-ML-Lab/ThinkEdit
An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.
Language: Python - Size: 6.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 1

XIXUM/XIXUM-modeler
AI Model Generator
Language: Java - Size: 19.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

dvlab-research/Seg-Zero
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Language: Python - Size: 4.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 298 - Forks: 7

linhaowei1/kumo
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
Language: Jupyter Notebook - Size: 630 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 0

Ruiyang-061X/Awesome-MLLM-Reasoning
📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.
Size: 7.81 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

a-m-team/a-m-models
a-m-team's exploration in large language modeling
Size: 5 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 27 - Forks: 0

The-FinAI/Fino1
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
Language: Jupyter Notebook - Size: 137 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 41 - Forks: 7

mdda/getting-to-aha-with-tpus
Reasoning-from-Zero using gemma.JAX.nnx on TPUs
Language: Python - Size: 270 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 9 - Forks: 0

NLPForUA/ZNO
Structured test tasks and model tuning scripts for multiple subjects from ZNO - the Ukrainian External Independent Evaluation (ЗНО)
Language: Python - Size: 2.19 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

mims-harvard/ToolUniverse
ToolUniverse is a collection of biomedical tools designed for AI agents
Language: Python - Size: 2.93 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

DolbyUUU/DeepEnlighten
Pure RL without SFT to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Language: Python - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

DolbyUUU/Logic-RL-Lite
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Language: Python - Size: 14.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Size: 89.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 2

aryan-jadon/Synthetic-Data-Generation-and-Evaluation-using-Reasoning-Models
This repository contains the implementation of our research on optimizing Retrieval-Augmented Generation (RAG) systems for technical domains. Our work addresses the unique challenges of precise information extraction from complex, domain-specific documents by introducing token-aware evaluation metrics and synthetic data generation pipeline.
Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

zihao-ai/BoT
🔥🔥🔥Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ
Language: Python - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 19 - Forks: 0

Hyun-Ryu/clover
Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICLR 2025.
Language: Python - Size: 404 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spcl/x1
Official Implementation of "Reasoning Language Models: A Blueprint"
Language: Python - Size: 563 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 37 - Forks: 6

zyuanlim/Awesome-Open-Reasoning
A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.
Size: 2.93 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
