An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: reasoning-language-models

mims-harvard/TxAgent

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Language: Python - Size: 55.9 MB - Last synced at: about 19 hours ago - Pushed at: about 19 hours ago - Stars: 420 - Forks: 63

tubiccelavi/Poker-COACH

Ai Vr Machine Learning Natural language Poker Coach

Language: JavaScript - Size: 42 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

krystalan/DRT

Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (arXiv preprint 2024)

Size: 2.16 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 213 - Forks: 9

Trustworthy-ML-Lab/ThinkEdit

An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.

Language: Python - Size: 6.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 1

XIXUM/XIXUM-modeler

AI Model Generator

Language: Java - Size: 19.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

dvlab-research/Seg-Zero

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Language: Python - Size: 4.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 298 - Forks: 7

linhaowei1/kumo

☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models

Language: Jupyter Notebook - Size: 630 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 0

Ruiyang-061X/Awesome-MLLM-Reasoning

📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.

Size: 7.81 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

a-m-team/a-m-models

a-m-team's exploration in large language modeling

Size: 5 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 27 - Forks: 0

The-FinAI/Fino1

This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.

Language: Jupyter Notebook - Size: 137 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 41 - Forks: 7

mdda/getting-to-aha-with-tpus

Reasoning-from-Zero using gemma.JAX.nnx on TPUs

Language: Python - Size: 270 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 9 - Forks: 0

NLPForUA/ZNO

Structured test tasks and model tuning scripts for multiple subjects from ZNO - the Ukrainian External Independent Evaluation (ЗНО)

Language: Python - Size: 2.19 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

mims-harvard/ToolUniverse

ToolUniverse is a collection of biomedical tools designed for AI agents

Language: Python - Size: 2.93 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

DolbyUUU/DeepEnlighten

Pure RL without SFT to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.

Language: Python - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

DolbyUUU/Logic-RL-Lite

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

Language: Python - Size: 14.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

Size: 89.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 2

aryan-jadon/Synthetic-Data-Generation-and-Evaluation-using-Reasoning-Models

This repository contains the implementation of our research on optimizing Retrieval-Augmented Generation (RAG) systems for technical domains. Our work addresses the unique challenges of precise information extraction from complex, domain-specific documents by introducing token-aware evaluation metrics and synthetic data generation pipeline.

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

zihao-ai/BoT

🔥🔥🔥Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ

Language: Python - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 19 - Forks: 0

Hyun-Ryu/clover

Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICLR 2025.

Language: Python - Size: 404 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spcl/x1

Official Implementation of "Reasoning Language Models: A Blueprint"

Language: Python - Size: 563 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 37 - Forks: 6

zyuanlim/Awesome-Open-Reasoning

A curated list of awesome open-source and open-weight language models or methods focused on reasoning capabilities.

Size: 2.93 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Related Keywords
reasoning-language-models 21 llm 7 large-language-models 6 reasoning 6 reinforcement-learning 4 deepseek-r1 4 deepseek 4 chain-of-thought 3 reasoning-agent 3 nlp 2 gemma 2 fine-tuning 2 gpt-o1 2 awesome 2 benchmark 2 multimodal 2 post-training 2 generative-ai 2 reasoning-models 2 large-reasoning-models 2 llama 2 precision-medicine 2 therapeutics 2 tool-use 2 ai 2 agents 2 multimodal-reasoning 1 multimodal-large-language-models 1 mllm-reasoning 1 ukrainian-language-dataset 1 ukrainian-language 1 ukraine 1 coaching-platform 1 natural-language-processing 1 math 1 language-model 1 history 1 geography 1 exam 1 evaluation 1 natural-language-understanding 1 transformers 1 test-time-compute 1 machine-learning 1 language-models 1 inference-time-compute 1 cot 1 awesome-list 1 artificial-intelligence 1 rlm 1 reasoning-llms 1 mcts-for-llms 1 lrm 1 logical-reasoning 1 qwq 1 backdoor-attacks 1 ai-agents 1 synthetic-dataset-generation 1 llm-framework 1 llm-evaluation 1 multimodal-reasoning-benchmarks 1 nextjs 1 php 1 segmentation 1 multimodel-large-language-model 1 poker 1 probability-statistics 1 models 1 modeling 1 react 1 knowledge-graph 1 generator 1 cognitive-neuroscience 1 ai-code-generator 1 mechanistic-interpretability 1 interpretable-machine-learning 1 shadcn-ui 1 deep-learning 1 literature-translation 1 machine-translation 1 dataset 1 data-annotation 1 tpu 1 nnx 1 jax 1 csharp 1 llms 1 llamas 1 gpt-4o 1 financial-modeling 1 dotnet 1 slow-thinking 1 openai 1 o3-mini 1 o1 1 multi-modal-large-language-model 1 multi-modal 1 mllm 1 lvlm 1 chain-of-thought-reasoning 1