GitHub topics: reasoning-models

Repositories

lee-messi/RM-IAT

Replication Materials for "Implicit Bias-Like Patterns in Reasoning Models"

Language: Jupyter Notebook - Size: 7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

Abhisang3/xVerify

xVerify: Efficient Answer Verifier for Large Language Model Evaluations

Language: Python - Size: 806 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 1

eric-ai-lab/Soft-Thinking

Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Language: Python - Size: 4.41 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 217 - Forks: 16

microsoft/DSP-Plus

Implementation and subsequent optimization for "Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models"

Language: Python - Size: 1.32 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 20 - Forks: 2

Zefan-Cai/R-KV

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Language: Python - Size: 55.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,105 - Forks: 178

mohammad-gh009/DrugReasoner

Predicting drug approval with reasoning.

Language: Python - Size: 136 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

dialexity/dialectical-framework

Turn stories, strategies, or systems into insight. Auto-generate Dialectical Wheels (DWs) from any text to reveal blind spots, surface polarities, and trace dynamic paths toward synthesis. DWs are semantic maps that expose tension, transformation, and coherence within a system—whether narrative, ethical, organizational, or technological.

Language: Python - Size: 6.3 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 8 - Forks: 5

zilliztech/deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Language: Python - Size: 16.8 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 6,836 - Forks: 672

ariannamethod/ARIANNA-CHAIN

The chain of Arianna. Resonating and Reasoning Model.

Language: Python - Size: 1.24 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 2 - Forks: 0

DevEvil-AI/Nexus-AI

A simple AI source code that includes chat, reasoning and image features using public APIs like xAI, OpenAI, HuggingFace and Flux.

Language: JavaScript - Size: 24.4 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 4 - Forks: 1

likhonsdev/GenZ

This repository contains the code for training and deploying the GenZ model. The process is fully automated using GitHub Actions.

Language: JavaScript - Size: 611 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

microsoft/BUILD25-LAB333

This repository hosts the instructions and workshop materials for Lab 333 - Evaluate Reasoning Models for Your Generative AI Solutions

Language: Jupyter Notebook - Size: 4.75 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 17

kaicheng001/Awesome-R1

A curated list of research papers, models, and resources related to R1-style reasoning models following DeepSeek-R1's breakthrough in January 2025.

Size: 57.6 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

nimishbongale/nimible-researcher

For quick yet deep research on any topic, use this Nimble Researcher. It will generate a response and improve it based on automated feedback until it meets the requirements.

Language: Python - Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

codelion/pts

Pivotal Token Search

Language: Python - Size: 692 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 111 - Forks: 7

UKPLab/acl2025-diverse-cot

Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

Language: Python - Size: 20.3 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 29 - Forks: 3

yongchao98/R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning

Size: 973 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 14 - Forks: 2

Abishk-developer/kv

Modern remote KVM solution with minimal setup. Stream video and audio, send input from SBCs, and expose disk images easily. 🌐💻

Language: Crystal - Size: 86.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sinanuozdemir/oreilly-agi

Explore the evolution of AGI through historical context, reasoning models, and agent systems, while gaining hands-on experience with cutting-edge models like Claude 4, DeepSeek-R1, and OpenAI's o3. Learn to critically evaluate AGI benchmarks, understand their limitations, and identify where current models excel or struggle in reasoning tasks.

Language: Jupyter Notebook - Size: 351 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 5

czg1225/VeriThinker

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Language: Python - Size: 2.47 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 47 - Forks: 1

MiniMax-AI/MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Language: Python - Size: 7.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,513 - Forks: 190

OpenSPG/KAG-Thinker

An interactive thinking and deep reasoning model. It provides a cognitive reasoning paradigm for complex multi-hop problems.

Language: Python - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 15 - Forks: 0

UCSC-VLAA/MedReason

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Language: Python - Size: 4.66 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 177 - Forks: 16

AbhaySingh71/AI-Lawyer-RAG-with-Deepseek

AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek , Ollama RAG and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented generation (RAG), it provides precise legal insights, and contract summarization. With an intuitive Streamlit-based UI, analyze legal documents.

Language: Python - Size: 1.15 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 3

silentghost2/Using-machine-learning-model-to-detect-bias-correction-of-papers-

This is an ;emi completed evaluation software which is based upon the question paper pattern of certain university. which is going to be intregrated with ML to detect the bias correction and help evaluators

Language: HTML - Size: 44.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

MicDZ/MANBench

MANBench: Is Your Multimodal Model Smarter than Human?

Language: Python - Size: 22.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

aygp-dr/illusion-of-thinking

Project exploring Apple's 'The Illusion of Thinking' research paper on LRM limitations

Language: Scheme - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

IAAR-Shanghai/xVerify

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Language: Python - Size: 826 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 109 - Forks: 7

hao-ai-lab/Dynasor

Simple extension on vLLM to help you speed up reasoning model without training.

Language: Python - Size: 11.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 152 - Forks: 23

fscdc/ReasonMap

ReasonMap

Language: Python - Size: 7.45 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

DolbyUUU/Sudoku4LLM

Sudoku4LLM is a Sudoku dataset generator for training and evaluating reasoning in Large Language Models (LLMs). It offers customizable puzzles, difficulty levels, and 11 serialization formats to support structured data reasoning and Chain of Thought (CoT) experiments.

Language: Python - Size: 29.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

KazKozDev/net-reflective-reasoning-llm

LLM reasoning method: combining reflexive cueing with real-time web search and multi-stage analysis for more accurate and explainable answers.

Language: Python - Size: 18.9 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

nitya/model-mondays Fork of microsoft/model-mondays

Model Mondays is a weekly livestreamed series on Microsoft Reactor that helps you make informed model choice decisions with timely updates and model deep-dives. Watch live for the content. Join Discord for the discussions.

Language: Jupyter Notebook - Size: 6.73 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Related Keywords

reasoning-models 38 llm 15 deepseek-r1 8 reasoning 6 reasoning-language-models 6 ai 5 reinforcement-learning 5 large-language-models 5 fine-tuning 4 chatgpt 3 openai 3 benchmark 3 huggingface 3 deepseek 3 post-training 3 gpt-o1 2 agi 2 reasoning-agent 2 neural-network 2 grok 2 llm-inference 2 vlm 2 vector-database 2 python 2 model-catalog 2 azure-ai-foundry 2 generative-ai 2 agents 2 artificial-intelligence 2 deepseek-math 2 evaluation 2 judge-model 2 math-verify 2 open-compass 2 open-r1 2 reliability 2 reliability-tools 2 xverify 2 kvcache 2 hackintosh 1 deepsearchalgorithmus 1 linux 1 deepthinking 1 kag 1 medical-dataset 1 medical-large-language-models 1 chatbot 1 macos-installer 1 macos-mojave 1 faiss-vector-database 1 monterey-hackintosh 1 groqapi 1 semantic-analysis 1 langchain 1 namespace 1 virtual-machine 1 ai-agents 1 artifical-general-inteligence 1 deepseek-r1-distill-llama 1 deepseek-r1-distill-qwen 1 efficiency 1 emulation 1 distributed 1 catalina-hackinotsh 1 bigsur-hackintosh 1 sonoma-hackintosh 1 redis-cluster 1 minimax-m1 1 deepresearch 1 kernel-debugging 1 kvm 1 web-search 1 github-models 1 model-choice 1 model-mondays 1 multilingual-models 1 multimodal-models 1 small-language-models 1 ai-learning 1 entity-extraction 1 knowledge-distillation 1 knowledge-graph 1 inference 1 test-time-computation 1 ttc 1 ai-games 1 civilization 1 nation-states 1 o1 1 o3-mini 1 socioeconomics 1 legal-analytics-and-data-science 1 llm-agent 1 ollama 1 ollamaembeddings 1 retrieval-augmented-generation 1 streamlit 1 machine-learning-algorithms 1 nlp-machine-learning 1 vision-language-model 1