Topic: "llms-reasoning"
karthikv792/LLMs-Planning
An extensible benchmark for evaluating large language models on planning
Language: PDDL - Size: 52 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 351 - Forks: 36

taichengguo/LLM_MultiAgents_Survey_Papers
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Size: 5.29 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 128 - Forks: 3

SuperBruceJia/Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
Size: 149 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 96 - Forks: 7

eqimp/hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
Language: Python - Size: 1.7 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 79 - Forks: 4

Trae1ounG/DyPRAG
Official code for Dynamic Parametric RAG.
Language: Python - Size: 22.9 MB - Last synced at: 18 days ago - Pushed at: 30 days ago - Stars: 65 - Forks: 7

epfl-dlab/cc_flows
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
Language: Python - Size: 17.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 30 - Forks: 1

SuperBruceJia/Awesome-Mixture-of-Experts
Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)
Size: 438 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 3

logikon-ai/cot-eval
A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.
Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 2

moemd/app
Causal Inference Analysis for Decreasing Cognitive Load and Enhancing Task Performance via Standardized Markdown Language Routines with Multiple Generative Models
Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

ethicalabs-ai/ouroboros
Self-Improving LLMs Through Iterative Refinement
Language: Python - Size: 429 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

dcaup/app
Unified Pipeline with Crossmodal Data and Decentralized Agents for Causal Analysis of Financial Decision-Making Dynamics
Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

cfmzk/app
Zero-Knowledge Proofs Integrated with Crossmodal and Foundational Models for Causal Analysis of Crypto Market Performance
Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

serkd/app
Comparative Causal Network Analysis of Alpha Waves and HRV (Normalized) Using Knowledge Retrieval for Emotion Recognition Systems
Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

dphot/app
Interaction Pipelines with Multiple LLMs for Thought Hypergraph Distillation to Enhance Error Detection with Pre- and Post-Task Anxiety Analysis
Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

xpmoe/app
Mixture of Experts Framework for Enhanced Explainability of Anxiety States Pre- and Post-Intervention Across Experimental Groups
Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

fdhot/app
Fuzzy Logic Distillation for Structuring Thought Hypergraphs to Enhance Citation Analysis and Relevance Assessment of Academic Articles
Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

ogrnv/random-intelligence-tests
Compare the intelligence of different AIs using randomly generated tasks.
Language: HTML - Size: 212 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

xphot/app
Thought Hypergraphs for Enhanced Detection and Explainability of Errors Across Experimental Groups
Language: Jupyter Notebook - Size: 82 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

Masoudjafaripour/Awesome-LLM-Planning
Awesome-LLM-Planning
Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

gen-deeper/app
Anxiety Intervention Analysis with Causal Discovery
Language: Python - Size: 51.8 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

sano-explorar/app
Causal Analysis of Sleep Effects on Anxiety, Correct Responses, and Neurophysiological Measures Using Advanced Statistical Techniques
Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

suave-ritual/app
Orchestration of Self-Adaptive Unsupervised Augmentation Variations for Enhanced Causal Analysis of Post-Intervention Anxiety and Correct Responses Relationships
Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

ayahaustine/agentic-memory
Can LLMs really 'Think'? This repo contains implementations of different types of memory for LLMs.
Language: Python - Size: 2.14 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Ipazia-AI/latent-explorer
Latent-Explorer is the Python implementation of the framework proposed in the paper "Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph".
Language: Python - Size: 5.83 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 2

CodeMindICML/CodeMindICML
CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that enables in-depth analysis of the results.
Size: 55.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Fbxfax/llm-confidence-scorer
A set of auxiliary systems designed to provide a measure of estimated confidence for the outputs generated by Large Language Models.
Language: Python - Size: 96.7 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

ronniross/llm-confidence-scorer
A set of auxiliary systems designed to provide a measure of estimated confidence for the outputs generated by Large Language Models.
Language: Python - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

dimits-ts/synthetic_moderation_experiments
Experiments relating to synthetic LLM user-agents and LLM facilitators in online discussions
Language: Jupyter Notebook - Size: 108 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

xhinini/LLM-Reasoning-Review
A curated collection of research papers on reasoning capabilities of Large Language Models (LLMs). This repository organizes and categorizes works that evaluate, benchmark, and analyze reasoning in LLMs, including methods, techniques, datasets, and survey papers.
Size: 26.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Vignesh010101/Intelligent-Health-LLM-System
An Intelligent Health LLM System for Personalized Medication Guidance and Support.
Language: Jupyter Notebook - Size: 615 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Alejandro-Candela/smol-agency
ReAct-powered autonomous agents for office task automation. They process various file formats, generate Excel spreadsheets, conduct deep research, and manage local emails and calendars. Uses Gemini 1.5 Flash, fine-tuned open-source models, and DeepSeek R1 for efficient and economical execution.
Language: Python - Size: 146 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Bhazantri/CoT-Image_Generation
CoT Reasoning in Autoregressive Image Generation
Language: Python - Size: 4.22 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ogrnv/Quantifying-how-close-an-AI-is-to-AGI-at-any-given-time
Quantifying how close an AI is to AGI at any given time
Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
