An open API service providing repository metadata for many open source software ecosystems.

Topic: "llms-reasoning"

karthikv792/LLMs-Planning

An extensible benchmark for evaluating large language models on planning

Language: PDDL - Size: 52 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 351 - Forks: 36

taichengguo/LLM_MultiAgents_Survey_Papers

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Size: 5.29 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 128 - Forks: 3

SuperBruceJia/Awesome-LLM-Self-Consistency

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

Size: 149 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 96 - Forks: 7

eqimp/hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Language: Python - Size: 1.7 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 79 - Forks: 4

Trae1ounG/DyPRAG

Official code for Dynamic Parametric RAG.

Language: Python - Size: 22.9 MB - Last synced at: 18 days ago - Pushed at: 30 days ago - Stars: 65 - Forks: 7

epfl-dlab/cc_flows

The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".

Language: Python - Size: 17.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 30 - Forks: 1

SuperBruceJia/Awesome-Mixture-of-Experts

Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)

Size: 438 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 3

logikon-ai/cot-eval

A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.

Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 2

moemd/app

Causal Inference Analysis for Decreasing Cognitive Load and Enhancing Task Performance via Standardized Markdown Language Routines with Multiple Generative Models

Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

ethicalabs-ai/ouroboros

Self-Improving LLMs Through Iterative Refinement

Language: Python - Size: 429 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

dcaup/app

Unified Pipeline with Crossmodal Data and Decentralized Agents for Causal Analysis of Financial Decision-Making Dynamics

Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

cfmzk/app

Zero-Knowledge Proofs Integrated with Crossmodal and Foundational Models for Causal Analysis of Crypto Market Performance

Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

serkd/app

Comparative Causal Network Analysis of Alpha Waves and HRV (Normalized) Using Knowledge Retrieval for Emotion Recognition Systems

Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

dphot/app

Interaction Pipelines with Multiple LLMs for Thought Hypergraph Distillation to Enhance Error Detection with Pre- and Post-Task Anxiety Analysis

Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

xpmoe/app

Mixture of Experts Framework for Enhanced Explainability of Anxiety States Pre- and Post-Intervention Across Experimental Groups

Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

fdhot/app

Fuzzy Logic Distillation for Structuring Thought Hypergraphs to Enhance Citation Analysis and Relevance Assessment of Academic Articles

Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

ogrnv/random-intelligence-tests

Compare the intelligence of different AIs using randomly generated tasks.

Language: HTML - Size: 212 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

xphot/app

Thought Hypergraphs for Enhanced Detection and Explainability of Errors Across Experimental Groups

Language: Jupyter Notebook - Size: 82 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

Masoudjafaripour/Awesome-LLM-Planning

Awesome-LLM-Planning

Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

gen-deeper/app

Análise de Intervenção em Ansiedade com Descoberta Causal

Language: Python - Size: 51.8 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

sano-explorar/app

Causal Analysis of Sleep Effects on Anxiety, Correct Responses, and Neurophysiological Measures Using Advanced Statistical Techniques

Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

suave-ritual/app

Orchestration of Self-Adaptive Unsupervised Augmentation Variations for Enhanced Causal Analysis of Post-Intervention Anxiety and Correct Responses Relationships

Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

ayahaustine/agentic-memory

Can LLMs really 'Think'? This repo contains implementation of different types of memory for LLMs.

Language: Python - Size: 2.14 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Ipazia-AI/latent-explorer

Latent-Explorer is the Python implementation of the framework proposed in the paper "Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph".

Language: Python - Size: 5.83 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 2

CodeMindICML/CodeMindICML

CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that enables in-depth analysis of the results.

Size: 55.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Fbxfax/llm-confidence-scorer

A set of auxiliary systems designed to provide a measure of estimated confidence for the outputs generated by Large Language Models.

Language: Python - Size: 96.7 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

ronniross/llm-confidence-scorer

A set of auxiliary systems designed to provide a measure of estimated confidence for the outputs generated by Large Language Models.

Language: Python - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

dimits-ts/synthetic_moderation_experiments

Experiments relating to synthetic LLM user-agents and LLM facilitators in online discussions

Language: Jupyter Notebook - Size: 108 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

xhinini/LLM-Reasoning-Review

A curated collection of research papers on reasoning capabilities of Large Language Models (LLMs). This repository organizes and categorizes works that evaluate, benchmark, and analyze reasoning in LLMs, including methods, techniques, datasets, and survey papers.

Size: 26.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Vignesh010101/Intelligent-Health-LLM-System

An Intelligent Health LLM System for Personalized Medication Guidance and Support.

Language: Jupyter Notebook - Size: 615 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Alejandro-Candela/smol-agency

ReAct-powered autonomous agents for office task automation. They process various file formats, generate Excel spreadsheets, conduct deep research, and manage local emails and calendars. Utilizes Gemini 1.5 Flash, fine-tuned open-source models, and Deepseek R1 for efficient and economical execution.

Language: Python - Size: 146 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Bhazantri/CoT-Image_Generation

CoT Reasoning in Autoregressive Image Generation

Language: Python - Size: 4.22 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ogrnv/Quantifying-how-close-an-AI-is-to-AGI-at-any-given-time

Quantifying how close an AI is to AGI at any given time

Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Related Topics
llms 12 llm 9 llms-benchmarking 7 ai 4 reasoning 4 shap 3 fci 3 mixture-of-experts 3 artificial-intelligence 3 causal-discovery 3 large-language-models 3 llm-evaluation 3 dataset 2 agi 2 artificial-general-intelligence 2 intelligence 2 iq 2 psychometics 2 random 2 test 2 testing 2 testing-tools 2 deepseek-r1 2 datasets 2 llm-evaluation-framework 2 llm-evaluation-metrics 2 llm-evaluation-toolkit 2 llm-training 2 llms-efficency 2 llms-evalution 2 llm-inference 2 multi-agents 2 anxiety-prediction 2 explainability-metric 2 cognitive-load 2 dataset-generation 2 agents 2 nlp 2 code 2 planning 2 tests 2 chain-of-thought 2 double-machine-learning 1 causal-analysis 1 causal-hypergraph 1 crossmodal-model 1 crypto-market 1 crypto-market-performace 1 foundation-model 1 zero-knowledge-proof 1 logical-consistency 1 causality-algorithms 1 causal-consistency 1 financial-decision-making 1 self-consistent-generation 1 crossmodal-data 1 decentralized-agents 1 doubleml 1 standardized-markdown 1 task-performance 1 alpha-waves 1 causal-networks 1 emotion-recognition 1 hrv 1 knowledge-retrieval 1 lineardml 1 bug-tracker 1 causal-machine-learning 1 dependency-graph 1 experimental-psychology 1 semantics-preserving 1 semantics-consistency 1 hypergraph-neural-network 1 pattern-matching 1 semantics 1 hypothetical-consistency 1 gpt-4 1 gpt-3 1 factual-consistency 1 compositional-consistency 1 sparse 1 sparse-mixture-of-experts 1 sparse-mixture-of-multimodal-experts 1 sparse-moe 1 chatgpt 1 qwq 1 rag 1 paper 1 open-source 1 leaderboard 1 gen-ai 1 intervention-study 1 llm-planning 1 planning-algorithms 1 self-consistency-learning 1 self-consistency-benchmark 1 expert-network 1 foundation-models 1 gating-network 1 large-language-model 1