An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: test-time-compute

haizelabs/verdict

Scale your LLM-as-a-judge.

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 236 - Forks: 16

RylanSchaeffer/KoyejoLab-Large-How-Do-Language-Monkey-Power-Get-Their-Power

Code for the ICML 2025 paper "How Do Large Language Monkeys Get Their Power (Laws)?"

Language: Python - Size: 328 MB - Last synced at: about 22 hours ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

Vortx-AI/memories-dev

Test-Time Memory Framework: Control Hallucinations in Foundation Models

Language: Python - Size: 59.2 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 8 - Forks: 2

sethkarten/pokechamp

Official repository of the paper "PokeChamp: an Expert-level Minimax Language Agent for Competitive Pokemon".

Language: Python - Size: 9.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 44 - Forks: 4

brendanm12345/Onboarding-Agents

A Framework Enabling Web Agents to Master Workflows From Human Demonstration

Language: Python - Size: 606 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Shalev-Lifshitz/MultiAgentVerification

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers

Language: Python - Size: 1.23 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

brotSchimmelt/LLM-MCTS-Inference

An experimental project using MCTS to refine LLM responses for better accuracy and decision-making.

Language: Python - Size: 51.8 KB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1