GitHub / expectopatronm / AsyncHow-Based-Agentic-Systems-Evaluation-Dataset
This repository hosts an Agentic Systems Evaluation Dataset based on AsyncHow, created to evaluate the performance of agentic systems driven by Large Language Models (LLMs). This dataset is designed to assess dynamic task decomposition, tool selection, and task execution.
Stars: 2
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 9.21 MB
Dependencies parsed at: Pending
Created at: 7 months ago
Updated at: 2 months ago
Pushed at: 2 months ago
Last synced at: 2 months ago
Topics: agentic-framework, neurips