AsyncHow-Based-Agentic-Systems-Evaluation-Dataset

This repository hosts an Agentic Systems Evaluation Dataset based on AsyncHow, created to evaluate the performance of agentic systems driven by Large Language Models (LLMs). This dataset is designed to assess dynamic task decomposition, tool selection, and task execution.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/expectopatronm%2FAsyncHow-Based-Agentic-Systems-Evaluation-Dataset

Stars: 2
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 9.21 MB
Dependencies parsed at: Pending

Created at: 7 months ago
Updated at: 2 months ago
Pushed at: 2 months ago
Last synced at: 2 months ago

Topics: agentic-framework, neurips

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / expectopatronm / AsyncHow-Based-Agentic-Systems-Evaluation-Dataset