GitHub topics: benchmark-datasets

Repositories

dreadnode/AIRTBench-Code

Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models

Language: Jupyter Notebook - Size: 671 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 59 - Forks: 8

milaan9/Clustering-Datasets

This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.

Size: 99.2 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 312 - Forks: 236

futianfan/clinical-trial-outcome-prediction

Benchmark dataset and deep learning method (Hierarchical Interaction Network, HINT) for clinical trial approval probability prediction, published in Cell Patterns 2022.

Language: Python - Size: 100 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 123 - Forks: 36

Omar-Sharif/BAD-Bangla-Aggressive-Text-Dataset

Size: 14.1 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

Seyed-Ali-Ahmadi/Awesome_Satellite_Benchmark_Datasets

Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

Language: Jupyter Notebook - Size: 7.12 MB - Last synced at: about 17 hours ago - Pushed at: over 1 year ago - Stars: 354 - Forks: 28

Event-AHU/OpenPAR

[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch

Language: Python - Size: 68.1 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 128 - Forks: 17

google-deepmind/forest_typology

Datasets to protect Earth's forests and biodiversity

Language: Jupyter Notebook - Size: 2.95 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 75 - Forks: 9

sile/nasbench-rs

A Rust port of NASBench: https://github.com/google-research/nasbench

Language: Rust - Size: 43.9 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

Belluxx/LocalAIME

Test your local LLMs on the AIME problems

Language: Python - Size: 569 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 1

gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

Language: Jupyter Notebook - Size: 179 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 1

KaiyangZhou/Dassl.pytorch

A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.

Language: Python - Size: 454 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1,322 - Forks: 183

gagolews/clustering-results-v1

A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)

Language: Python - Size: 362 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

soubhiksanyal/RingNet

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Language: Python - Size: 12.4 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 846 - Forks: 172

RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

Language: Python - Size: 23.1 MB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 491 - Forks: 126

rohit901/VANE-Bench

[NAACL'25] Contains code and documentation for our VANE-Bench paper.

Language: Python - Size: 38.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

soubhiksanyal/now_evaluation

This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.

Language: Python - Size: 578 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 109 - Forks: 16

PasanBhanu/time-series-forcasting-benchmark-dataset-preprocessing

Benchmark Datasets for Time Series Forecasting Preprocessing - NASA HTTP Dataset, WorldCup98 Dataset

Language: Jupyter Notebook - Size: 2.95 MB - Last synced at: about 3 hours ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

krishnanlab/obnb

A Python toolkit for setting up benchmarking datasets using biomedical networks

Language: Python - Size: 2.66 MB - Last synced at: about 4 hours ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 1

ali-vilab/IDEA-Bench

Official repository of IDEA-Bench

Language: Python - Size: 14.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 21 - Forks: 1

karthiksoman/biomixQA

Repository for BiomixQA benchmark dataset

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

idirlab/freebases

Properly pre-processed full-scale Freebase datasets

Language: Python - Size: 5.91 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

tlu-dt-nlp/EstGEC-L2-Corpus

Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.

Language: Python - Size: 729 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

qianghuangwhu/benchtemp

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

Language: Python - Size: 4.93 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 3

AI-team-UoA/GeoQuestions1089

Crowdsourced geospatial question-answering dataset containing question-query-answer triples.

Language: PostScript - Size: 83.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 1

khalidhabiburahman/kgc-non-benchmark-employee

Source code for experiments in the paper "Beyond Benchmarks: Assessing Knowledge Graph Completion Methods on Non-Benchmark Employee Data" (IEEE 2024, yet to be published)

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

taoshen99/MUBDsyn

The official repository for the bioRxiv preprint "Deep Reinforcement Learning Enables Better Bias Control in Benchmark for Virtual Screening".

Language: Python - Size: 67.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

swdev1202/argoverse-kitti-adapter (fork of yzhou377/argoverse-kitti-adapter)

A tool to translate Argoverse into KITTI dataset format

Size: 158 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

AlexSWong/COVID-Net

Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global multi-disciplinary team of researchers, developers, and clinicians have made publicly available a suite of tailored deep neural network models for tackling different challenges ranging from screening to risk stratification to treatment planning for patients with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Furthermore, we have made available fully curated, open access benchmark datasets comprised of some of the largest, most diverse patient cohorts from around the world.

Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 10

Related Keywords

benchmark-datasets (36), machine-learning (13), deep-learning (9), benchmark (7), dataset (7), datasets (5), clustering (3), computer-vision (3), knowledge-graph (3), python (3), natural-language-processing (2), large-language-models (2), remote-sensing (2), earth-observation (2), python3 (2), graph-neural-networks (2), ai (2), triplet-loss (2), data (2), single-image-reconstruction (2), pytorch (2), 2d-3d (2), 3d-data (2), 3d-landmarks (2), 3d-mesh (2), flame-model (2), face (2), face-reconstruction (2), flame (2), artificial-intelligence (2), knowledge-graph-embeddings (2), knowledge-graph-completion (2), text (2), test-images (2), segmentation (2), recognition (2), printed (2), ocr (2), kannada (2), ground-truth (2), synthetic-data (2), clinical-data (2), document-analysis (2), gold-standard (1), sparql (1), benchmark-framework (1), healthcare (1), question-answering (1), ml (1), geospatial-data (1), estonian-language (1), dynamic-graph (1), geosparql (1), neural-networks (1), temporal-graph-networks (1), dynamic-node-classification (1), dynamic-link-prediction (1), error-corpora (1), corpus (1), generative-model (1), reinforcement-learning (1), virtual-screening (1), argoverse (1), autonomous-driving (1), knowledge-graph-construction (1), kitti-dataset (1), stereo-vision (1), chest-ct-images (1), chest-xray-images (1), language-resources (1), covid-19 (1), grammatical-error-correction (1), covid-net (1), dl (1), edgeai (1), fibrosis (1), tinyml (1), keras (1), mortality-prediction (1), numpy (1), pandas (1), sklearn (1), comprehensive (1), deep-algorithms (1), deep-networks (1), evaluation-metrics (1), survey (1), underwater-images (1), visual-comparisons (1), benchmark-data (1), merged-characters (1), merged-symbols (1), subword-images (1), cut-characters (1), degraded (1), old-books (1), split-characters (1), word-images (1), hate-speech (1), nlp (1)