An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: benchmark-datasets

Event-AHU/OpenPAR

[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch

Language: Python - Size: 68.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 128 - Forks: 17

Belluxx/LocalAIME

Test your local LLMs on the AIME problems

Language: Python - Size: 569 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 1

gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

Language: Jupyter Notebook - Size: 179 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 8 - Forks: 1

KaiyangZhou/Dassl.pytorch

A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.

Language: Python - Size: 454 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 1,322 - Forks: 183

gagolews/clustering-results-v1

A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)

Language: Python - Size: 362 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 0

Seyed-Ali-Ahmadi/Awesome_Satellite_Benchmark_Datasets

Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

Language: Jupyter Notebook - Size: 7.12 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 349 - Forks: 28

soubhiksanyal/RingNet

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Language: Python - Size: 12.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 846 - Forks: 172

shawnwun/RNNLG

RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

Language: Python - Size: 23.1 MB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 491 - Forks: 126

rohit901/VANE-Bench

[NAACL'25] Contains code and documentation for our VANE-Bench paper.

Language: Python - Size: 38.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 11 - Forks: 1

milaan9/Clustering-Datasets

This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.

Size: 99.2 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 302 - Forks: 223

soubhiksanyal/now_evaluation

This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.

Language: Python - Size: 578 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 109 - Forks: 16

PasanBhanu/time-series-forcasting-benchmark-dataset-preprocessing

Benchmark Datasets for Time Series Forecasting Preprocessing - NASA HTTP Dataset, WorldCup98 Dataset

Language: Jupyter Notebook - Size: 2.95 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

krishnanlab/obnb

A Python toolkit for setting up benchmarking dataset using biomedical networks

Language: Python - Size: 2.66 MB - Last synced at: 5 days ago - Pushed at: 17 days ago - Stars: 22 - Forks: 1

ali-vilab/IDEA-Bench

Official repository of IDEA-Bench

Language: Python - Size: 14.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 21 - Forks: 1

karthiksoman/biomixQA

Repository for BiomixQA benchmark dataset

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

sile/nasbench-rs

A Rust port of NASBench: https://github.com/google-research/nasbench

Language: Rust - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 2 - Forks: 2

idirlab/freebases

Properly pre-processed full-scale Freebase datasets

Language: Python - Size: 5.91 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 0

tlu-dt-nlp/EstGEC-L2-Corpus

Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.

Language: Python - Size: 729 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

qianghuangwhu/benchtemp

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

Language: Python - Size: 4.93 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 3

AI-team-UoA/GeoQuestions1089

Crowdsourced Geospatial Question-Answering dataset containing triples of question-queries-answers.

Language: PostScript - Size: 83.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 1

futianfan/clinical-trial-outcome-prediction

benchmark dataset and Deep learning method (Hierarchical Interaction Network, HINT) for clinical trial approval probability prediction, published in Cell Patterns 2022.

Language: Python - Size: 102 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 84 - Forks: 21

khalidhabiburahman/kgc-non-benchmark-employee

Source code for experiments in the papers "Beyond Benchmarks: Assessing Knowledge Graph Completion Methods on Non-Benchmark Employee Data" (IEEE 2024, yet to be published)

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

taoshen99/MUBDsyn

The official repository for the bioRxiv preprint "Deep Reinforcement Learning Enables Better Bias Control in Benchmark for Virtual Screening".

Language: Python - Size: 67.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

swdev1202/argoverse-kitti-adapter Fork of yzhou377/argoverse-kitti-adapter

A tool to translate Argoverse into KITTI dataset format

Size: 158 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

AlexSWong/COVID-Net

Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global multi-disciplinary team of researchers, developers, and clinicians have made publicly available a suite of tailored deep neural network models for tackling different challenges ranging from screening to risk stratification to treatment planning for patients with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Furthermore, we have made available fully curated, open access benchmark datasets comprised of some of the largest, most diverse patient cohorts from around the world.

Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 10

phusroyal/ViHOS

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)

Language: Jupyter Notebook - Size: 5.68 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 6

mdv3101/CDeCNet

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Language: Python - Size: 23.1 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 126 - Forks: 30

AKSW/irbench

Open Information Retrieval Benchmark Framework

Language: Java - Size: 12.3 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

vincenzorusso3/mimic-iv-benchmarks

FDSML Course Project 2020/21

Language: Python - Size: 25.2 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

omar-sharif03/BAD-Bangla-Aggressive-Text-Dataset

Size: 14.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

saeed-anwar/UWSurvey

Diving Deeper into Underwater Image Enhancement: A Survey, accepted in Signal Processing: Image Communication.

Size: 26.9 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 1

4kills/zlib_benchmark

Large benchmark data for 4kills/go-zlib and 4kills/go-libdeflate, removed from the original go library/repository itself to minimize library size.

Size: 3.79 MB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

MILE-IISc/MergedSymbolsKannada

Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text

Language: Shell - Size: 3.64 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

MILE-IISc/DegradedWordsKannada

Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text

Language: Shell - Size: 7.48 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

Related Keywords
benchmark-datasets 34 machine-learning 12 deep-learning 8 dataset 7 benchmark 6 datasets 5 python 3 computer-vision 3 clustering 3 knowledge-graph 3 data 2 3d-landmarks 2 3d-mesh 2 ground-truth 2 face 2 face-reconstruction 2 flame 2 flame-model 2 document-analysis 2 synthetic-data 2 single-image-reconstruction 2 triplet-loss 2 knowledge-graph-embeddings 2 knowledge-graph-completion 2 graph-neural-networks 2 large-language-models 2 natural-language-processing 2 text 2 test-images 2 clinical-data 2 segmentation 2 pytorch 2 recognition 2 printed 2 ocr 2 python3 2 2d-3d 2 3d-data 2 kannada 2 healthcare 1 ml 1 neurocomputing 1 neural-networks 1 vihos 1 vietnamese-nlp 1 vietnamese-dataset 1 span-prediction 1 tinyml 1 hate-speech 1 nlp 1 sequence-labeling 1 social-media-mining 1 span-detection 1 drug-development 1 life-sciences 1 therapeutics 1 knowledge-graph-construction 1 generative-model 1 reinforcement-learning 1 virtual-screening 1 argoverse 1 autonomous-driving 1 kitti-dataset 1 stereo-vision 1 ai 1 chest-ct-images 1 chest-xray-images 1 covid-19 1 covid-net 1 dl 1 edgeai 1 fibrosis 1 text-classification 1 toxicity 1 comprehensive 1 deep-algorithms 1 deep-networks 1 evaluation-metrics 1 survey 1 underwater-images 1 visual-comparisons 1 benchmark-data 1 merged-characters 1 merged-symbols 1 subword-images 1 cut-characters 1 degraded 1 old-books 1 split-characters 1 word-images 1 cdec-net 1 object-detection 1 sota 1 table 1 table-detection 1 table-detection-using-deep-learning 1 evaluation 1 latex 1 measures 1 qald 1