Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: benchmark-datasets
Psycoy/MixEval
MixEval is a ground-truth-based dynamic benchmark derived from off-the-shelf benchmark mixtures. It produces a highly accurate LLM ranking (0.96 correlation with Chatbot Arena) while running locally and quickly (6% of the time and cost of running MMLU), and its queries are updated every month to avoid contamination.
Language: Python - Size: 2.83 MB - Last synced: about 1 hour ago - Pushed: about 4 hours ago - Stars: 1 - Forks: 0
KaiyangZhou/Dassl.pytorch
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
Language: Python - Size: 454 KB - Last synced: 20 days ago - Pushed: 7 months ago - Stars: 1,088 - Forks: 160
krishnanlab/obnb
A Python toolkit for setting up benchmarking datasets using biomedical networks
Language: Python - Size: 2.77 MB - Last synced: about 2 hours ago - Pushed: about 1 month ago - Stars: 20 - Forks: 0
qianghuangwhu/benchtemp
BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks
Language: Python - Size: 4.93 MB - Last synced: 24 days ago - Pushed: 3 months ago - Stars: 12 - Forks: 3
futianfan/clinical-trial-outcome-prediction
Benchmark dataset and deep learning method (Hierarchical Interaction Network, HINT) for clinical trial approval probability prediction, published in Cell Patterns 2022.
Language: Python - Size: 102 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 84 - Forks: 21
Seyed-Ali-Ahmadi/Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA".
Language: Jupyter Notebook - Size: 7.12 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 271 - Forks: 26
soubhiksanyal/now_evaluation
This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.
Language: Python - Size: 578 KB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 97 - Forks: 16
khalidhabiburahman/kgc-non-benchmark-employee
Source code for experiments in the paper "Beyond Benchmarks: Assessing Knowledge Graph Completion Methods on Non-Benchmark Employee Data" (IEEE 2024, yet to be published)
Language: Jupyter Notebook - Size: 14.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
taoshen99/MUBDsyn
The official repository for the bioRxiv preprint "Deep Reinforcement Learning Enables Better Bias Control in Benchmark for Virtual Screening".
Language: Python - Size: 67.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 4 - Forks: 0
shawnwun/RNNLG
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Language: Python - Size: 23.1 MB - Last synced: 4 months ago - Pushed: almost 5 years ago - Stars: 489 - Forks: 128
swdev1202/argoverse-kitti-adapter (fork of yzhou377/argoverse-kitti-adapter)
A tool to convert the Argoverse dataset into the KITTI dataset format
Size: 158 KB - Last synced: 5 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1
AlexSWong/COVID-Net
Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global multi-disciplinary team of researchers, developers, and clinicians has made publicly available a suite of tailored deep neural network models for tackling different challenges ranging from screening to risk stratification to treatment planning for patients with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Furthermore, we have made available fully curated, open access benchmark datasets comprising some of the largest, most diverse patient cohorts from around the world.
Size: 1.25 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 43 - Forks: 10
idirlab/freebases
Properly pre-processed full-scale Freebase datasets
Language: Python - Size: 5.89 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 0
tlu-dt-nlp/EstGEC-L2-Corpus
Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.
Size: 506 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
phusroyal/ViHOS
Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)
Language: Jupyter Notebook - Size: 5.68 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 26 - Forks: 6
mdv3101/CDeCNet
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
Language: Python - Size: 23.1 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 126 - Forks: 30
milaan9/Clustering-Datasets
A collection of UCI (real-life) datasets and synthetic (artificial) datasets, with cluster labels and MATLAB files, ready to use with clustering algorithms.
Size: 99.2 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 244 - Forks: 212
sile/nasbench-rs
A Rust port of NASBench: https://github.com/google-research/nasbench
Language: Rust - Size: 42 KB - Last synced: 8 days ago - Pushed: 7 months ago - Stars: 2 - Forks: 2
soubhiksanyal/RingNet
Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision
Language: Python - Size: 12.4 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 758 - Forks: 167
AKSW/irbench
Open Information Retrieval Benchmark Framework
Language: Java - Size: 12.3 MB - Last synced: 10 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 0
gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
Language: Python - Size: 318 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
gagolews/clustering-data-v1
A framework for benchmarking clustering algorithms – Benchmark suite, version 1
Language: Jupyter Notebook - Size: 173 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 2 - Forks: 1
vincenzorusso3/mimic-iv-benchmarks
FDSML Course Project 2020/21
Language: Python - Size: 25.2 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 3
omar-sharif03/BAD-Bangla-Aggressive-Text-Dataset
Size: 14.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 3
saeed-anwar/UWSurvey
Diving Deeper into Underwater Image Enhancement: A Survey, accepted in Signal Processing: Image Communication.
Size: 26.9 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 10 - Forks: 1
4kills/zlib_benchmark
Large benchmark data for 4kills/go-zlib and 4kills/go-libdeflate, removed from the original go library/repository itself to minimize library size.
Size: 3.79 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
MILE-IISc/MergedSymbolsKannada
Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text
Language: Shell - Size: 3.64 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 2 - Forks: 0
MILE-IISc/DegradedWordsKannada
Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text
Language: Shell - Size: 7.48 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 2 - Forks: 0