Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: benchmark-datasets

Psycoy/MixEval

MixEval, a ground-truth-based dynamic benchmark derived from off-the-shelf benchmark mixtures, which evaluates LLMs with a highly capable model ranking (i.e., 0.96 correlation with Chatbot Arena) while running locally and quickly (6% the time and cost of running MMLU), with its queries being stably updated every month to avoid contamination.

Language: Python - Size: 2.83 MB - Last synced: about 1 hour ago - Pushed: about 4 hours ago - Stars: 1 - Forks: 0

KaiyangZhou/Dassl.pytorch

A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.

Language: Python - Size: 454 KB - Last synced: 20 days ago - Pushed: 7 months ago - Stars: 1,088 - Forks: 160

krishnanlab/obnb

A Python toolkit for setting up benchmarking dataset using biomedical networks

Language: Python - Size: 2.77 MB - Last synced: about 2 hours ago - Pushed: about 1 month ago - Stars: 20 - Forks: 0

qianghuangwhu/benchtemp

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

Language: Python - Size: 4.93 MB - Last synced: 24 days ago - Pushed: 3 months ago - Stars: 12 - Forks: 3

futianfan/clinical-trial-outcome-prediction

benchmark dataset and Deep learning method (Hierarchical Interaction Network, HINT) for clinical trial approval probability prediction, published in Cell Patterns 2022.

Language: Python - Size: 102 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 84 - Forks: 21

Seyed-Ali-Ahmadi/Awesome_Satellite_Benchmark_Datasets

Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

Language: Jupyter Notebook - Size: 7.12 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 271 - Forks: 26

soubhiksanyal/now_evaluation

This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.

Language: Python - Size: 578 KB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 97 - Forks: 16

khalidhabiburahman/kgc-non-benchmark-employee

Source code for experiments in the papers "Beyond Benchmarks: Assessing Knowledge Graph Completion Methods on Non-Benchmark Employee Data" (IEEE 2024, yet to be published)

Language: Jupyter Notebook - Size: 14.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

taoshen99/MUBDsyn

The official repository for the bioRxiv preprint "Deep Reinforcement Learning Enables Better Bias Control in Benchmark for Virtual Screening".

Language: Python - Size: 67.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 4 - Forks: 0

shawnwun/RNNLG

RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

Language: Python - Size: 23.1 MB - Last synced: 4 months ago - Pushed: almost 5 years ago - Stars: 489 - Forks: 128

swdev1202/argoverse-kitti-adapter Fork of yzhou377/argoverse-kitti-adapter

A tool to translate Argoverse into KITTI dataset format

Size: 158 KB - Last synced: 5 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1

AlexSWong/COVID-Net

Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global multi-disciplinary team of researchers, developers, and clinicians have made publicly available a suite of tailored deep neural network models for tackling different challenges ranging from screening to risk stratification to treatment planning for patients with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Furthermore, we have made available fully curated, open access benchmark datasets comprised of some of the largest, most diverse patient cohorts from around the world.

Size: 1.25 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 43 - Forks: 10

idirlab/freebases

Properly pre-processed full-scale Freebase datasets

Language: Python - Size: 5.89 MB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 0

tlu-dt-nlp/EstGEC-L2-Corpus

Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.

Size: 506 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

phusroyal/ViHOS

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)

Language: Jupyter Notebook - Size: 5.68 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 26 - Forks: 6

mdv3101/CDeCNet

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Language: Python - Size: 23.1 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 126 - Forks: 30

milaan9/Clustering-Datasets

This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.

Size: 99.2 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 244 - Forks: 212

sile/nasbench-rs

A Rust port of NASBench: https://github.com/google-research/nasbench

Language: Rust - Size: 42 KB - Last synced: 8 days ago - Pushed: 7 months ago - Stars: 2 - Forks: 2

soubhiksanyal/RingNet

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Language: Python - Size: 12.4 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 758 - Forks: 167

AKSW/irbench

Open Information Retrieval Benchmark Framework

Language: Java - Size: 12.3 MB - Last synced: 10 days ago - Pushed: over 3 years ago - Stars: 4 - Forks: 0

gagolews/clustering-results-v1

A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)

Language: Python - Size: 318 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

Language: Jupyter Notebook - Size: 173 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 2 - Forks: 1

vincenzorusso3/mimic-iv-benchmarks

FDSML Course Project 2020/21

Language: Python - Size: 25.2 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 3

omar-sharif03/BAD-Bangla-Aggressive-Text-Dataset

Size: 14.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 3

saeed-anwar/UWSurvey

Diving Deeper into Underwater Image Enhancement: A Survey, accepted in Signal Processing: Image Communication.

Size: 26.9 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 10 - Forks: 1

4kills/zlib_benchmark

Large benchmark data for 4kills/go-zlib and 4kills/go-libdeflate, removed from the original go library/repository itself to minimize library size.

Size: 3.79 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

MILE-IISc/MergedSymbolsKannada

Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text

Language: Shell - Size: 3.64 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 2 - Forks: 0

MILE-IISc/DegradedWordsKannada

Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text

Language: Shell - Size: 7.48 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 2 - Forks: 0

Related Keywords
benchmark-datasets 28 machine-learning 12 deep-learning 9 benchmark 7 dataset 6 datasets 4 clustering 3 python 3 computer-vision 3 graph-neural-networks 2 clinical-data 2 2d-3d 2 3d-data 2 3d-landmarks 2 3d-mesh 2 face 2 data 2 face-reconstruction 2 flame 2 flame-model 2 python3 2 single-image-reconstruction 2 triplet-loss 2 knowledge-graph 2 knowledge-graph-completion 2 knowledge-graph-embeddings 2 synthetic-data 2 natural-language-processing 2 pytorch 2 kannada 2 ground-truth 2 document-analysis 2 ocr 2 printed 2 recognition 2 segmentation 2 test-images 2 text 2 evaluation 2 uci 1 uci-dataset 1 uci-machine-learning 1 synthetic-datasets 1 neural-architecture-search 1 real-world-datasets 1 old-books 1 clustering-datasets 1 rust 1 3d-face-reconstruction 1 split-characters 1 3d-models 1 benchmark-mixture 1 hate-speech 1 nlp 1 sequence-labeling 1 social-media-mining 1 span-detection 1 span-prediction 1 vietnamese-dataset 1 vietnamese-nlp 1 vihos 1 cdec-net 1 object-detection 1 sota 1 table 1 table-detection 1 table-detection-using-deep-learning 1 word-images 1 cluster 1 cluster-labels 1 survey 1 evaluation-metrics 1 underwater-images 1 deep-networks 1 deep-algorithms 1 comprehensive 1 visual-comparisons 1 toxicity 1 text-classification 1 neurocomputing 1 bengali-nlp 1 aggression-identification 1 sklearn 1 pandas 1 numpy 1 mortality-prediction 1 benchmark-data 1 keras 1 clinical-prediction-tasks 1 merged-characters 1 merged-symbols 1 trec 1 qald 1 subword-images 1 cut-characters 1 measures 1 latex 1 tensorflow 1 degraded 1 ringnet 1