GitHub topics: webdataset
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language: Python - Size: 51.6 MB - Last synced at: about 12 hours ago - Pushed at: 1 day ago - Stars: 2,583 - Forks: 205

huggingface/chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Language: Python - Size: 146 KB - Last synced at: about 11 hours ago - Pushed at: about 1 year ago - Stars: 157 - Forks: 11

npuichigo/tarzan
High-level API for tar-based dataset
Language: Python - Size: 27.3 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

HemuManju/carla-data-collector
Scripts to collect data from CARLA and save them as Webdataset
Language: Python - Size: 210 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 1

robvanvolt/DALLE-tools
DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.
Language: Python - Size: 3.84 MB - Last synced at: about 1 hour ago - Pushed at: about 3 years ago - Stars: 15 - Forks: 9

loickntb/Wiki-Mind Fork of Feuille2912/Wiki-Mind
Web project, using SparQL on dbpedia and wikidata for mental disorder search feature
Language: PHP - Size: 796 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

MichaelNoya/nih-chest-xray-webdataset-subset
A sample subset of the NIH Chest X-ray Dataset. At only 2.4% of the size of the original dataset, it allows creating an accurate classifier using the Augmented Chest X-Ray repository.
Size: 1.74 GB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

mlfoundations/webdataset-resharder
Efficiently process webdatasets
Language: Python - Size: 69.3 KB - Last synced at: 17 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

yuleiqin/fopro
This repo is the official released code of FoPro (AAAI-2023)
Language: Python - Size: 6.97 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

sutd-visual-computing-group/Fourier-Discrepancies-CNN-Detection
[CVPR 2021: Oral] In this work, we show that high frequency Fourier spectrum decay discrepancies are not inherent characteristics for existing CNN-based generative models.
Language: Python - Size: 2.99 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 23 - Forks: 5
