Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: dataset-manager
experimaestro/datamaestro_text
Plugin for the dataset module containing information access related datasets
Language: Python - Size: 630 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 3 - Forks: 1
girishmm/almirah
a dataset management tool
Language: Python - Size: 4.67 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0
StarlangSoftware/Classification-Swift
Machine Learning Library for Classification Tasks
Language: Swift - Size: 180 KB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 2 - Forks: 2
StarlangSoftware/Classification-Js
Machine Learning Library for Classification Tasks
Language: TypeScript - Size: 938 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0
StarlangSoftware/Classification-CS
Machine learning library for classification tasks
Language: C# - Size: 593 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 1
StarlangSoftware/Classification-Py
Machine learning library for classification tasks
Language: Python - Size: 721 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 11 - Forks: 3
StarlangSoftware/Classification-Cy
Machine learning library for classification tasks
Language: Cython - Size: 724 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0
StarlangSoftware/Classification
Machine learning library for classification tasks
Language: Java - Size: 1.25 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 4 - Forks: 6
vicolab/ml-pyxis
Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Language: Python - Size: 72.3 KB - Last synced: 19 days ago - Pushed: over 3 years ago - Stars: 116 - Forks: 17
cosmaadrian/acumen-indexer
Utility for constructing highly efficient in-memory / on-disk datasets.
Language: Python - Size: 16.6 KB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 3 - Forks: 0
ynop/audiomate
Python library for handling audio datasets.
Language: Python - Size: 9.07 MB - Last synced: 24 days ago - Pushed: 11 months ago - Stars: 130 - Forks: 25
midusi/handshape_datasets
A single library to (down)load all existing sign language handshape datasets.
Language: Python - Size: 6.62 MB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 13 - Forks: 2
MDAnalysis/MDAnalysisData
Access to data for workshops and extended tests of MDAnalysis.
Language: Python - Size: 7.09 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 11 - Forks: 5
agarsev/quevedo
Tool for managing datasets of images with compositional semantics, part of VisSE project.
Language: Python - Size: 9.41 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0
midusi/sign_language_datasets 📦
A single library to (down)load all existing sign language video datasets.
Language: Python - Size: 85 KB - Last synced: 13 days ago - Pushed: almost 5 years ago - Stars: 6 - Forks: 1
x-CK-x/Dataset-Curation-Tool
A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and clips models for your purposes. Custom datasets can be added!
Language: Python - Size: 13.7 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 19 - Forks: 6
KoStyle/dataset_manager
A python tool to perform operations on specific datasets (i.e. APP dataset and IMDB dataset)
Language: Python - Size: 190 KB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
overshiki/datasets
handle datasets such as mnist, cifar, coil20, fer2013 and so on.
Language: Python - Size: 17.5 MB - Last synced: 2 months ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 2
experimaestro/datamaestro
Scripts to automatize and standardize dataset handling
Language: Python - Size: 656 KB - Last synced: 16 days ago - Pushed: 3 months ago - Stars: 12 - Forks: 5
marizombie/headless_directory_viewer
:rocket: Whenever you need to look through huge pile of images and cannot use force of file explorer, or you just work on a remote headless machine, you can use this tool. It also allows to move files from one folder to another, creating destination if it does not exist. Work in progress.
Language: HTML - Size: 336 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 0
dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
Language: Python - Size: 57 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 195 - Forks: 31
tkp-archive/nannotate 📦
Automate ML dataset labelling
Language: JavaScript - Size: 1.64 MB - Last synced: 15 days ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 2
silenterus/deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Language: Python - Size: 389 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 47 - Forks: 7
StarlangSoftware/Classification-CPP
Machine learning library for classification tasks
Language: C++ - Size: 64 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 0
geo-c/OCT-Core
Module of the Open City Toolkit to visualize use of open datasets by applications:
Language: JavaScript - Size: 1.24 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1
dekdevy/idm
Tag images, batch resize, export, string interpolated descriptions + common image dataset utilities
Language: Svelte - Size: 3.19 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 9 - Forks: 1
TheAnachronism/ImageDataSetTagEditor
A tag editor written in C# and WPF
Language: C# - Size: 944 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 11 - Forks: 1
moon-strider/jeeja-image-labeller
A windows application designed to label and split datasets
Language: C# - Size: 40 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
XIMDEX/xlyre 📦
Linked Open Data Management Module for Ximdex CMS
Language: PHP - Size: 1000 KB - Last synced: about 1 month ago - Pushed: over 9 years ago - Stars: 3 - Forks: 2
bbenligiray/nus_wide_formatter
A tool to download and format NUS-WIDE dataset for multilabel classification
Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 4 - Forks: 2
scarletcho/prep4kaldi
Data preparation code for building Kaldi ASR system
Language: Python - Size: 22.5 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 12 - Forks: 9
harveyslash/ms-celeb-extractor
Extraction tool to parse MS Celeb dataset
Language: Python - Size: 37.1 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1
bbenligiray/ms_coco_formatter
A tool to download and format MS COCO dataset for multilabel classification
Language: Python - Size: 358 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 6 - Forks: 1
bbenligiray/pascal_voc2007_formatter
A tool to download and format PASCAL VOC 2007 dataset for multilabel classification
Language: Python - Size: 9.77 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 12 - Forks: 3
dmvieira/dataset-manager
Manage dataset for data science projects
Language: Python - Size: 40.8 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1
progressionnetwork/PE-Dataset-Sorter
Tool for removing dublicates, validate, sort and separate PE files (for x86\64, .NET\Native, PE\NOT PE) in specified directories
Language: Jupyter Notebook - Size: 387 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0
fredrike/dataverse-social_network
Facebook Social Network dataset manipulator
Language: Jupyter Notebook - Size: 95.7 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
hyzhak/tfdatasets
middleware in pipeline between dataset and TensorFlow classifier
Language: Python - Size: 34.2 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
empiricalci/dataset-cache
:floppy_disk: A solution for downloading and caching datasets
Language: JavaScript - Size: 12.7 KB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0
andrewhrytsiv/dataset-manager
Build on Heroku
Language: JavaScript - Size: 9.56 MB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 1 - Forks: 1