Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataset-manager

experimaestro/datamaestro_text

Plugin for the dataset module containing information access related datasets

Language: Python - Size: 630 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 3 - Forks: 1

girishmm/almirah

a dataset management tool

Language: Python - Size: 4.67 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

StarlangSoftware/Classification-Swift

Machine Learning Library for Classification Tasks

Language: Swift - Size: 180 KB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 2 - Forks: 2

StarlangSoftware/Classification-Js

Machine Learning Library for Classification Tasks

Language: TypeScript - Size: 938 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0

StarlangSoftware/Classification-CS

Machine learning library for classification tasks

Language: C# - Size: 593 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 1

StarlangSoftware/Classification-Py

Machine learning library for classification tasks

Language: Python - Size: 721 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 11 - Forks: 3

StarlangSoftware/Classification-Cy

Machine learning library for classification tasks

Language: Cython - Size: 724 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0

StarlangSoftware/Classification

Machine learning library for classification tasks

Language: Java - Size: 1.25 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 4 - Forks: 6

vicolab/ml-pyxis

Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.

Language: Python - Size: 72.3 KB - Last synced: 19 days ago - Pushed: over 3 years ago - Stars: 116 - Forks: 17

cosmaadrian/acumen-indexer

Utility for constructing highly efficient in-memory / on-disk datasets.

Language: Python - Size: 16.6 KB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 3 - Forks: 0

ynop/audiomate

Python library for handling audio datasets.

Language: Python - Size: 9.07 MB - Last synced: 24 days ago - Pushed: 11 months ago - Stars: 130 - Forks: 25

midusi/handshape_datasets

A single library to (down)load all existing sign language handshape datasets.

Language: Python - Size: 6.62 MB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 13 - Forks: 2

MDAnalysis/MDAnalysisData

Access to data for workshops and extended tests of MDAnalysis.

Language: Python - Size: 7.09 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 11 - Forks: 5

agarsev/quevedo

Tool for managing datasets of images with compositional semantics, part of VisSE project.

Language: Python - Size: 9.41 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0

midusi/sign_language_datasets 📦

A single library to (down)load all existing sign language video datasets.

Language: Python - Size: 85 KB - Last synced: 13 days ago - Pushed: almost 5 years ago - Stars: 6 - Forks: 1

x-CK-x/Dataset-Curation-Tool

A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and clips models for your purposes. Custom datasets can be added!

Language: Python - Size: 13.7 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 19 - Forks: 6

KoStyle/dataset_manager

A python tool to perform operations on specific datasets (i.e. APP dataset and IMDB dataset)

Language: Python - Size: 190 KB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

overshiki/datasets

handle datasets such as mnist, cifar, coil20, fer2013 and so on.

Language: Python - Size: 17.5 MB - Last synced: 2 months ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 2

experimaestro/datamaestro

Scripts to automatize and standardize dataset handling

Language: Python - Size: 656 KB - Last synced: 16 days ago - Pushed: 3 months ago - Stars: 12 - Forks: 5

marizombie/headless_directory_viewer

:rocket: Whenever you need to look through huge pile of images and cannot use force of file explorer, or you just work on a remote headless machine, you can use this tool. It also allows to move files from one folder to another, creating destination if it does not exist. Work in progress.

Language: HTML - Size: 336 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 4 - Forks: 0

dunky11/voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language: Python - Size: 57 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 195 - Forks: 31

tkp-archive/nannotate 📦

Automate ML dataset labelling

Language: JavaScript - Size: 1.64 MB - Last synced: 15 days ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 2

silenterus/deepspeech-cleaner

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

Language: Python - Size: 389 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 47 - Forks: 7

StarlangSoftware/Classification-CPP

Machine learning library for classification tasks

Language: C++ - Size: 64 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 0

geo-c/OCT-Core

Module of the Open City Toolkit to visualize use of open datasets by applications:

Language: JavaScript - Size: 1.24 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1

dekdevy/idm

Tag images, batch resize, export, string interpolated descriptions + common image dataset utilities

Language: Svelte - Size: 3.19 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 9 - Forks: 1

TheAnachronism/ImageDataSetTagEditor

A tag editor written in C# and WPF

Language: C# - Size: 944 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 11 - Forks: 1

moon-strider/jeeja-image-labeller

A windows application designed to label and split datasets

Language: C# - Size: 40 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

XIMDEX/xlyre 📦

Linked Open Data Management Module for Ximdex CMS

Language: PHP - Size: 1000 KB - Last synced: about 1 month ago - Pushed: over 9 years ago - Stars: 3 - Forks: 2

bbenligiray/nus_wide_formatter

A tool to download and format NUS-WIDE dataset for multilabel classification

Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 4 - Forks: 2

scarletcho/prep4kaldi

Data preparation code for building Kaldi ASR system

Language: Python - Size: 22.5 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 12 - Forks: 9

harveyslash/ms-celeb-extractor

Extraction tool to parse MS Celeb dataset

Language: Python - Size: 37.1 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1

bbenligiray/ms_coco_formatter

A tool to download and format MS COCO dataset for multilabel classification

Language: Python - Size: 358 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 6 - Forks: 1

bbenligiray/pascal_voc2007_formatter

A tool to download and format PASCAL VOC 2007 dataset for multilabel classification

Language: Python - Size: 9.77 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 12 - Forks: 3

dmvieira/dataset-manager

Manage dataset for data science projects

Language: Python - Size: 40.8 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1

progressionnetwork/PE-Dataset-Sorter

Tool for removing dublicates, validate, sort and separate PE files (for x86\64, .NET\Native, PE\NOT PE) in specified directories

Language: Jupyter Notebook - Size: 387 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

fredrike/dataverse-social_network

Facebook Social Network dataset manipulator

Language: Jupyter Notebook - Size: 95.7 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

hyzhak/tfdatasets

middleware in pipeline between dataset and TensorFlow classifier

Language: Python - Size: 34.2 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

empiricalci/dataset-cache

:floppy_disk: A solution for downloading and caching datasets

Language: JavaScript - Size: 12.7 KB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0

andrewhrytsiv/dataset-manager

Build on Heroku

Language: JavaScript - Size: 9.56 MB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 1 - Forks: 1

Related Keywords
dataset-manager 40 decision-tree-classifier 7 deep-neural-networks 7 pca 7 data-science 6 subset-selection 6 statistical-tests 6 rocchio-algorithm 6 random-forest-classifier 6 quadratic-discriminant-analysis 6 naive-bayes-classifier 6 multilayer-perceptron-network 6 multilayer-perceptron 6 linear-discriminant-analysis 6 feature-selection 6 decision-stumps 6 bagging-trees 6 dataset 6 knn-algorithm 4 classification-algorithm 4 image-classification 4 python 4 datasets 4 autoencoder-classification 4 machine-learning 4 dataset-filtering 4 multilabel-classification 3 python3 3 open-data 2 speech-recognition 2 dataset-creation 2 sign-language 2 tagging 2 downloader 2 corpus-tools 2 audio-datasets 2 metadata 2 dataset-catalog 2 deep-learning 2 labeling-tool 2 support-vector-machine-svm 2 classification-algorithms 2 csharp 1 image-labeling 1 spark-java 1 specific-datasets 1 image-processing 1 windows-forms 1 catalog 1 dataset-generation 1 dms 1 linked-open-data 1 linked-data 1 stable-diffusion 1 dotnet 1 search-interface 1 rest-api 1 open-science 1 open-government 1 open-city-toolkit 1 open-access 1 metadata-store 1 metadata-api 1 database-management 1 multilanguage 1 mozilla 1 postgresql 1 java 1 doggy-jam 1 database 1 angular 1 science 1 research-data-management 1 dataset-cache 1 tensorflow 1 pipeline 1 social-network-analysis 1 facebook 1 pre-ml 1 pe 1 binary-classification 1 pandas 1 data-handling 1 pascal-voc 1 ms-coco 1 microsoft-research 1 face-recognition 1 kaldi 1 asr 1 nus-wide 1 xml 1 ximdex-cms 1 transparency 1 cifar 1 nlp 1 ia 1 imageboard-grabber 1 data-curation 1 captioning-videos 1 captioning-images 1