Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataset-generation

SimGus/Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

Language: Python - Size: 16.1 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 309 - Forks: 54

RoloEdits/scrapetoon

A tool for scraping information from Webtoons.

Language: Rust - Size: 7.48 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1

arian-askari/SOLID

A dataset of Intent-Aware LLM-generated Information-Seeking Dialogues useful for various tasks such as training/evaluating User Intent Predictors with the possibility to training/evaluating on real human dialogues. The backbone LLM of SOLID is Zephyr-7b-beta.

Language: Python - Size: 30.6 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

realm-tech/docgen

A document generator used to fully create training and evaluation datasets for OCR applications

Language: Python - Size: 32.5 MB - Last synced: 3 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

scalexi/scalexi

scalexi is a versatile open-source Python library, optimized for Python 3.11+, focuses on facilitating low-code development and fine-tuning of diverse Large Language Models (LLMs).

Language: Python - Size: 31.2 MB - Last synced: 18 days ago - Pushed: about 2 months ago - Stars: 11 - Forks: 1

CDInstitute/Building-Dataset-Generator

Procedural 3D data generation pipeline for architecture

Language: Python - Size: 174 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 67 - Forks: 14

stupidcucumber/SimpleCOCO

Simple dataset creator in COCO-format.

Language: Python - Size: 2.78 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

TheoCoombes/crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

Language: Python - Size: 138 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 30 - Forks: 7

chun92/LoLDataHarvester

dataset generation for league of legends match information

Language: Python - Size: 17.6 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 0

zenetio/synthetic-dataset-od

Synthetic dataset for Object Detection

Language: Jupyter Notebook - Size: 22.9 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0

JadynHax/scpscraper

A Python library designed for scraping data from the SCP wiki.

Language: Python - Size: 216 KB - Last synced: 28 days ago - Pushed: over 3 years ago - Stars: 13 - Forks: 4

nikolito/kuzushiji-label

A management system for annotating Japanese classic kuzushiji characters.

Language: JavaScript - Size: 1.66 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

harveybc/agent-multi

Automated trading agent for an OpenAI Gym enviroment with multiple simultaneous trading of symbols(currency pairs) using separate action and observation timeseries.

Language: Python - Size: 20.5 KB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 15 - Forks: 6

markuryy/easy-dataset-tagger

A simple, user-friendly web application designed to streamline the process of tagging and annotating images for Stable Diffusion model training.

Language: JavaScript - Size: 10.7 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

dvdblk/ohwr-datapal

iOS App for creating Handwriting Recognition datasets. (uni/multi-stroke supported)

Language: Swift - Size: 508 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

v-pnk/long-img-org

Tool for making long-term image dataset capturing more organized.

Language: Python - Size: 78.1 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

samly97/COMSOL-Pipeline

Generate 2D Li-ion microstructure, galvanostatic discharge of battery, and saving of FEM data. Accompanying work for publication in Applied Energy.

Language: MATLAB - Size: 197 KB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

katrinmisel/yolococo

Create a YOLO-format subset of the COCO dataset

Language: Python - Size: 5.56 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

ngzhili/SynTable

The official code implementation for SynTable - A Synthetic Data Generation Pipeline for Cluttered Tabletop Scenes

Language: Python - Size: 207 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 0

DT6A/GSM8K-AI-SubQ

Author's repository for GSM8K-AI-SubQ reasoning dataset

Language: Python - Size: 14.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

magantoine/JobSkape

JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching

Language: Python - Size: 51.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

avriiil/stream-this-dataset

Code to convert static datasets into simulated data streams

Language: Python - Size: 693 KB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 11 - Forks: 0

CristianTuretta/DDoS-Network-Flow-Forensics-Analyser-

We are developing a tool for analyse recorded network traffic in order to detect and investigate about IP source address which may had contribute in a DDoS UDP flood attack. This tool also generates sample pcap datasets.

Language: Python - Size: 637 KB - Last synced: 5 days ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 2

geru-scotland/videogames-dataset-analysis

Este repositorio contiene un dataset y scripts para producirlo. A su vez, analiza información sobre videojuegos, utilizando datos de la API de RAWG. El análisis se centra en identificar tendencias y patrones en la popularidad de diferentes géneros de videojuegos a lo largo del tiempo.

Language: Python - Size: 4.94 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

Spphire/RM-labeling-tool

It's a simulator based on Unity for RoboMaster. You can use it to get some labeled dataset for deep learning

Language: HTML - Size: 110 MB - Last synced: 4 months ago - Pushed: 12 months ago - Stars: 77 - Forks: 6

colddsam/LocalStore

LocalStore is a Python library that provides various operations on a local database of products, which can be used to store and manage information about items in a local store inventory.

Language: Python - Size: 11.5 MB - Last synced: 26 days ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

ArtificialOSS/WebCrawl

Crawls the web to generate a huge dataset for training

Language: Python - Size: 18.6 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 0

MyNameIsPHP/COCO-traffic-sign-dataset-generator

This Python script generates a synthetic dataset of traffic sign images in COCO format, intended for training and testing object detection models. The dataset includes various traffic sign overlays placed on diverse background images, offering a wide range of scenarios to enhance model robustness.

Language: Python - Size: 65.6 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 2 - Forks: 0

draganjovanovich/sharegpt-vim-editor

sharegpt jsonl vim editor

Language: Vim Script - Size: 4.88 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

serhaturtis/TOOL-FastBatchImageCrop

A simple UI tool to batch crop images to prepare datasets from images and videos.

Language: Python - Size: 955 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 17 - Forks: 2

JulesBelveze/time-series-dataset

:wrench: Easy-to-use PyTorch Dataset object for multivariate time series :wrench:

Language: Python - Size: 11.7 KB - Last synced: 13 days ago - Pushed: over 1 year ago - Stars: 25 - Forks: 11

5ANTI-726/Lunar-data-fusion

Deep learning model for lunar feature recognition, data preprocessing and automated database creation for nonexclusive use with the NASA-USGS PILOT platform. Still at database creation/preprocessing stage.

Language: Python - Size: 16.4 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

bryanlusse/HousePrices__Webscraper

Web scraper that creates a dataset of house data from www.funda.nl

Language: Jupyter Notebook - Size: 5.56 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 5 - Forks: 1

functorism/snapcrop

CLI for crop/resize of large amounts of images with configurable resolutions

Language: Rust - Size: 17.5 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

CristianCosci/BTC_dataset_Generator_Glassnode

Python script to create a dataset with all the features available on Glassnode for the analysis of the Bitcoin cryptocurrency.

Language: Jupyter Notebook - Size: 27.3 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 8 - Forks: 2

dennis-barrett/dimdates-dot-com

Source code for the Kimball-style date dimension generator dimdates.com.

Language: JavaScript - Size: 839 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

BenediktAlkin/ImageNetSubsetGenerator

Creates subsets of ImageNet (e.g. ImageNet100)

Language: Python - Size: 2.98 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 8 - Forks: 0

thesagarsehgal/SwatchBharatUrbanCrawler

This is a Crawler built in Scrapy to crawl over the https://sbmurban.org/ website. This is the repository that crawls ASP.NET websites using Scrapy using the __VIEWSTATE.

Language: Python - Size: 1.24 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

ibrahimmansur4/HOID

This project focuses on the development of a robust Convolutional Neural Network (CNN) for the precise detection of human-object interactions in images.

Language: Jupyter Notebook - Size: 2.88 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1

Santosh2611/Smart-Odisha-Hackathon Fork of MusicViking/SOH

Lack of alumni tracking and poor alumni interaction among students who have graduated from educational institutions across Odisha.

Language: HTML - Size: 16.5 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Brandon82/llm-dataset-gen

Using LLMs (OpenAI API) to generate and add data to datasets

Language: Python - Size: 198 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

RandomGamingDev/mcskins-net-scraper

A basic scraper you can use to get the name, description, and actual skin off of https://www.minecraftskins.net easily, whether it be just for storing the data or for something like use in a ML project

Language: Python - Size: 42 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 1

esa/auromat

AUROra MApping Toolkit - Python library / CLI tools for creating and working with georeferenced images for aurora research.

Language: Python - Size: 3.39 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 17 - Forks: 11

ZoeLeBlanc/dhq_scraper

A small repository for compiling Digital Humanities Quarterly into a dataset

Language: Jupyter Notebook - Size: 12.6 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

PSS1998/OCR-Dataset-Image-Augmentation

OCR Dataset creation and Image Augmentations like scan, curve and perspective noise

Language: Python - Size: 1.95 KB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

tomjackman/dataset-scenario-generator

Load Testing Dataset Scenario Generator

Language: JavaScript - Size: 48.8 KB - Last synced: 5 months ago - Pushed: over 7 years ago - Stars: 1 - Forks: 0

Tox2401/BoulderDimensionsCalculator

This program processes geospatial data from vector (.shp) and raster (.tif) files to generate a dataset containing information about boulders on seabed. It calculates the dimensions, coordinates, and depth of each boulder and exports the results to a new shapefile (.shp).

Language: Python - Size: 126 KB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

Harshdeep1996/cite-classifications-wiki

Citation Classification using hybrid neural network model for Wikipedia References

Language: Jupyter Notebook - Size: 732 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 24 - Forks: 5

dylanseychell/mask-to-annotation

mask-to-annotation is a powerful and efficient tool for automatically generating annotations in popular computer vision formats such as COCO, YOLO, and VGG from binary masks.

Language: Jupyter Notebook - Size: 42.2 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 2 - Forks: 0

kunegis/konect-extr

Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur

Language: MATLAB - Size: 1.68 MB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 22 - Forks: 13

JoseRuiz01/SplicingAndCopyMoveDatasetGenerator

Dataset Generator that uses the TIMIT dataset to generate audio with splicing and copy-move forgery.

Language: Python - Size: 23.4 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

81reap/TensorFlow-Image-Classifier

Scripts to set up a dataset and create a simple TensorFlow Image Classifier. These scripts work best with a CUDA supported Nvidia GPU.

Language: Python - Size: 30.3 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

ArenaGrenade/bpycv3d

Blender Python Package for extracting internal data from blender scenes for 3d related data generation purposes.

Language: Python - Size: 430 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0

proger/uk

Фонограми та синтагми: інструменти обробки

Language: Python - Size: 6.54 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 18 - Forks: 0

uleroboticsgroup/SVCP4CDataset

Vulnerable Source Code Collected from Open Source Repositories for Dataset Generation

Size: 178 MB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 5 - Forks: 2

jesusgraterol/binance-futures-dataset-builder

The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.

Language: TypeScript - Size: 20.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

jesusgraterol/bitcoin-lightning-network-stats-dataset-builder

The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

Language: TypeScript - Size: 11.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1

jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

Language: TypeScript - Size: 19.5 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

jazibdawre/DatasetCreator

Script for creating a dataset for AI, ML applications

Language: Python - Size: 72.9 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1

daspartho/DistillClassifier

Easily generate synthetic data for classification tasks using LLMs

Language: Python - Size: 1020 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 2 - Forks: 0

Mr-Nobody1/DatasetMaker

Language: Jupyter Notebook - Size: 6.84 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

RxstydnR/Image_Patch_Generator

Patch image maker for PNU Learning (https://github.com/RxstydnR/CoSPA).

Language: Python - Size: 1.6 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

HC200ok/manual-data-masking

A lightweight javascript library for manual data masking

Language: JavaScript - Size: 46.8 MB - Last synced: 4 months ago - Pushed: almost 2 years ago - Stars: 18 - Forks: 0

F33RNI/DataSer

Image dataset generator for training neural networks. Capable of randomly modifying various image parameters, enhancing the image dataset

Language: Python - Size: 1.47 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 2 - Forks: 0

joris-gentinetta/wikipedia_trees

Using Markov chains to write wikipedia articles. Including algorithm to construct training sets trough the Wikipedia API.

Language: Jupyter Notebook - Size: 53.7 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Silviatulli/dyadic-minigrid

multiplayer game and chat for collecting data on human counterfactual explanations in a collaborative learning task

Language: JavaScript - Size: 2.45 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

suvojit-0x55aa/celebA-HQ-dataset-download

Get started with CelebA-HQ dataset in under 5 mins !

Language: Python - Size: 16.6 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 135 - Forks: 23

KSMubasshir/bd-newspaper-crawlers

A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.

Language: Python - Size: 71.3 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 4 - Forks: 2

yc9701/pansori

Tools for ASR Corpus Generation from Online Video

Language: Python - Size: 13.7 KB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 139 - Forks: 28

Robin-WZQ/AGFD-20K

A Generated Face Dataset: AGFD-20K. A Realistic, High-resolution, Vary & Balanced face dataset, generated by stable diffusion.

Size: 2.97 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 4 - Forks: 0

Kabanosk/noising-website

Website that helps me prepare a dataset for my Engineering Thesis

Language: Python - Size: 2.93 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

LogIN-/fluprint

Import script needed to build FluPRINT database from source :computer:

Language: PHP - Size: 96.7 KB - Last synced: 25 days ago - Pushed: about 5 years ago - Stars: 11 - Forks: 3

darkreactions/ESCALATE_Capture

Data capture and experimental interfacing software for chemistry (part 1 of 2)

Language: Python - Size: 14.9 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 6 - Forks: 2

aqeelanwar/MaskTheFace

Convert face dataset to masked dataset

Language: Python - Size: 230 MB - Last synced: 7 months ago - Pushed: 10 months ago - Stars: 528 - Forks: 152

glaucomunsberg/kootstrap

Kootstrap is a bootstrap to Keras. It is a technique of compile and loading a datasets into a Keras application by means of a few initial instructions that enable the introduction of the rest of the program from an input device. Read more at

Language: Python - Size: 2.4 GB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0

yfq512/data_generation_tools

datasets generation for deep learning

Language: Python - Size: 480 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

vipchengrui/MASG

microphone array speech generator (MASG) in room acoustic

Language: Jupyter Notebook - Size: 21.1 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 31 - Forks: 10

AbhinavThukral97/FaceRecognition

Built on OpenCV 3.2.0 and Python 3.6.0/Anaconda 4.3.0. Code to detect faces using Haar Cascade and match faces using LBPH (Local Binary Patterns Histogram) Recognition on a live web camera.

Language: Python - Size: 160 KB - Last synced: 7 months ago - Pushed: almost 7 years ago - Stars: 8 - Forks: 2

Ojaswy/IIM-A-IBM-Dataset-Generation-Hackathon

Repo for the 2019 IIM-A - IBM Data set Generation Hackathon.

Language: Go - Size: 3.82 MB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

hernanrazo/split-videos-to-frames

Python script that splits videos into individual frames.

Language: Python - Size: 6.84 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 2

KID-22/CTAR

CTAR dataset and generation code

Language: Python - Size: 990 KB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 7 - Forks: 3

inboxpraveen/ASR-Accuracy-Tool

🎙️📝 A powerful Flask-based web application that leverages the latest Hugging Face ASR models to provide real-time speech-to-text (STT) transcripts with an intuitive user interface for easy correction. Perfect for enhancing the quality of training datasets for ASR models. 🚀

Language: Python - Size: 9.05 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

harveybc/q-analyzer

Generate a csv file and a graphic with the correlations between an ideal training signal and each of the channels of a ssa decomposition of the close price and the sum of the channels with a variable number of channels. the objective is to decide the most convenient number of channels that to feed a trading agent in the gym-forex OpenAI environment.

Language: Python - Size: 6.84 KB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 4 - Forks: 1

hdigital/parlgov-snippets

Snippets for data set generation and analyses with ParlGov 🧑🏻‍💻📊

Language: HTML - Size: 29.3 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 0

jgurakuqi/mitsuba-snapshot-tool

The goal of this project is to develop a powerful and user-friendly tool that allows users to produce a dataset of synthetic images for the purpose of testing Shape from Polarization methods

Language: Python - Size: 102 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

AtelierArith/SegRCDB.jl

Unofficial Julia implementation of SegRCDB.jl

Language: Julia - Size: 135 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

mxchinegod/swan_scrape

High-efficiency text & file scraper with smart tracking for building language model datasets.

Language: Jupyter Notebook - Size: 128 KB - Last synced: 24 days ago - Pushed: 9 months ago - Stars: 2 - Forks: 0

aitor-alvarez/MIR-song-dataset-collection

Scripts to create Music Information Retrieval datasets from streaming services for singer identification tasks

Language: Python - Size: 16.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

ThePhenomenon1/IMBD-Database

Creating your own dataset. Scraping multiple pages of data from the IMDB website, in a single script, to fetch top 1000 movies metadata.

Language: Jupyter Notebook - Size: 43 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

RezuwanHassan262/Last-100-plus-years-Earthquake-Data-Analysis-And-Visualization

Data analysis using the last 100+ years of Earthquake data covering South Asia and Bangladesh region

Language: Jupyter Notebook - Size: 2.75 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 3 - Forks: 0

philipperemy/japanese-street-addresses-scraper

Scraper for Japanese street addresses (住所).

Language: Python - Size: 7.02 MB - Last synced: 24 days ago - Pushed: over 2 years ago - Stars: 7 - Forks: 2

IndraSigicharla/GDSC_Datathon

GDSC House Of Developers Datathon - I'm somewhat of a cybersecurity analyst myself

Size: 8.79 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

MaxAdams0/StarryRight

Dataset preparation for VikX

Language: Python - Size: 11.7 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

fahadyaseen001/locationgrabber

A live location data gather app to generate dataset using MERN stack

Language: JavaScript - Size: 3.01 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

aidanastridge/wheres-willow

Like a famous puzzle book series but it's in a dataset.

Language: Python - Size: 181 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 2 - Forks: 0

EmilianoMusso/sql2tsv

Exports SQL Server Table Data in TSV Format

Language: C# - Size: 11.7 KB - Last synced: 8 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

muzammilaz/Labl.it

Image dataset labeling utility for image classification tasks

Language: Python - Size: 35.4 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

laggui/image-search-scraper

Dataset builder tool from web image scraping

Language: Python - Size: 4.57 MB - Last synced: 9 months ago - Pushed: about 5 years ago - Stars: 3 - Forks: 2

cbaziotis/twitter-stream-downloader

A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).

Language: Python - Size: 16.6 KB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 22 - Forks: 3

JoshWarn/Multi-Label-Shapes-Toy-Dataset-Generator

An easy-to-use multi-label image dataset generator.

Language: Python - Size: 103 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0