Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: dataset-generation
SimGus/Chatette
A powerful dataset generator for Rasa NLU, inspired by Chatito
Language: Python - Size: 16.1 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 309 - Forks: 54
RoloEdits/scrapetoon
A tool for scraping information from Webtoons.
Language: Rust - Size: 7.48 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1
arian-askari/SOLID
A dataset of Intent-Aware LLM-generated Information-Seeking Dialogues useful for various tasks such as training/evaluating User Intent Predictors with the possibility to training/evaluating on real human dialogues. The backbone LLM of SOLID is Zephyr-7b-beta.
Language: Python - Size: 30.6 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
realm-tech/docgen
A document generator used to fully create training and evaluation datasets for OCR applications
Language: Python - Size: 32.5 MB - Last synced: 3 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
scalexi/scalexi
scalexi is a versatile open-source Python library, optimized for Python 3.11+, focuses on facilitating low-code development and fine-tuning of diverse Large Language Models (LLMs).
Language: Python - Size: 31.2 MB - Last synced: 18 days ago - Pushed: about 2 months ago - Stars: 11 - Forks: 1
CDInstitute/Building-Dataset-Generator
Procedural 3D data generation pipeline for architecture
Language: Python - Size: 174 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 67 - Forks: 14
stupidcucumber/SimpleCOCO
Simple dataset creator in COCO-format.
Language: Python - Size: 2.78 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
TheoCoombes/crawlingathome
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
Language: Python - Size: 138 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 30 - Forks: 7
chun92/LoLDataHarvester
dataset generation for league of legends match information
Language: Python - Size: 17.6 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 0
zenetio/synthetic-dataset-od
Synthetic dataset for Object Detection
Language: Jupyter Notebook - Size: 22.9 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0
JadynHax/scpscraper
A Python library designed for scraping data from the SCP wiki.
Language: Python - Size: 216 KB - Last synced: 28 days ago - Pushed: over 3 years ago - Stars: 13 - Forks: 4
nikolito/kuzushiji-label
A management system for annotating Japanese classic kuzushiji characters.
Language: JavaScript - Size: 1.66 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
harveybc/agent-multi
Automated trading agent for an OpenAI Gym enviroment with multiple simultaneous trading of symbols(currency pairs) using separate action and observation timeseries.
Language: Python - Size: 20.5 KB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 15 - Forks: 6
markuryy/easy-dataset-tagger
A simple, user-friendly web application designed to streamline the process of tagging and annotating images for Stable Diffusion model training.
Language: JavaScript - Size: 10.7 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
dvdblk/ohwr-datapal
iOS App for creating Handwriting Recognition datasets. (uni/multi-stroke supported)
Language: Swift - Size: 508 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
v-pnk/long-img-org
Tool for making long-term image dataset capturing more organized.
Language: Python - Size: 78.1 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
samly97/COMSOL-Pipeline
Generate 2D Li-ion microstructure, galvanostatic discharge of battery, and saving of FEM data. Accompanying work for publication in Applied Energy.
Language: MATLAB - Size: 197 KB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
katrinmisel/yolococo
Create a YOLO-format subset of the COCO dataset
Language: Python - Size: 5.56 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
ngzhili/SynTable
The official code implementation for SynTable - A Synthetic Data Generation Pipeline for Cluttered Tabletop Scenes
Language: Python - Size: 207 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 0
DT6A/GSM8K-AI-SubQ
Author's repository for GSM8K-AI-SubQ reasoning dataset
Language: Python - Size: 14.9 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
magantoine/JobSkape
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
Language: Python - Size: 51.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
avriiil/stream-this-dataset
Code to convert static datasets into simulated data streams
Language: Python - Size: 693 KB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 11 - Forks: 0
CristianTuretta/DDoS-Network-Flow-Forensics-Analyser-
We are developing a tool for analyse recorded network traffic in order to detect and investigate about IP source address which may had contribute in a DDoS UDP flood attack. This tool also generates sample pcap datasets.
Language: Python - Size: 637 KB - Last synced: 5 days ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 2
geru-scotland/videogames-dataset-analysis
Este repositorio contiene un dataset y scripts para producirlo. A su vez, analiza información sobre videojuegos, utilizando datos de la API de RAWG. El análisis se centra en identificar tendencias y patrones en la popularidad de diferentes géneros de videojuegos a lo largo del tiempo.
Language: Python - Size: 4.94 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
Spphire/RM-labeling-tool
It's a simulator based on Unity for RoboMaster. You can use it to get some labeled dataset for deep learning
Language: HTML - Size: 110 MB - Last synced: 4 months ago - Pushed: 12 months ago - Stars: 77 - Forks: 6
colddsam/LocalStore
LocalStore is a Python library that provides various operations on a local database of products, which can be used to store and manage information about items in a local store inventory.
Language: Python - Size: 11.5 MB - Last synced: 26 days ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
ArtificialOSS/WebCrawl
Crawls the web to generate a huge dataset for training
Language: Python - Size: 18.6 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 0
MyNameIsPHP/COCO-traffic-sign-dataset-generator
This Python script generates a synthetic dataset of traffic sign images in COCO format, intended for training and testing object detection models. The dataset includes various traffic sign overlays placed on diverse background images, offering a wide range of scenarios to enhance model robustness.
Language: Python - Size: 65.6 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 2 - Forks: 0
draganjovanovich/sharegpt-vim-editor
sharegpt jsonl vim editor
Language: Vim Script - Size: 4.88 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
serhaturtis/TOOL-FastBatchImageCrop
A simple UI tool to batch crop images to prepare datasets from images and videos.
Language: Python - Size: 955 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 17 - Forks: 2
JulesBelveze/time-series-dataset
:wrench: Easy-to-use PyTorch Dataset object for multivariate time series :wrench:
Language: Python - Size: 11.7 KB - Last synced: 13 days ago - Pushed: over 1 year ago - Stars: 25 - Forks: 11
5ANTI-726/Lunar-data-fusion
Deep learning model for lunar feature recognition, data preprocessing and automated database creation for nonexclusive use with the NASA-USGS PILOT platform. Still at database creation/preprocessing stage.
Language: Python - Size: 16.4 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
bryanlusse/HousePrices__Webscraper
Web scraper that creates a dataset of house data from www.funda.nl
Language: Jupyter Notebook - Size: 5.56 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 5 - Forks: 1
functorism/snapcrop
CLI for crop/resize of large amounts of images with configurable resolutions
Language: Rust - Size: 17.5 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
CristianCosci/BTC_dataset_Generator_Glassnode
Python script to create a dataset with all the features available on Glassnode for the analysis of the Bitcoin cryptocurrency.
Language: Jupyter Notebook - Size: 27.3 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 8 - Forks: 2
dennis-barrett/dimdates-dot-com
Source code for the Kimball-style date dimension generator dimdates.com.
Language: JavaScript - Size: 839 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
BenediktAlkin/ImageNetSubsetGenerator
Creates subsets of ImageNet (e.g. ImageNet100)
Language: Python - Size: 2.98 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 8 - Forks: 0
thesagarsehgal/SwatchBharatUrbanCrawler
This is a Crawler built in Scrapy to crawl over the https://sbmurban.org/ website. This is the repository that crawls ASP.NET websites using Scrapy using the __VIEWSTATE.
Language: Python - Size: 1.24 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
ibrahimmansur4/HOID
This project focuses on the development of a robust Convolutional Neural Network (CNN) for the precise detection of human-object interactions in images.
Language: Jupyter Notebook - Size: 2.88 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1
Santosh2611/Smart-Odisha-Hackathon Fork of MusicViking/SOH
Lack of alumni tracking and poor alumni interaction among students who have graduated from educational institutions across Odisha.
Language: HTML - Size: 16.5 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
Brandon82/llm-dataset-gen
Using LLMs (OpenAI API) to generate and add data to datasets
Language: Python - Size: 198 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
RandomGamingDev/mcskins-net-scraper
A basic scraper you can use to get the name, description, and actual skin off of https://www.minecraftskins.net easily, whether it be just for storing the data or for something like use in a ML project
Language: Python - Size: 42 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 1
esa/auromat
AUROra MApping Toolkit - Python library / CLI tools for creating and working with georeferenced images for aurora research.
Language: Python - Size: 3.39 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 17 - Forks: 11
ZoeLeBlanc/dhq_scraper
A small repository for compiling Digital Humanities Quarterly into a dataset
Language: Jupyter Notebook - Size: 12.6 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
PSS1998/OCR-Dataset-Image-Augmentation
OCR Dataset creation and Image Augmentations like scan, curve and perspective noise
Language: Python - Size: 1.95 KB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
tomjackman/dataset-scenario-generator
Load Testing Dataset Scenario Generator
Language: JavaScript - Size: 48.8 KB - Last synced: 5 months ago - Pushed: over 7 years ago - Stars: 1 - Forks: 0
Tox2401/BoulderDimensionsCalculator
This program processes geospatial data from vector (.shp) and raster (.tif) files to generate a dataset containing information about boulders on seabed. It calculates the dimensions, coordinates, and depth of each boulder and exports the results to a new shapefile (.shp).
Language: Python - Size: 126 KB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
Harshdeep1996/cite-classifications-wiki
Citation Classification using hybrid neural network model for Wikipedia References
Language: Jupyter Notebook - Size: 732 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 24 - Forks: 5
dylanseychell/mask-to-annotation
mask-to-annotation is a powerful and efficient tool for automatically generating annotations in popular computer vision formats such as COCO, YOLO, and VGG from binary masks.
Language: Jupyter Notebook - Size: 42.2 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 2 - Forks: 0
kunegis/konect-extr
Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur
Language: MATLAB - Size: 1.68 MB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 22 - Forks: 13
JoseRuiz01/SplicingAndCopyMoveDatasetGenerator
Dataset Generator that uses the TIMIT dataset to generate audio with splicing and copy-move forgery.
Language: Python - Size: 23.4 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
81reap/TensorFlow-Image-Classifier
Scripts to set up a dataset and create a simple TensorFlow Image Classifier. These scripts work best with a CUDA supported Nvidia GPU.
Language: Python - Size: 30.3 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0
ArenaGrenade/bpycv3d
Blender Python Package for extracting internal data from blender scenes for 3d related data generation purposes.
Language: Python - Size: 430 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0
proger/uk
Фонограми та синтагми: інструменти обробки
Language: Python - Size: 6.54 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 18 - Forks: 0
uleroboticsgroup/SVCP4CDataset
Vulnerable Source Code Collected from Open Source Repositories for Dataset Generation
Size: 178 MB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 5 - Forks: 2
jesusgraterol/binance-futures-dataset-builder
The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.
Language: TypeScript - Size: 20.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
jesusgraterol/bitcoin-lightning-network-stats-dataset-builder
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
Language: TypeScript - Size: 11.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1
jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
Language: TypeScript - Size: 19.5 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
jazibdawre/DatasetCreator
Script for creating a dataset for AI, ML applications
Language: Python - Size: 72.9 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1
daspartho/DistillClassifier
Easily generate synthetic data for classification tasks using LLMs
Language: Python - Size: 1020 KB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 2 - Forks: 0
Mr-Nobody1/DatasetMaker
Language: Jupyter Notebook - Size: 6.84 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
RxstydnR/Image_Patch_Generator
Patch image maker for PNU Learning (https://github.com/RxstydnR/CoSPA).
Language: Python - Size: 1.6 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
HC200ok/manual-data-masking
A lightweight javascript library for manual data masking
Language: JavaScript - Size: 46.8 MB - Last synced: 4 months ago - Pushed: almost 2 years ago - Stars: 18 - Forks: 0
F33RNI/DataSer
Image dataset generator for training neural networks. Capable of randomly modifying various image parameters, enhancing the image dataset
Language: Python - Size: 1.47 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 2 - Forks: 0
joris-gentinetta/wikipedia_trees
Using Markov chains to write wikipedia articles. Including algorithm to construct training sets trough the Wikipedia API.
Language: Jupyter Notebook - Size: 53.7 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
Silviatulli/dyadic-minigrid
multiplayer game and chat for collecting data on human counterfactual explanations in a collaborative learning task
Language: JavaScript - Size: 2.45 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1
suvojit-0x55aa/celebA-HQ-dataset-download
Get started with CelebA-HQ dataset in under 5 mins !
Language: Python - Size: 16.6 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 135 - Forks: 23
KSMubasshir/bd-newspaper-crawlers
A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.
Language: Python - Size: 71.3 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 4 - Forks: 2
yc9701/pansori
Tools for ASR Corpus Generation from Online Video
Language: Python - Size: 13.7 KB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 139 - Forks: 28
Robin-WZQ/AGFD-20K
A Generated Face Dataset: AGFD-20K. A Realistic, High-resolution, Vary & Balanced face dataset, generated by stable diffusion.
Size: 2.97 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 4 - Forks: 0
Kabanosk/noising-website
Website that helps me prepare a dataset for my Engineering Thesis
Language: Python - Size: 2.93 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
LogIN-/fluprint
Import script needed to build FluPRINT database from source :computer:
Language: PHP - Size: 96.7 KB - Last synced: 25 days ago - Pushed: about 5 years ago - Stars: 11 - Forks: 3
darkreactions/ESCALATE_Capture
Data capture and experimental interfacing software for chemistry (part 1 of 2)
Language: Python - Size: 14.9 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 6 - Forks: 2
aqeelanwar/MaskTheFace
Convert face dataset to masked dataset
Language: Python - Size: 230 MB - Last synced: 7 months ago - Pushed: 10 months ago - Stars: 528 - Forks: 152
glaucomunsberg/kootstrap
Kootstrap is a bootstrap to Keras. It is a technique of compile and loading a datasets into a Keras application by means of a few initial instructions that enable the introduction of the rest of the program from an input device. Read more at
Language: Python - Size: 2.4 GB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 0
yfq512/data_generation_tools
datasets generation for deep learning
Language: Python - Size: 480 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
vipchengrui/MASG
microphone array speech generator (MASG) in room acoustic
Language: Jupyter Notebook - Size: 21.1 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 31 - Forks: 10
AbhinavThukral97/FaceRecognition
Built on OpenCV 3.2.0 and Python 3.6.0/Anaconda 4.3.0. Code to detect faces using Haar Cascade and match faces using LBPH (Local Binary Patterns Histogram) Recognition on a live web camera.
Language: Python - Size: 160 KB - Last synced: 7 months ago - Pushed: almost 7 years ago - Stars: 8 - Forks: 2
Ojaswy/IIM-A-IBM-Dataset-Generation-Hackathon
Repo for the 2019 IIM-A - IBM Data set Generation Hackathon.
Language: Go - Size: 3.82 MB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
hernanrazo/split-videos-to-frames
Python script that splits videos into individual frames.
Language: Python - Size: 6.84 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 2
KID-22/CTAR
CTAR dataset and generation code
Language: Python - Size: 990 KB - Last synced: 7 months ago - Pushed: about 2 years ago - Stars: 7 - Forks: 3
inboxpraveen/ASR-Accuracy-Tool
🎙️📝 A powerful Flask-based web application that leverages the latest Hugging Face ASR models to provide real-time speech-to-text (STT) transcripts with an intuitive user interface for easy correction. Perfect for enhancing the quality of training datasets for ASR models. 🚀
Language: Python - Size: 9.05 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
harveybc/q-analyzer
Generate a csv file and a graphic with the correlations between an ideal training signal and each of the channels of a ssa decomposition of the close price and the sum of the channels with a variable number of channels. the objective is to decide the most convenient number of channels that to feed a trading agent in the gym-forex OpenAI environment.
Language: Python - Size: 6.84 KB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 4 - Forks: 1
hdigital/parlgov-snippets
Snippets for data set generation and analyses with ParlGov 🧑🏻💻📊
Language: HTML - Size: 29.3 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 0
jgurakuqi/mitsuba-snapshot-tool
The goal of this project is to develop a powerful and user-friendly tool that allows users to produce a dataset of synthetic images for the purpose of testing Shape from Polarization methods
Language: Python - Size: 102 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
AtelierArith/SegRCDB.jl
Unofficial Julia implementation of SegRCDB.jl
Language: Julia - Size: 135 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
mxchinegod/swan_scrape
High-efficiency text & file scraper with smart tracking for building language model datasets.
Language: Jupyter Notebook - Size: 128 KB - Last synced: 24 days ago - Pushed: 9 months ago - Stars: 2 - Forks: 0
aitor-alvarez/MIR-song-dataset-collection
Scripts to create Music Information Retrieval datasets from streaming services for singer identification tasks
Language: Python - Size: 16.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
ThePhenomenon1/IMBD-Database
Creating your own dataset. Scraping multiple pages of data from the IMDB website, in a single script, to fetch top 1000 movies metadata.
Language: Jupyter Notebook - Size: 43 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
RezuwanHassan262/Last-100-plus-years-Earthquake-Data-Analysis-And-Visualization
Data analysis using the last 100+ years of Earthquake data covering South Asia and Bangladesh region
Language: Jupyter Notebook - Size: 2.75 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 3 - Forks: 0
philipperemy/japanese-street-addresses-scraper
Scraper for Japanese street addresses (住所).
Language: Python - Size: 7.02 MB - Last synced: 24 days ago - Pushed: over 2 years ago - Stars: 7 - Forks: 2
IndraSigicharla/GDSC_Datathon
GDSC House Of Developers Datathon - I'm somewhat of a cybersecurity analyst myself
Size: 8.79 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
MaxAdams0/StarryRight
Dataset preparation for VikX
Language: Python - Size: 11.7 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
fahadyaseen001/locationgrabber
A live location data gather app to generate dataset using MERN stack
Language: JavaScript - Size: 3.01 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
aidanastridge/wheres-willow
Like a famous puzzle book series but it's in a dataset.
Language: Python - Size: 181 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 2 - Forks: 0
EmilianoMusso/sql2tsv
Exports SQL Server Table Data in TSV Format
Language: C# - Size: 11.7 KB - Last synced: 8 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
muzammilaz/Labl.it
Image dataset labeling utility for image classification tasks
Language: Python - Size: 35.4 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1
laggui/image-search-scraper
Dataset builder tool from web image scraping
Language: Python - Size: 4.57 MB - Last synced: 9 months ago - Pushed: about 5 years ago - Stars: 3 - Forks: 2
cbaziotis/twitter-stream-downloader
A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).
Language: Python - Size: 16.6 KB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 22 - Forks: 3
JoshWarn/Multi-Label-Shapes-Toy-Dataset-Generator
An easy-to-use multi-label image dataset generator.
Language: Python - Size: 103 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0