Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: datageneration

databrickslabs/dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can generate large simulated or synthetic data sets for testing, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.

Language: Python - Size: 10 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 270 - Forks: 53

ydataai/ydata-synthetic

Synthetic data generators for tabular and time-series data

Language: Jupyter Notebook - Size: 16 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 1,313 - Forks: 220

snowfela/SDV

A mini project on synthetic data generation that implements the CTGAN algorithm on tabular data

Language: Python - Size: 1.86 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 0 - Forks: 0

kevinscaria/TarGEN

Targeted Data Generation with Large Language Models

Language: Jupyter Notebook - Size: 2.24 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 10 - Forks: 2

NextBrain-ai/Data-Anonymizer-Tool

NextBrain's Data Anonymizer Tool irreversibly obscures personal identifiers without storing any data. Aimed at businesses that prioritize data security and compliance, it helps safeguard sensitive information.

Language: Python - Size: 73.2 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1

PawanSirsat/Generate-RandomUser-API

RandomUserGenerator is a web application that utilizes a random user data API to generate and display detailed user profiles

Language: JavaScript - Size: 24.4 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

pflooky/data-caterer-docs

Documentation for Data Caterer

Language: HTML - Size: 6.95 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 2

Rajwrita/NIPResearch

NIP research project working with NIP-RW, DIP, and subcellular location data from Uniprot.org. Work in progress.

Language: Jupyter Notebook - Size: 1.49 MB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

kosmobiker/synthetic_data_sandbox

A repository for experiments with synthetic data generation (bank transactions, text, sequences, etc.)

Language: Jupyter Notebook - Size: 58.6 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

PouyaPouryaie/materializedview

This sample shows how to use a materialized view in the Spring Framework

Language: Java - Size: 57.6 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

JingyuanHe1222/Visual_Dataset_Generation_Basic_English_Words 📦

Visual data generation for the basic English words by Ogden.

Language: Python - Size: 15.6 KB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Sushil-Deore/Predictive-Maintenance

The aim of this study is to predict machine failure by building a classifier model on a predictive maintenance dataset. Class imbalance in the data compromises the performance of the model; this is addressed by assessing oversampling methods with a Multi-Task Learning (MTL) architecture. The study also gauges how auxiliary learning contributes to learning the primary task.

Language: Jupyter Notebook - Size: 5.35 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0