An open API service providing repository metadata for many open source software ecosystems.

GitHub / databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/databrickslabs%2Fdbldatagen

Stars: 401
Forks: 72
Open issues: 28

License: other
Language: Python
Size: 11.1 MB
Dependencies parsed at: Pending

Created at: almost 6 years ago
Updated at: 26 days ago
Pushed at: 2 months ago
Last synced at: 24 days ago

Commit Stats

Commits: 253
Authors: 9
Mean commits per author: 28.11
Development Distribution Score: 0.257
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/databrickslabs/dbldatagen

Topics: data-generation, databricks, datagen, datageneration, datagenerator, delta-live-tables, deltalake, faker, pyspark, python, spark, spark-streaming, synthetic-data

    Loading...