Topic: "data-generation"
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language: Jupyter Notebook - Size: 152 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 16,872 - Forks: 1,532

sdv-dev/SDV
Synthetic data generation for tabular data
Language: Python - Size: 31.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,150 - Forks: 380

benkeen/generatedata
A powerful, feature-rich, random test data generator.
Language: TypeScript - Size: 78.5 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2,261 - Forks: 617

AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Size: 3.59 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1,647 - Forks: 207

neomatrix369/awesome-ai-ml-dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Language: Jupyter Notebook - Size: 76.5 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 1,574 - Forks: 361

shuttle-hq/synth
The Declarative Data Generator
Language: Rust - Size: 32.3 MB - Last synced at: 12 days ago - Pushed at: 12 months ago - Stars: 1,443 - Forks: 109

sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
Language: Python - Size: 1.84 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1,442 - Forks: 320

whatyouhide/stream_data
Data generation and property-based testing for Elixir. 🔮
Language: Elixir - Size: 521 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 915 - Forks: 73

Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Language: Python - Size: 3.69 MB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 658 - Forks: 61

sdv-dev/Copulas
A library to model multivariate data using copulas.
Language: Python - Size: 30.5 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 608 - Forks: 117

nomemory/mockneat
MockNeat - the modern faker lib.
Language: Java - Size: 2.65 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 534 - Forks: 47

tom-lord/regexp-examples
Generate strings that match a given regular expression
Language: Ruby - Size: 683 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 522 - Forks: 31

MTG/DeepConvSep
Deep Convolutional Neural Networks for Musical Source Separation
Language: Python - Size: 36.3 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 478 - Forks: 110

databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Language: Python - Size: 11.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 425 - Forks: 79

cieslarmichal/faker-cxx
C++ Faker library for generating fake (but realistic) data.
Language: C++ - Size: 24.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 381 - Forks: 181

open-sciencelab/GraphGen
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Language: Python - Size: 13.9 MB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 335 - Forks: 28

microsoft/genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 332 - Forks: 32

tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
Language: Python - Size: 4.29 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 321 - Forks: 52

tirthajyoti/pydbgen
Random dataframe and database table generator
Language: Python - Size: 687 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 309 - Forks: 58

trinker/wakefield
Generate random data sets
Language: R - Size: 3.78 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 257 - Forks: 28

worldbank/REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 234 - Forks: 29

UnrealZoo/unrealzoo-gym Fork of zfw1226/gym-unrealcv
Large-scale photo-realistic virtual worlds for embodied AI
Language: Python - Size: 112 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 159 - Forks: 10

rapiddweller/rapiddweller-benerator-ce
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Language: Java - Size: 35.3 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 150 - Forks: 26

finos/datahelix 📦
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
Language: Java - Size: 14.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 144 - Forks: 50

gretelai/awesome-synthetic-data
📖 A curated list of resources dedicated to synthetic data
Size: 40 KB - Last synced at: 11 days ago - Pushed at: about 3 years ago - Stars: 133 - Forks: 10

manumerous/wb_humanoid_mpc
Whole-Body Nonlinear MPC for Realtime Humanoid Loco-Manipulation Planning and Control
Language: C++ - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 125 - Forks: 29

tinybirdco/mockingbird
Mockingbird is a mock streaming data generator
Language: TypeScript - Size: 2.57 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 121 - Forks: 18

sdv-dev/DeepEcho
Synthetic Data Generation for mixed-type, multivariate time series.
Language: Python - Size: 767 KB - Last synced at: 20 days ago - Pushed at: 28 days ago - Stars: 116 - Forks: 16

louisYen/Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Language: Python - Size: 181 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 105 - Forks: 5

mjkvaak/ImageDataAugmentor
Custom image data generator for TF Keras that supports the modern augmentation module albumentations
Language: Python - Size: 2.51 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 86 - Forks: 26

kgoldfeld/simstudy
simstudy: Illuminating research methods through data generation
Language: R - Size: 68.7 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 85 - Forks: 8

ykang/gratis
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
Language: R - Size: 3.44 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 76 - Forks: 30

br0kej/bin2ml
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
Language: Rust - Size: 1.61 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 69 - Forks: 5

data-catering/data-caterer Fork of pflooky/data-caterer
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
Language: Scala - Size: 47.6 MB - Last synced at: 7 days ago - Pushed at: 18 days ago - Stars: 68 - Forks: 8

eliabntt/GRADE-RR
GRADE: Generating Animated Dynamic Environments for Robotics Research
Language: Python - Size: 236 MB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 64 - Forks: 7

dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
Language: Python - Size: 19.7 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 64 - Forks: 10

smartcat-labs/ranger
Ranger is contextual data generator used to make sensible data for integration tests or to play with it in the database
Language: Java - Size: 686 KB - Last synced at: 27 days ago - Pushed at: over 5 years ago - Stars: 60 - Forks: 11

MarijaGolubovic/robo_imitate
End-to-end robot control based on generative diffusion model
Language: Python - Size: 132 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 58 - Forks: 7

grafana/k6-example-data-generation
Example repository showing how to utilise k6 and faker to load test using generated data
Language: JavaScript - Size: 151 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 57 - Forks: 16

microsoft/CodeMixed-Text-Generator
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Language: Jupyter Notebook - Size: 3.79 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 55 - Forks: 11

edyan/neuralyzer
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
Language: PHP - Size: 56.8 MB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 51 - Forks: 11

Cambalab/fake-data-generator
Just a small open-source script to create fake data given a simple JSON model.
Language: JavaScript - Size: 923 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 50 - Forks: 14

Stranger6667/hypothesis-graphql
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.
Language: Python - Size: 944 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 46 - Forks: 3

tosiron/jazznet
jazznet dataset of piano patterns for music audio machine learning research
Language: Python - Size: 4.24 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 0

YuriyIvon/DatabaseBenchmark
A universal database query benchmark tool
Language: C# - Size: 822 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 40 - Forks: 3

noisemix/noisemix
NoiseMix - data generation for natural language
Language: Python - Size: 2.18 MB - Last synced at: 16 days ago - Pushed at: over 7 years ago - Stars: 40 - Forks: 7

BUAADreamer/SPN4CIR
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Language: Python - Size: 4.16 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 36 - Forks: 4

eyalroz/ssb-dbgen Fork of greenlion/ssb-dbgen
Star Schema Benchmark data set generator (dbgen) - unified repository
Language: C - Size: 174 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 36 - Forks: 15

fuelen/seed_factory
A toolkit for test data generation
Language: Elixir - Size: 105 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 0

starfishdata/starfish
Synthetic data generation to fuel AI models
Language: Python - Size: 14 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 30 - Forks: 1

1x-technologies/wb-humanoid-mpc
Realtime Physics-Based Procedural Loco-Manipulation Planning and Control
Language: C++ - Size: 21.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 30 - Forks: 12

PaulSorensen/linux-tools
Comprehensive privacy-conscious list of Linux applications, tools, and distributions - powered by a generic Python CLI that lets you manage and export your own custom lists in multiple formats.
Language: Python - Size: 387 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 29 - Forks: 2

gretelai/trainer
Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.
Language: Python - Size: 1.8 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 29 - Forks: 7

Bauhinia-AI/evol-character
Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.
Size: 348 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 0

GoodarzMehr/SimBEV
SimBEV is a configurable and scalable synthetic driving data generation tool based on the CARLA Simulator.
Language: Python - Size: 143 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 28 - Forks: 2

snaplet/docs
Snaplet Documentation
Language: HTML - Size: 13.7 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 10

glynnbird/datamaker
Data generator command-line tool and library. Create JSON, CSV, XML data from templates.
Language: JavaScript - Size: 505 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 27 - Forks: 7

rapiddweller/datamimic
🧠 Model-Driven test data generation platform enabling developers to create realistic, scalable, and privacy-compliant test data. Features model-driven data generation, GDPR compliance, and seamless Python integration.
Language: Python - Size: 14.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 2

farlee2121/FsSpec
FsSpec represents value constraints as data to reuse one constraint declaration for validation, data generation, error explanation, and more.
Language: F# - Size: 342 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 25 - Forks: 0

ngzhili/SynTable
The official code implementation for SynTable - A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes
Language: Python - Size: 207 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 24 - Forks: 0

leezythu/FlexKBQA
FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering
Language: Python - Size: 41.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 1

SimonOuellette35/ARC_gym
ARC gym: a data generation framework for the Abstraction & Reasoning Corpus
Language: Python - Size: 354 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 22 - Forks: 2

sebhaan/TabPFGen
TabPFGen: Synthetic Tabular Data Generation with TabPFN
Language: Python - Size: 152 KB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 3

codelion/ellora
Enhancing LLMs with LoRA
Language: Jupyter Notebook - Size: 2.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 1

umutcanbolat/Autofillr
A browser extension that fills registration forms with randomly but consistently generated fake data.
Language: JavaScript - Size: 513 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 0

ICTMCG/GenFEND
Let Silence Speak: Enhancing Fake News Detection with Generated Comments from Large Language Models, CIKM 2024.
Language: Python - Size: 92.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 1

lmb-freiburg/optical-flow-2d-data-generation
Caffe(v1)-compatible codebase to generate optical flow training data on-the-fly; used for the IJCV 2018 paper "What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?" (http://dx.doi.org/10.1007/s11263-018-1082-6)
Language: C++ - Size: 1.2 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 18 - Forks: 2

alexandrosstergiou/Traffic-Sign-Recognition-basd-on-Synthesised-Training-Data
Using synthetic data in combination with Deep Learning, to determine if a system can be made that will be able to recognise and classify correctly real traffic signs.
Language: Jupyter Notebook - Size: 1.58 GB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 18 - Forks: 10

synthesized-io/tdk-demo
This is a collection of TDK demo projects that use different databases and options
Language: YAML - Size: 69.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 17 - Forks: 4

tarantool/sdvg
Synthetic Data Values Generator
Language: Go - Size: 817 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 17 - Forks: 5

StefanHeng/ProgGen
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
Language: Python - Size: 62.3 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 2

smtlaissezfaire/fixturereplacement
FixtureReplacement rails plugin
Language: Ruby - Size: 517 KB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 4

BeamNG/impactgen
Python script and Lua extension using BeamNG.tech to generate low impact crash scenarios and ground truth data for imitation learning.
Language: Python - Size: 487 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 16 - Forks: 5

VCL3D/BlenderScripts
Scripts for data generation using Blender and 3D datasets like Matterport3D.
Language: Python - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 3

Infineon/StreamGen
Python framework for generating streams of labeled data.
Language: Python - Size: 64.5 MB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 15 - Forks: 0

phrocker/nifi-datasynthesizer
Apache NiFi Data Synthesizer
Language: Java - Size: 2.72 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 3

hammoudiproject/SuperpixelGridMasks
SuperpixelGridMasks is an approach for sensor-based data augmentation towards image classification tasks and so on.
Size: 14.5 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 2

matousc89/signalz
Data generators in Python
Language: Python - Size: 227 KB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 14 - Forks: 3

ma7555/kerasgen 📦
A Keras/Tensorflow compatible image data generator for TripletLoss
Language: Python - Size: 2.31 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 13 - Forks: 10

HKUNLP/SymGen
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
Language: Python - Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 1

daffidwilde/edo
A library for generating artificial datasets through genetic evolution.
Language: Python - Size: 8.24 MB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 0

DFKI-NI/syclops
Syclops is a tool for creating synthetic data from 3D virtual environments with photorealistic renderings and pixel-perfect annotations.
Language: Python - Size: 29.6 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 12 - Forks: 2

algoprog/SynTOD
Synthetic data generation for TODs
Language: Python - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 0

MaxLSB/le-carnet
LeCarnet is a 2 M+ corpus of simple French stories, with end‑to‑end data generation, evaluation and training pipelines
Language: Python - Size: 6.77 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 2

peirong26/PEPSI
[MICCAI 2024] PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI
Language: Python - Size: 1.21 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 1

HKUNLP/ProGen
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
Language: Python - Size: 970 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

shaoyijia/CMG
Code for ECML-PKDD 2022 Paper --- CMG: A Class-Mixed Generation Approach to Out-of-Distribution Detection
Language: Python - Size: 334 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 1

ipjohnson/SimpleFixture
Testing fixture for .Net
Language: C# - Size: 3.3 MB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 11 - Forks: 2

basiralab/GSR-Net
Graph SuperResolution Network using geometric deep learning.
Language: Python - Size: 16.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 0

djin31/VesselExtract
U-net based CNN for segmenting blood vessel and thereafter removal of vessels from fundus image
Language: Jupyter Notebook - Size: 9.43 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 1

Marmiya/VCCSim
VCCSIM is a comprehensive platform designed for 3D mapping and embodied AI agent training in large-scale open-world environments. The system integrates a suite of sensor components specifically engineered for expansive outdoor scenarios, intelligent agents, scene analysis and evaluation modules, and corresponding cross-platform APIs.
Language: C++ - Size: 338 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 10 - Forks: 1

shams-sam/ICD10Data.com
http://icd10data.com/ data scraping
Language: Python - Size: 533 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 2

sykwon/teddy-dream
[VLDB'22] Cardinality Estimation of Approximate Substring Queries using Deep Learning.
Language: Python - Size: 104 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 1

hypervectorio/hypervector-wrapper 📦
Python wrapper for the Hypervector API (https://hypervector.io)
Language: Python - Size: 482 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 7

benkimbuilds/self-driving-car-simulator
A self driving car created using Python, Keras/Tensorflow, Udacity simulator engine.
Language: Python - Size: 88.3 MB - Last synced at: 12 days ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 3

kbernst30/django-data-seeder
A data seeder for models for Django
Language: Python - Size: 41 KB - Last synced at: 9 days ago - Pushed at: about 6 years ago - Stars: 9 - Forks: 0

ironcev/public-talks
My public talks, their abstracts, code snippets, and sample projects
Language: C# - Size: 147 KB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 9 - Forks: 0

munichpavel/fake-data-for-learning
Sample interesting fake data for machine and human learning
Language: Python - Size: 475 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 8 - Forks: 0

AlgoMathITMO/CLSGAN
Synthetic financial time series generation with regime clustering
Language: Jupyter Notebook - Size: 7.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 4

chaturv3di/absynthe
A (branching) Behaviour Synthesiser -- Simulates the generation of application or process logs, where multiple modules (or processes) can execute simultaneously, in a distributed deployment, and dump the log messages in an interleaved manner in a single log file.
Language: Python - Size: 1.37 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3
