GitHub topics: data-generation
sdv-dev/SDV
Synthetic data generation for tabular data
Language: Python - Size: 31.6 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 3,038 - Forks: 365

msaleme/Utilities-Generator-API
MuleSoft RAML-based API for generating realistic utilities test data including meter readings, power consumption, outages, and fault events. Supports development, testing, and demo environments.
Language: RAML - Size: 4.88 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

StevenRice99/Unity-Camera-Calibration
Allows for configuring simulated physical cameras in Unity and extracting screenshots along with calibrated data for external use in pixel matching.
Language: C# - Size: 108 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

tinybirdco/mockingbird
Mockingbird is a mock streaming data generator
Language: TypeScript - Size: 2.57 MB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 119 - Forks: 18

IhebBelhadj/synthetic-time-series-hr-data
A Python project that transforms a static HR employee snapshot into a rich, historized dataset of event logs, perfect for powering HR analytics and testing ELT/DWH pipelines.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

doachyz/IIoT-simulator
An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0 environments using Python, MQTT, and Docker. Emulates configurable production lines with realistic sensor data (vibration, temperature, quality) and predictive alerts.
Language: Python - Size: 10.7 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

igor-olikh/syntetic-data-generator
A comprehensive toolkit for generating high-quality synthetic datasets using Meta's Llama Synthetic Data Kit. Supports PDFs, videos, documents & more for AI fine-tuning and testing.
Size: 393 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

microsoft/genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 329 - Forks: 34

sdv-dev/Copulas
A library to model multivariate data using copulas.
Language: Python - Size: 31.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 595 - Forks: 116

shuttle-hq/synth
The Declarative Data Generator
Language: Rust - Size: 32.3 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 1,425 - Forks: 108

TsingZ0/EvolveGen
Cloud-Edge Collaboration Platform for Automated Synthetic Dataset Generation
Language: Python - Size: 95.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

whatyouhide/stream_data
Data generation and property-based testing for Elixir. ๐ฎ
Language: Elixir - Size: 521 KB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 902 - Forks: 71

synthesized-io/tdk-demo
This is a collection of TDK demo projects that use different databases and options
Language: YAML - Size: 69.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 17 - Forks: 4

Marmiya/VCCSim
VCCSIM is a comprehensive platform designed for 3D mapping and embodied AI agent training in large-scale open-world environments. The system integrates a suite of sensor components specifically engineered for expansive outdoor scenarios, intelligent agents, scene analysis and evaluation modules, and corresponding cross-platform APIs.
Language: C++ - Size: 338 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 10 - Forks: 1

glynnbird/datamaker
Data generator command-line tool and library. Create JSON, CSV, XML data from templates.
Language: JavaScript - Size: 489 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 27 - Forks: 7

PrasannaPulakurthi/papers
Size: 14.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
Language: Python - Size: 1.83 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,410 - Forks: 314

data-catering/data-caterer-example Fork of pflooky/data-caterer-example
Example API implementation for Data Caterer
Language: Scala - Size: 2.08 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

UnrealZoo/unrealzoo-gym Fork of zfw1226/gym-unrealcv
Large-scale photo-realistic virtual worlds for embodied AI
Language: Python - Size: 112 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 140 - Forks: 9

AmritaVeshin/Product-Analytics-User-Behavior-Analysis
Python-based User Behavior Analysis Project conducted in Google Colab. Explore, analyze, and optimize user experiences. ๐๐ #DataScience #ProductAnalytics
Language: Jupyter Notebook - Size: 321 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

gretelai/awesome-synthetic-data
๐ A curated list of resources dedicated to synthetic data
Size: 40 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 131 - Forks: 10

SigVarGen/SigVarGen
SigVarGen is a Python framework for time-series signal generation, data augmentation, and anomaly simulation. It creates diverse 1D signal variants under controlled conditions, including idle-state, perturbed, and noisy signals.
Language: Jupyter Notebook - Size: 84.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

PaulSorensen/linux-tools
Comprehensive privacy-conscious list of Linux applications, tools, and distributions - powered by a generic Python CLI that lets you manage and export your own custom lists in multiple formats.
Language: Python - Size: 179 KB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 29 - Forks: 2

data-catering/data-caterer Fork of pflooky/data-caterer
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
Language: Scala - Size: 2.8 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 57 - Forks: 8

eliabntt/GRADE-RR
GRADE: Generating Animated Dynamic Environments for Robotics Research
Language: Python - Size: 236 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 7

Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Language: Python - Size: 3.68 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 650 - Forks: 59

StaRainJ/MINIMA Fork of LSXI7/MINIMA
[CVPR 2025] MINIMA: Modality Invariant Image Matching
Language: Python - Size: 44.3 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 5 - Forks: 0

BBahtiri/Space-Filling-Algorithm-Data-Generation-Technique
A space-filling procedure to generate data from a constitutive model (viscoelastic-viscoplastic-damage) including moisture, strain rate, and nanoparticle volume fraction dependency.
Language: MATLAB - Size: 78.1 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 2

manumerous/wb_humanoid_mpc
Whole-Body Nonlinear MPC for Realtime Humanoid Loco-Manipulation Planning and Control
Language: C++ - Size: 21.4 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 101 - Forks: 25

open-sciencelab/GraphGen
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Language: Python - Size: 13.8 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 188 - Forks: 19

tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
Language: Python - Size: 4.29 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 312 - Forks: 52

eriknovak/anonipy
Data anonymization package, supporting different anonymization strategies
Language: Python - Size: 1.1 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

kgoldfeld/simstudy
simstudy: Illuminating research methods through data generation
Language: R - Size: 67.6 MB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 84 - Forks: 8

databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Language: Python - Size: 11.1 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 407 - Forks: 74

worldbank/REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 21 days ago - Pushed at: 25 days ago - Stars: 228 - Forks: 28

starfishdata/starfish
Synthetic data generation to fuel AI models
Language: Python - Size: 14 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 30 - Forks: 1

rapiddweller/datamimic
๐ง Model-Driven test data generation platform enabling developers to create realistic, scalable, and privacy-compliant test data. Features model-driven data generation, GDPR compliance, and seamless Python integration.
Language: Python - Size: 14.3 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 25 - Forks: 2

neomatrix369/awesome-ai-ml-dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Language: Jupyter Notebook - Size: 76.5 MB - Last synced at: 28 days ago - Pushed at: 11 months ago - Stars: 1,538 - Forks: 363

cieslarmichal/faker-cxx
C++ Faker library for generating fake (but realistic) data.
Language: C++ - Size: 24.6 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 353 - Forks: 177

mantzaris/BenchmarkDataNLP.jl
Generate synthetic text from a variety of methods, eg. Context Free Grammars (CFGs), with parameterized complexity to test your NLP methods (like LLMs)
Language: Julia - Size: 1.4 MB - Last synced at: 6 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 1

Stranger6667/hypothesis-graphql
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.
Language: Python - Size: 944 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 44 - Forks: 3

sebhaan/TabPFGen
TabPFGen: Synthetic Tabular Data Generation with TabPFN
Language: Python - Size: 140 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

dmey/synthia
๐ ๐ Multidimensional synthetic data generation with Copula and fPCA models in Python
Language: Python - Size: 19.7 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 10

nomemory/mockneat
MockNeat - the modern faker lib.
Language: Java - Size: 2.65 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 534 - Forks: 47

IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language: Jupyter Notebook - Size: 152 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 16,333 - Forks: 1,493

DFKI-NI/syclops
Syclops is a tool for creating synthetic data from 3D virtual environments with photorealistic renderings and pixel-perfect annotations.
Language: Python - Size: 29.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 11 - Forks: 2

microsoft/CodeMixed-Text-Generator
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Language: Jupyter Notebook - Size: 3.79 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 55 - Forks: 12

sdv-dev/DeepEcho
Synthetic Data Generation for mixed-type, multivariate time series.
Language: Python - Size: 760 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 115 - Forks: 16

Mmodarre/AusHealthSim
A comprehensive simulation system that generates realistic health insurance data for the Australian market
Language: Python - Size: 1.86 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

grafana/k6-example-data-generation
Example repository showing how to utilise k6 and faker to load test using generated data
Language: JavaScript - Size: 151 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 57 - Forks: 16

ngzhili/SynTable
The official code implementation for SynTable - A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes
Language: Python - Size: 207 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 0

esotericenderman/scp-secret-laboratory-translations-generator
A piece of code to generate updated SCP:SL translations.
Language: TypeScript - Size: 116 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Preciousclement/Maternal-Experiences-In-Nigeria
This repository contains a Python-based project that generates realistic synthetic data simulating the maternal health journey of 5,000 women in Nigeria.
Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

PrinceV-hub/GAN-Generation-of-Synthetic-Data-
Generate and evaluate synthetic tabular data using GANs with visual comparisons.
Language: Python - Size: 1.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

benkeen/generatedata
A powerful, feature-rich, random test data generator.
Language: TypeScript - Size: 78.5 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,255 - Forks: 615

ComputationalDesignLab/blackbox
Blackbox package provides a way to generate data which can then be used for building/testing a surrogate model or for any other purpose.
Language: Python - Size: 42.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

eyalroz/ssb-dbgen Fork of greenlion/ssb-dbgen
Star Schema Benchmark data set generator (dbgen) - unified repository
Language: C - Size: 174 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 36 - Forks: 15

Infineon/StreamGen
Python framework for generating streams of labeled data.
Language: Python - Size: 64.4 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 0

GoodarzMehr/SimBEV
SimBEV is a configurable and scalable synthetic driving data generation tool based on the CARLA Simulator.
Language: Python - Size: 41.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 0

fillol/IIoT-simulator
An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0 environments using Python, MQTT, and Docker. Emulates configurable production lines with realistic sensor data (vibration, temperature, quality) and predictive alerts.
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Size: 3.59 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 1,632 - Forks: 207

zahramh99/Synthetic-Data-Generation-with-Generative-AI
Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BeamNG/impactgen
Python script and Lua extension using BeamNG.tech to generate low impact crash scenarios and ground truth data for imitation learning.
Language: Python - Size: 487 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 5

YuriyIvon/DatabaseBenchmark
A universal database query benchmark tool
Language: C# - Size: 822 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 40 - Forks: 3

XeTute/Synthetic-Alpaca
Minimal script to generate alpaca-style datasets.
Language: Python - Size: 81.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

br0kej/bin2ml
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
Language: Rust - Size: 1.61 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 69 - Forks: 5

Buba98/regex_enumerator
Enumerate all strings that match a given regex
Language: Python - Size: 113 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

yzhan238/TELEClass
The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", published in WWW 2025.
Language: Python - Size: 144 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Sheikyon/VCFify
๐ A straightforward, non-random contact generator capable of exporting contacts in a vCard-compatible format.
Language: Python - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

louisYen/Gen4Gen
๐๏ธ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Language: Python - Size: 181 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 105 - Forks: 5

trinker/wakefield
Generate random data sets
Language: R - Size: 3.78 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 256 - Forks: 28

IKatsuba/simulon
Simulon API is a HTTP server for generating fake data
Language: TypeScript - Size: 69.3 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

ICTMCG/GenFEND
Let Silence Speak: Enhancing Fake News Detection with Generated Comments from Large Language Models, CIKM 2024.
Language: Python - Size: 92.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 1

tirthajyoti/pydbgen
Random dataframe and database table generator
Language: Python - Size: 687 KB - Last synced at: 25 days ago - Pushed at: about 4 years ago - Stars: 309 - Forks: 58

Bauhinia-AI/evol-character
Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation ๐ญ๐งฉ.
Size: 348 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 0

BUAADreamer/SPN4CIR
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Language: Python - Size: 4.2 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 30 - Forks: 3

ichbincoo/IFakeNumber
IFakeNumber is a lightweight JavaScript library that generates fictitious numerical data for testing and demonstration purposes. It provides easy-to-use functions to create fake numbers with customizable ranges and precision.
Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gretelai/trainer
Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.
Language: Python - Size: 1.8 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 29 - Forks: 7

MarkusJx/datagen
Random data generator based on JSON schemas
Language: Rust - Size: 1.1 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 1

tiago-aguiar-moreira/Bogus.CLI
Bogus.CLI is a command-line tool built on the powerful Bogus library by Brian Chavez. It simplifies fake data generation, allowing you to create flexible and efficient data directly from your terminal.
Language: C# - Size: 113 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

snaplet/docs
Snaplet Documentation
Language: HTML - Size: 13.7 MB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 28 - Forks: 10

tom-draper/persona
Probabilistic character profile generation using real-world demographic data.
Language: Python - Size: 4.17 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 6 - Forks: 1

munichpavel/fake-data-for-learning
Sample interesting fake data for machine and human learning
Language: Python - Size: 475 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

peirong26/PEPSI
[MICCAI 2024] PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI
Language: Python - Size: 1.21 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

Cambalab/fake-data-generator
Just a small open-source script to create fake data given a simple JSON model.
Language: JavaScript - Size: 923 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 14

StefanHeng/ProgGen
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
Language: Python - Size: 62.3 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 2

KryptixOne/Spherical-Data-Generation-For-3D-Meshes
Data Generation: Data is a spherical projection of the 3-D meshes.
Language: Python - Size: 4.86 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Supercili0usMe/table-filler
Table-Filler: A simple Python library for generating test data for SQL, JSON, and CSV. Supports Faker and custom data types.
Language: Python - Size: 26.4 KB - Last synced at: 13 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fanny8148/Fake-Identity-Generator-No-Crack
This repository provides a tool for generating fake identities, useful for privacy protection, testing, or development purposes. The generated data includes random names, addresses, emails, and other personal details to simulate realistic identities.
Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fastent/fastent
custom models for named-entity recognition
Language: Python - Size: 2.58 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

edyan/neuralyzer
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
Language: PHP - Size: 56.8 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 51 - Forks: 11

tom-lord/regexp-examples
Generate strings that match a given regular expression
Language: Ruby - Size: 683 KB - Last synced at: about 5 hours ago - Pushed at: about 1 year ago - Stars: 521 - Forks: 31

paulwritescode/json-api-server
A simple API server that returns a JSON response with a dynamically generated current datetime in ISO 8601 format (UTC), along with some predefined information about a user.
Language: Go - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

QuantLet/DataGenerationForCausalInference
Generates synthetic data to apply simulations for causal inference
Language: R - Size: 958 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 7

1x-technologies/wb-humanoid-mpc
Realtime Physics-Based Procedural Loco-Manipulation Planning and Control
Language: C++ - Size: 21.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 12

ProjectNeura/COOTA
A powerful data-generating python library.
Language: Python - Size: 262 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

RozhakXD/IFakeNumber
Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

pflooky/data-caterer
Data generation and validation tool for any data source
Language: Scala - Size: 331 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 9

Sepuliini/Surrogate_Classifier
Multi-objective optimization project featuring data generation for test problems, surrogate model training, and ELA feature extractionโaiming to classify which surrogates work best for given problem landscapes.
Language: Python - Size: 306 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

noisemix/noisemix
NoiseMix - data generation for natural language
Language: Python - Size: 2.18 MB - Last synced at: 5 days ago - Pushed at: about 7 years ago - Stars: 40 - Forks: 7
