An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-generation

sdv-dev/SDV

Synthetic data generation for tabular data

Language: Python - Size: 31.6 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 3,038 - Forks: 365

msaleme/Utilities-Generator-API

MuleSoft RAML-based API for generating realistic utilities test data including meter readings, power consumption, outages, and fault events. Supports development, testing, and demo environments.

Language: RAML - Size: 4.88 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

StevenRice99/Unity-Camera-Calibration

Allows for configuring simulated physical cameras in Unity and extracting screenshots along with calibrated data for external use in pixel matching.

Language: C# - Size: 108 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

tinybirdco/mockingbird

Mockingbird is a mock streaming data generator

Language: TypeScript - Size: 2.57 MB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 119 - Forks: 18

IhebBelhadj/synthetic-time-series-hr-data

A Python project that transforms a static HR employee snapshot into a rich, historized dataset of event logs, perfect for powering HR analytics and testing ELT/DWH pipelines.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

doachyz/IIoT-simulator

An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0 environments using Python, MQTT, and Docker. Emulates configurable production lines with realistic sensor data (vibration, temperature, quality) and predictive alerts.

Language: Python - Size: 10.7 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

igor-olikh/syntetic-data-generator

A comprehensive toolkit for generating high-quality synthetic datasets using Meta's Llama Synthetic Data Kit. Supports PDFs, videos, documents & more for AI fine-tuning and testing.

Size: 393 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

microsoft/genalog

Genalog is an open source, cross-platform Python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 329 - Forks: 34

sdv-dev/Copulas

A library to model multivariate data using copulas.

Language: Python - Size: 31.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 595 - Forks: 116

shuttle-hq/synth

The Declarative Data Generator

Language: Rust - Size: 32.3 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 1,425 - Forks: 108

TsingZ0/EvolveGen

Cloud-Edge Collaboration Platform for Automated Synthetic Dataset Generation

Language: Python - Size: 95.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

whatyouhide/stream_data

Data generation and property-based testing for Elixir. 🔮

Language: Elixir - Size: 521 KB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 902 - Forks: 71

synthesized-io/tdk-demo

This is a collection of TDK demo projects that use different databases and options

Language: YAML - Size: 69.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 17 - Forks: 4

Marmiya/VCCSim

VCCSIM is a comprehensive platform designed for 3D mapping and embodied AI agent training in large-scale open-world environments. The system integrates a suite of sensor components specifically engineered for expansive outdoor scenarios, intelligent agents, scene analysis and evaluation modules, and corresponding cross-platform APIs.

Language: C++ - Size: 338 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 10 - Forks: 1

glynnbird/datamaker

Data generator command-line tool and library. Create JSON, CSV, XML data from templates.

Language: JavaScript - Size: 489 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 27 - Forks: 7

PrasannaPulakurthi/papers

Size: 14.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

Language: Python - Size: 1.83 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,410 - Forks: 314

data-catering/data-caterer-example Fork of pflooky/data-caterer-example

Example API implementation for Data Caterer

Language: Scala - Size: 2.08 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

UnrealZoo/unrealzoo-gym Fork of zfw1226/gym-unrealcv

Large-scale photo-realistic virtual worlds for embodied AI

Language: Python - Size: 112 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 140 - Forks: 9

AmritaVeshin/Product-Analytics-User-Behavior-Analysis

Python-based User Behavior Analysis Project conducted in Google Colab. Explore, analyze, and optimize user experiences. 📊🚀 #DataScience #ProductAnalytics

Language: Jupyter Notebook - Size: 321 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

gretelai/awesome-synthetic-data

📖 A curated list of resources dedicated to synthetic data

Size: 40 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 131 - Forks: 10

SigVarGen/SigVarGen

SigVarGen is a Python framework for time-series signal generation, data augmentation, and anomaly simulation. It creates diverse 1D signal variants under controlled conditions, including idle-state, perturbed, and noisy signals.

Language: Jupyter Notebook - Size: 84.3 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

PaulSorensen/linux-tools

Comprehensive privacy-conscious list of Linux applications, tools, and distributions - powered by a generic Python CLI that lets you manage and export your own custom lists in multiple formats.

Language: Python - Size: 179 KB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 29 - Forks: 2

data-catering/data-caterer Fork of pflooky/data-caterer

Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.

Language: Scala - Size: 2.8 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 57 - Forks: 8

eliabntt/GRADE-RR

GRADE: Generating Animated Dynamic Environments for Robotics Research

Language: Python - Size: 236 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 7

Westlake-AI/openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Language: Python - Size: 3.68 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 650 - Forks: 59

StaRainJ/MINIMA Fork of LSXI7/MINIMA

[CVPR 2025] MINIMA: Modality Invariant Image Matching

Language: Python - Size: 44.3 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 5 - Forks: 0

BBahtiri/Space-Filling-Algorithm-Data-Generation-Technique

A space-filling procedure to generate data from a constitutive model (viscoelastic-viscoplastic-damage) including moisture, strain rate, and nanoparticle volume fraction dependency.

Language: MATLAB - Size: 78.1 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 2

manumerous/wb_humanoid_mpc

Whole-Body Nonlinear MPC for Realtime Humanoid Loco-Manipulation Planning and Control

Language: C++ - Size: 21.4 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 101 - Forks: 25

open-sciencelab/GraphGen

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Language: Python - Size: 13.8 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 188 - Forks: 19

tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

Language: Python - Size: 4.29 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 312 - Forks: 52

eriknovak/anonipy

Data anonymization package, supporting different anonymization strategies

Language: Python - Size: 1.1 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

kgoldfeld/simstudy

simstudy: Illuminating research methods through data generation

Language: R - Size: 67.6 MB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 84 - Forks: 8

databrickslabs/dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

Language: Python - Size: 11.1 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 407 - Forks: 74

worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 21 days ago - Pushed at: 25 days ago - Stars: 228 - Forks: 28

starfishdata/starfish

Synthetic data generation to fuel AI models

Language: Python - Size: 14 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 30 - Forks: 1

rapiddweller/datamimic

🧠 Model-Driven test data generation platform enabling developers to create realistic, scalable, and privacy-compliant test data. Features model-driven data generation, GDPR compliance, and seamless Python integration.

Language: Python - Size: 14.3 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 25 - Forks: 2

neomatrix369/awesome-ai-ml-dl

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.

Language: Jupyter Notebook - Size: 76.5 MB - Last synced at: 28 days ago - Pushed at: 11 months ago - Stars: 1,538 - Forks: 363

cieslarmichal/faker-cxx

C++ Faker library for generating fake (but realistic) data.

Language: C++ - Size: 24.6 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 353 - Forks: 177

mantzaris/BenchmarkDataNLP.jl

Generate synthetic text using a variety of methods, e.g. Context Free Grammars (CFGs), with parameterized complexity to test your NLP methods (like LLMs)

Language: Julia - Size: 1.4 MB - Last synced at: 6 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 1

Stranger6667/hypothesis-graphql

Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.

Language: Python - Size: 944 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 44 - Forks: 3

sebhaan/TabPFGen

TabPFGen: Synthetic Tabular Data Generation with TabPFN

Language: Python - Size: 140 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

dmey/synthia

📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python

Language: Python - Size: 19.7 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 10

nomemory/mockneat

MockNeat - the modern faker lib.

Language: Java - Size: 2.65 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 534 - Forks: 47

IDEA-Research/Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook - Size: 152 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 16,333 - Forks: 1,493

DFKI-NI/syclops

Syclops is a tool for creating synthetic data from 3D virtual environments with photorealistic renderings and pixel-perfect annotations.

Language: Python - Size: 29.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 11 - Forks: 2

microsoft/CodeMixed-Text-Generator

This tool automates the generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constraint Theory and Matrix Language Theory.

Language: Jupyter Notebook - Size: 3.79 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 55 - Forks: 12

sdv-dev/DeepEcho

Synthetic Data Generation for mixed-type, multivariate time series.

Language: Python - Size: 760 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 115 - Forks: 16

Mmodarre/AusHealthSim

A comprehensive simulation system that generates realistic health insurance data for the Australian market

Language: Python - Size: 1.86 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

grafana/k6-example-data-generation

Example repository showing how to utilise k6 and faker to load test using generated data

Language: JavaScript - Size: 151 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 57 - Forks: 16

ngzhili/SynTable

The official code implementation for SynTable - A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes

Language: Python - Size: 207 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 0

esotericenderman/scp-secret-laboratory-translations-generator

A piece of code to generate updated SCP:SL translations.

Language: TypeScript - Size: 116 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Preciousclement/Maternal-Experiences-In-Nigeria

This repository contains a Python-based project that generates realistic synthetic data simulating the maternal health journey of 5,000 women in Nigeria.

Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

PrinceV-hub/GAN-Generation-of-Synthetic-Data-

Generate and evaluate synthetic tabular data using GANs with visual comparisons.

Language: Python - Size: 1.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

benkeen/generatedata

A powerful, feature-rich, random test data generator.

Language: TypeScript - Size: 78.5 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,255 - Forks: 615

ComputationalDesignLab/blackbox

Blackbox package provides a way to generate data which can then be used for building/testing a surrogate model or for any other purpose.

Language: Python - Size: 42.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

eyalroz/ssb-dbgen Fork of greenlion/ssb-dbgen

Star Schema Benchmark data set generator (dbgen) - unified repository

Language: C - Size: 174 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 36 - Forks: 15

Infineon/StreamGen

Python framework for generating streams of labeled data.

Language: Python - Size: 64.4 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 0

GoodarzMehr/SimBEV

SimBEV is a configurable and scalable synthetic driving data generation tool based on the CARLA Simulator.

Language: Python - Size: 41.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 0

fillol/IIoT-simulator

An advanced Industrial IoT (IIoT) simulator for Smart Factory 4.0 environments using Python, MQTT, and Docker. Emulates configurable production lines with realistic sensor data (vibration, temperature, quality) and predictive alerts.

Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

AgaMiko/data-augmentation-review

List of useful data augmentation resources. Here you will find some uncommon techniques, libraries, links to GitHub repos, papers, and more.

Size: 3.59 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 1,632 - Forks: 207

zahramh99/Synthetic-Data-Generation-with-Generative-AI

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BeamNG/impactgen

Python script and Lua extension using BeamNG.tech to generate low impact crash scenarios and ground truth data for imitation learning.

Language: Python - Size: 487 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 16 - Forks: 5

YuriyIvon/DatabaseBenchmark

A universal database query benchmark tool

Language: C# - Size: 822 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 40 - Forks: 3

XeTute/Synthetic-Alpaca

Minimal script to generate alpaca-style datasets.

Language: Python - Size: 81.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

br0kej/bin2ml

A command line tool for extracting machine learning ready data from software binaries powered by Radare2

Language: Rust - Size: 1.61 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 69 - Forks: 5

Buba98/regex_enumerator

Enumerate all strings that match a given regex

Language: Python - Size: 113 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

yzhan238/TELEClass

The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", published in WWW 2025.

Language: Python - Size: 144 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Sheikyon/VCFify

📞 A straightforward, non-random contact generator capable of exporting contacts in a vCard-compatible format.

Language: Python - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

louisYen/Gen4Gen

๐Ÿž๏ธ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"

Language: Python - Size: 181 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 105 - Forks: 5

trinker/wakefield

Generate random data sets

Language: R - Size: 3.78 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 256 - Forks: 28

IKatsuba/simulon

Simulon API is an HTTP server for generating fake data

Language: TypeScript - Size: 69.3 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

ICTMCG/GenFEND

Let Silence Speak: Enhancing Fake News Detection with Generated Comments from Large Language Models, CIKM 2024.

Language: Python - Size: 92.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 1

tirthajyoti/pydbgen

Random dataframe and database table generator

Language: Python - Size: 687 KB - Last synced at: 25 days ago - Pushed at: about 4 years ago - Stars: 309 - Forks: 58

Bauhinia-AI/evol-character

Based on the Evol-character framework and OpenAI API, enabling fine-grained role-playing data generation 🎭🧩.

Size: 348 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 0

BUAADreamer/SPN4CIR

[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

Language: Python - Size: 4.2 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 30 - Forks: 3

ichbincoo/IFakeNumber

IFakeNumber is a lightweight JavaScript library that generates fictitious numerical data for testing and demonstration purposes. It provides easy-to-use functions to create fake numbers with customizable ranges and precision.

Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gretelai/trainer

Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.

Language: Python - Size: 1.8 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 29 - Forks: 7

MarkusJx/datagen

Random data generator based on JSON schemas

Language: Rust - Size: 1.1 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 1

tiago-aguiar-moreira/Bogus.CLI

Bogus.CLI is a command-line tool built on the powerful Bogus library by Brian Chavez. It simplifies fake data generation, allowing you to create flexible and efficient data directly from your terminal.

Language: C# - Size: 113 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

snaplet/docs

Snaplet Documentation

Language: HTML - Size: 13.7 MB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 28 - Forks: 10

tom-draper/persona

Probabilistic character profile generation using real-world demographic data.

Language: Python - Size: 4.17 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 6 - Forks: 1

munichpavel/fake-data-for-learning

Sample interesting fake data for machine and human learning

Language: Python - Size: 475 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

peirong26/PEPSI

[MICCAI 2024] PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI

Language: Python - Size: 1.21 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

Cambalab/fake-data-generator

Just a small open-source script to create fake data given a simple JSON model.

Language: JavaScript - Size: 923 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 14

StefanHeng/ProgGen

Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"

Language: Python - Size: 62.3 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 2

KryptixOne/Spherical-Data-Generation-For-3D-Meshes

Data Generation: Data is a spherical projection of the 3-D meshes.

Language: Python - Size: 4.86 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Supercili0usMe/table-filler

Table-Filler: A simple Python library for generating test data for SQL, JSON, and CSV. Supports Faker and custom data types.

Language: Python - Size: 26.4 KB - Last synced at: 13 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fanny8148/Fake-Identity-Generator-No-Crack

This repository provides a tool for generating fake identities, useful for privacy protection, testing, or development purposes. The generated data includes random names, addresses, emails, and other personal details to simulate realistic identities.

Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fastent/fastent

custom models for named-entity recognition

Language: Python - Size: 2.58 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

Language: PHP - Size: 56.8 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 51 - Forks: 11

tom-lord/regexp-examples

Generate strings that match a given regular expression

Language: Ruby - Size: 683 KB - Last synced at: about 5 hours ago - Pushed at: about 1 year ago - Stars: 521 - Forks: 31

paulwritescode/json-api-server

A simple API server that returns a JSON response with a dynamically generated current datetime in ISO 8601 format (UTC), along with some predefined information about a user.

Language: Go - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

QuantLet/DataGenerationForCausalInference

Generates synthetic data to apply simulations for causal inference

Language: R - Size: 958 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 7

1x-technologies/wb-humanoid-mpc

Realtime Physics-Based Procedural Loco-Manipulation Planning and Control

Language: C++ - Size: 21.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 12

ProjectNeura/COOTA

A powerful data-generating Python library.

Language: Python - Size: 262 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

RozhakXD/IFakeNumber

Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

pflooky/data-caterer

Data generation and validation tool for any data source

Language: Scala - Size: 331 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 9

Sepuliini/Surrogate_Classifier

Multi-objective optimization project featuring data generation for test problems, surrogate model training, and ELA feature extraction, aiming to classify which surrogates work best for given problem landscapes.

Language: Python - Size: 306 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

noisemix/noisemix

NoiseMix - data generation for natural language

Language: Python - Size: 2.18 MB - Last synced at: 5 days ago - Pushed at: about 7 years ago - Stars: 40 - Forks: 7