GitHub topics: synthetic-dataset
sileod/unigram
Language: Python - Size: 84 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

Unity-Technologies/SynthDet 📦
SynthDet - An end-to-end object detection pipeline using synthetic data
Language: C# - Size: 2.19 MB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 373 - Forks: 55

Adhishtanaka/SysGen
SysGen is a tool that creates high-quality synthetic datasets using the Gemini API. It analyzes Markdown documents to generate realistic and diverse examples for machine learning, software testing, and data analysis.
Language: Python - Size: 13.7 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

0x5844/PhishNet
PhishNet is an experimental research project that uses Reinforced Self-Training (ReST), human-aligned through crafted instructions and fine-tuned models, to build a high-quality synthetic dataset of phishing emails.
Language: Python - Size: 310 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

nachoDRT/MERIT-Dataset
The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, an area the authors are actively working on. This repository is actively maintained, and new features are continuously being added.
Language: Python - Size: 495 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

MartinKalema/Power-Distribution-Modelling
Power Distribution Modelling for cea and cel algorithms
Language: Jupyter Notebook - Size: 2.65 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mehdi-aitaryane/make_regression
A Python application for generating and plotting synthetic regression datasets.
Language: Jupyter Notebook - Size: 830 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets
Synthetic and real datasets of water-flow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
Size: 1.5 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ElsevierSoftwareX/SOFTX-D-20-00055 Fork of agsoto/webgenerator
Open-source software for generating synthetic web-based user interface and content datasets. To cite this Original Software Publication: https://www.sciencedirect.com/science/article/pii/S2352711022000073
Size: 33.8 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0
