An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-augmentation

tinh2044/YOLO12-UnderWater

YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).

Language: Python - Size: 52.1 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 1 - Forks: 0

AprilArn/yolov8-indonesian-traffic-sign-detection

Indonesian Traffic Sign Detection with Steering Advice System

Language: Jupyter Notebook - Size: 206 MB - Last synced at: about 17 hours ago - Pushed at: about 17 hours ago - Stars: 0 - Forks: 0

aloth/RogueGPT

RogueGPT - (Fake) News Generator, a research project

Language: Python - Size: 50.8 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3 - Forks: 2

albumentations-team/albucore

A high-performance image processing library designed to optimize and extend the Albumentations library with specialized functions for advanced image transformations. Perfect for developers working in computer vision who require efficient and scalable image augmentation.

Language: Python - Size: 216 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 18 - Forks: 6

NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language: C++ - Size: 395 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 5,419 - Forks: 638

DeepTrackAI/DeepTrack2

DeepTrack2 is a modular Python library for generating, manipulating, and analyzing image data pipelines for machine learning and experimental imaging.

Language: Jupyter Notebook - Size: 611 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 201 - Forks: 57

mlcommons/GaNDLF

A generalizable application framework for segmentation, regression, and classification using PyTorch

Language: Python - Size: 69.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 177 - Forks: 86

janaghoniem/Fruits-Recognition-Using-Deep-Learning-with-Data-Augmentation

A deep learning project for classifying 130+ fruits using EfficientNet, ResNet, and MobileNet with custom augmentations and SE blocks. Built on the Fruits-360 dataset.

Language: Jupyter Notebook - Size: 31.7 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

iver56/audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Language: Python - Size: 10.6 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 2,056 - Forks: 203

onuralpszr/kopikatAPI

KopikatAPI is Python library for interacting with the Kopikat API.

Language: Python - Size: 587 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 17 - Forks: 0

HMUNACHI/halo

A Library That Uses Quantized Diffusion Model With Clustered Weights For Efficiently Generating More Image Datasets On-Device.

Language: Python - Size: 1.47 MB - Last synced at: about 7 hours ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 1

bethgelab/imagecorruptions

Python package to corrupt arbitrary images.

Language: Python - Size: 4.95 MB - Last synced at: about 16 hours ago - Pushed at: about 1 month ago - Stars: 436 - Forks: 71

MSD-IRIMAS/Augmenting-TSC-Elastic-Averaging

Augmenting Time Series Datasets with Weighted Elastic Barycenter Averaging

Language: Python - Size: 646 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 10 - Forks: 3

IDanK0/Deepseek-Dataset-Generator

Deepseek-Dataset-Generator crea dataset conversazionali per il fine-tuning di LLM tramite API DeepSeek. Supporta vari formati (ChatML, ShareGPT, Alpaca, JSON, CSV), configurazione semplice via YAML e log dettagliati. Ideale per generare dati realistici e personalizzati in modo rapido.

Language: Python - Size: 165 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

S76AliKo/ttsgan-shm-augmentation

Code and dataset for structural dynamic response synthesis using Transformer-based GAN (TTS-GAN)

Language: Python - Size: 21.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

imedslab/solt

Streaming over lightweight data transformations

Language: Jupyter Notebook - Size: 34.4 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 265 - Forks: 19

yagizefekose6/FloodDetectionNet

Flood Detection using U-Net with Attention Mechanism

Language: Python - Size: 7.81 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

SqueezeAILab/LLM2LLM

[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Language: Python - Size: 209 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 183 - Forks: 13

PrasannaPulakurthi/papers

Size: 14.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

webdataset/webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python - Size: 51.6 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 2,617 - Forks: 207

dna-witch/intel-image-transfer-learning

Language: Jupyter Notebook - Size: 733 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

PrasannaPulakurthi/Foreground-Background-Augmentation

Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled Data.

Language: Python - Size: 3.94 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 0

tuanio/noisy-student-training-asr

Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

Language: Python - Size: 3.08 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 93 - Forks: 15

Muawiya-contact/traffic-sign-vision

Traffic sign recognition AI-Model.

Language: Jupyter Notebook - Size: 391 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

TorchIO-project/torchio

Medical imaging processing for AI applications.

Language: Python - Size: 44.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,216 - Forks: 247

yusufmo1/kew-mnist-synthetic

Improving botanical image classification by 7% using AI-generated synthetic data augmentation on Kew-MNIST dataset

Language: Jupyter Notebook - Size: 6.22 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

sparkfish/augraphy

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Language: Python - Size: 245 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 417 - Forks: 50

vkit-x/vkit

Boosting Document Intelligence

Language: Python - Size: 780 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 23 - Forks: 1

maslychm/gesture_augmentation

Data augmentation for stroke gestures

Language: Jupyter Notebook - Size: 1.87 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 2

goru001/inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need

Language: Python - Size: 812 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 830 - Forks: 160

RamamAgarwal/Project_Wildlife_Classification

This project deals with Wildlife Classification using Computer Vision and Image Processing

Language: Jupyter Notebook - Size: 28.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 2

SuryaVamsi-P/Diabetic-Retinopathy-Detection-with-ResNet50

Built an end-to-end deep learning pipeline using ResNet-50 to classify retinal images into five stages of Diabetic Retinopathy. Applied transfer learning, image preprocessing, and AUC-based evaluation on the APTOS 2019 Kaggle dataset, achieving a 94% validation AUC—offering real-world potential in clinical diagnosis automation.

Language: Python - Size: 2.15 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

vanderschaarlab/synthcity

A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

Language: Python - Size: 6.76 MB - Last synced at: 15 days ago - Pushed at: 30 days ago - Stars: 552 - Forks: 75

Westlake-AI/openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Language: Python - Size: 3.68 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 650 - Forks: 59

AlexanderVNikitin/tsgm

Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets) NeurIPS'24

Language: Python - Size: 9.81 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 168 - Forks: 18

Zehong-Wang/Awesome-Foundation-Models-on-Graphs

A collection of graph foundation models including papers, codes, and datasets.

Size: 3.61 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 17 - Forks: 4

dmey/synthia

📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python

Language: Python - Size: 19.7 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 9

Jorgen98/LineShaper

Interactive tool for generating geographically accurate definitions of public transport routes.

Language: TypeScript - Size: 6.39 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

ipleiria-ciic/data-augmentation-iiot

Advanced technologies and software for mineral resources.

Language: Jupyter Notebook - Size: 44.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Language: Python - Size: 428 KB - Last synced at: about 13 hours ago - Pushed at: about 3 years ago - Stars: 649 - Forks: 135

Paperspace/DataAugmentationForObjectDetection

Data Augmentation For Object Detection

Language: Jupyter Notebook - Size: 9.93 MB - Last synced at: 14 days ago - Pushed at: about 5 years ago - Stars: 1,150 - Forks: 320

levisstrauss/Botanical-Recognition-EfficientNet-Classification

Advanced deep learning solution for flower classification using transfer learning with EfficientNet-B0. Achieves 90.23% accuracy on 102 flower species from Oxford Dataset. Lightweight (17.9MB) model with fine-tuning and data augmentation for optimal performance.

Language: HTML - Size: 340 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

MohamedAliHabib/Brain-Tumor-Detection

Brain Tumor Detection Using Convolutional Neural Networks.

Language: Jupyter Notebook - Size: 43.6 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 291 - Forks: 187

vinthony/ghost-free-shadow-removal

[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

Language: Jupyter Notebook - Size: 3.8 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 314 - Forks: 60

sankoktas/bhi360-fall-detection

Fall detection system using Bosch BHI360 sensor data with time-series labeling, feature extraction, and machine learning (LOSO CV + Gradient Boosting).

Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

Petros626/OpenPCDet-data-augmentor

Modified version of 'OpenPCDet Toolbox for LiDAR-based 3D Object Detection' to to serve as data augmentor.

Language: Python - Size: 27.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

SeanLee97/llano

Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.

Language: Python - Size: 140 KB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 32 - Forks: 3

QData/TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Language: Python - Size: 25.3 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 3,166 - Forks: 415

justinsalamon/scaper

A library for soundscape synthesis and augmentation

Language: Python - Size: 65.6 MB - Last synced at: 14 days ago - Pushed at: about 3 years ago - Stars: 399 - Forks: 61

IAAR-Shanghai/ICSFSurvey

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

Language: Jupyter Notebook - Size: 5.02 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 168 - Forks: 5

asteroid-team/torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python - Size: 2.28 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 1,042 - Forks: 91

425776024/nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Language: Python - Size: 1.05 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1,833 - Forks: 169

surabhiwaingankar/NeutralABSA

Enhancing Neutral Sentiment Classification in Aspect-Based Sentiment Analysis (ABSA)

Language: Jupyter Notebook - Size: 1.55 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

monetjoe/EMelodyGen

EMelodyGen creates emotional ABC melodies via templates. Rough4Q dataset is auto-labeled by small emotional datasets and music psychology to fine-tune backbone after augmentation with 99% correct rate, 91% emotional alignment and effective feature controls.|EMelodyGen通过模板控制ABC生成旋律情感。利用小型情感数据集和音乐心理学为Rough4Q数据集生成标签,经转换和增强后微调骨干网络。生成谱正确率99%,情感准确率为 91%。

Language: Python - Size: 1.71 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 10 - Forks: 0

arundo/tsaug

A Python package for time series augmentation

Language: Python - Size: 21.4 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 354 - Forks: 37

fmenat/DSensDp

Public repository of our research work at IEEE Access

Language: Python - Size: 463 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 3 - Forks: 0

visual-layer/fastdup

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.

Language: Python - Size: 1.73 GB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 1,681 - Forks: 82

jim-schwoebel/allie

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

Language: Python - Size: 275 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 141 - Forks: 35

akiomik/pilgram

A python library for instagram filters

Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 118 - Forks: 17

ZhaoJ9014/face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Language: Python - Size: 8.11 MB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 3,514 - Forks: 759

chetan0220/Predictive-Maintenance-and-Diagnostic-Report-Generation

This project detects failure of machine. It also detects the type of failure and gives instructions to machine operator in simple language using report.

Language: Jupyter Notebook - Size: 58.2 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 1

xlite-dev/torchlm

💎A high level pipeline for face landmarks detection: train, eval, inference (Python/C++) and 100+ data augmentations.

Language: Python - Size: 154 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 258 - Forks: 24

henryhcooperr/FaceRecognition-MultiArchitecture-Pipeline

A comprehensive face recognition system featuring multiple deep learning architectures (CNN, Siamese, ArcFace, Transformer-hybrid), interactive interface, advanced preprocessing pipeline, hyperparameter optimization, and real-time demo capabilities. Includes visualization tools, cross-validation, and support for multiple datasets.

Language: Python - Size: 651 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

SigVarGen/SigVarGen

SigVarGen is a Python framework for time-series signal generation, data augmentation, and anomaly simulation. It creates diverse 1D signal variants under controlled conditions, including idle-state, perturbed, and noisy signals.

Language: Jupyter Notebook - Size: 84 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 3 - Forks: 0

AlecioP/forms-classifier

Classification of documents from an image

Language: Python - Size: 21.5 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

kartikey2807/Bike-Classification-1RT700

Plotting trends, correlations and outliers in the feature space. Classifying 'bike demand' based on weather patterns, using regression.

Language: Python - Size: 12.9 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

mratsim/Amazon-Forest-Computer-Vision

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

Language: Jupyter Notebook - Size: 3.13 MB - Last synced at: 11 days ago - Pushed at: over 5 years ago - Stars: 370 - Forks: 73

Westlake-AI/Awesome-Mixup

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)

Size: 848 KB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 149 - Forks: 11

firmai/deltapy

DeltaPy - Tabular Data Augmentation (by @firmai)

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 546 - Forks: 56

yongzhuo/nlp_xiaojiang

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用

Language: Python - Size: 23.4 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 1,534 - Forks: 392

zhanlaoban/EDA_NLP_for_Chinese

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

Language: Python - Size: 22.5 KB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 1,371 - Forks: 240

Vanhoai/flowers-detection

🌸 Deep Learning project for flower classification and object detection using PyTorch and Keras

Language: Jupyter Notebook - Size: 6.98 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

BeJian/Res-ACWGAN-GP

End-to-end GAN for AHU fault diagnosis with limited fault data.

Language: Python - Size: 313 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 3 - Forks: 0

xdMikayu/CSCI316-Mask-Detection-Transferlearning

Deep Learning-based Face Mask Detection using Transfer Learning with InceptionV3. Built as part of UOWD CSCI316 project. Includes model training, augmentation, and deployment pipeline.

Language: Python - Size: 0 Bytes - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

YuliangXiu/MobilePose

Light-weight Single Person Pose Estimator

Language: Jupyter Notebook - Size: 37 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 642 - Forks: 149

sim1-99/Causality-Medical-Image-Domain-Generalization Fork of cheng-01037/Causality-Medical-Image-Domain-Generalization

[IEEE-TMI'22] Causality-inspired Single-source Domain Generalization for Medical Image Segmentation (code&data-processing pipeline)

Language: Jupyter Notebook - Size: 428 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

Anzar18/FloodDetectionNet

Flood Detection using U-Net with Attention Mechanism

Language: Python - Size: 6.84 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

fmenat/CoM-views

Public repository of our work in all Combinations of Missing (CoM) views in multi-view learning models

Language: Python - Size: 1.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

Size: 18.6 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,015 - Forks: 60

RuoyuChen10/CCL-FSOD

[TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection

Language: Python - Size: 1.67 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 1

AvaAvarai/Java_Tabular_Vis_Toolkit

Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.

Language: Java - Size: 241 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

Twilight-skylight/NeuraScan

An AI-powered platform for brain MRI analysis and tumor classification, providing instant insights and detailed medical reports.

Language: TypeScript - Size: 1.84 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ALebrun-108/BoxSERS

Python package that provides a full range of functionality to process and analyze vibrational spectra (Raman, SERS, FTIR, etc.).

Language: Jupyter Notebook - Size: 20 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 64 - Forks: 15

marian-nmt/sotastream

A library for data streaming and augmentation

Language: Python - Size: 540 KB - Last synced at: about 24 hours ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 3

styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

Size: 120 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 827 - Forks: 77

lielsheri/PatientSignal

Diagnosing Through the Noise: Understanding Patient Self‑Descriptions

Size: 11.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

remydecoupes/GeoNLPlify

:earth_africa: :book: A NLP library for data augmentation focusing on spatial information contained in text

Language: Python - Size: 149 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

iberganzo/ArchaeolDA

ArchaeolDA. Data Augmentation tool for Deep Learning algorithms

Language: Python - Size: 54.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

Arittra-Bag/NeuraScan

An AI-powered platform for brain MRI analysis and tumor classification, providing instant insights and detailed medical reports.

Language: TypeScript - Size: 1.84 MB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Westlake-AI/AutoMix

[ECCV 2022 Oral] AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Language: Python - Size: 23.4 KB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 2

vishal220703/AutoVision

Deep learning project for autonomous vehicles combining semantic segmentation (U-Net variants) and object detection (YOLOv3/YOLOv8) using camera.

Language: Jupyter Notebook - Size: 6.54 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

DonorNearBy/YOLO12-UnderWater

YOLOv12 Underwater Object Detection is an open-source suite for underwater object detection, built on YOLOv12. It offers an end-to-end pipeline with GPU-accelerated training, customizable data augmentations, real-time inference via Gradio, and support for model export (ONNX & PyTorch).

Language: Python - Size: 52.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sunaglmez/Data-Augmentation-for-Sperm-Morphology-Images

The sperm_data_augmentation and sperm_data_preprocessing tools enhance and prepare sperm images 🧬 by applying augmentation techniques 🔄 and preprocessing steps 🧑‍🔬 for model training 🤖, contributing to medical research and analysis in reproductive health 🩺.

Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

GabrieleLozupone/LDAE

Official PyTorch implementation of "Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging". LDAE is a novel unsupervised framework for 3D medical imaging that combines a latent diffusion model with semantic controls.

Language: Jupyter Notebook - Size: 6.82 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0

LirongWu/awesome-graph-self-supervised-learning

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

Size: 444 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1,406 - Forks: 167

angelalim88/RockPaperScissor-Image-Prediction

This project builds a CNN using TensorFlow to classify images of rock, paper, and scissors gestures, achieving high accuracy through data augmentation and training on a dataset of 2,188 images.

Language: Jupyter Notebook - Size: 150 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

VyjayanthiPolapragada/Image_Classifier_CNN_Data_Augmentation

A deep learning project using Convolutional Neural Networks (CNNs) to classify CIFAR-10 images. The model leverages data augmentation, batch normalization, and ReLU activation to improve performance and generalization. Includes training and evaluation scripts for multi-class image classification.

Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

zhunzhong07/Random-Erasing

Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST

Language: Python - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 729 - Forks: 156

LybahNisar/HandWritten-Digit-Recognization-using-Deep-Learning

Handwritten digit recognition on the MNIST dataset using deep learning techniques, achieving up to 99% accuracy with Dense Networks, CNNs, Data Augmentation, Dropout, and Autoencoders.

Language: Jupyter Notebook - Size: 3.51 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mimihime0/CNN-Fashion-MNIST-Classifier

A convolutional neural network (CNN) for classifying the Fashion-MNIST dataset. Includes experiments with regularization techniques, data augmentation, and hyperparameter tuning to optimize model performance, achieving 89.76% test accuracy.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Related Keywords
data-augmentation 1,057 deep-learning 356 machine-learning 210 python 164 pytorch 155 computer-vision 144 tensorflow 139 cnn 122 image-classification 116 keras 109 convolutional-neural-networks 101 transfer-learning 101 image-processing 67 classification 62 nlp 59 neural-networks 43 object-detection 42 data-science 38 opencv 37 deep-neural-networks 36 keras-tensorflow 36 gan 36 neural-network 34 natural-language-processing 33 artificial-intelligence 30 cnn-classification 26 synthetic-data 25 jupyter-notebook 25 python3 24 mixup 24 augmentation 23 generative-adversarial-network 23 numpy 21 cnn-keras 20 fine-tuning 20 resnet 19 data-preprocessing 19 regularization 19 dataset 18 ai 18 image-augmentation 18 bert 18 resnet-50 18 cifar10 17 self-supervised-learning 17 semantic-segmentation 16 vgg16 16 cnn-model 16 text-classification 16 time-series 15 graph-neural-networks 15 unsupervised-learning 14 transformers 14 tensorflow2 14 synthetic-dataset-generation 13 transformer 13 contrastive-learning 13 image-segmentation 13 albumentations 13 representation-learning 12 speech-recognition 12 yolo 12 matplotlib 12 deeplearning 12 generative-model 12 medical-imaging 12 data-generation 12 batch-normalization 12 feature-extraction 12 data-augmentation-strategies 12 robustness 12 language-model 11 semi-supervised-learning 11 audio 11 artificial-neural-networks 11 few-shot-learning 11 image-recognition 11 gans 11 data 11 convolutional-neural-network 11 segmentation 11 diffusion-models 11 data-visualization 11 mnist 11 lstm 10 audio-processing 10 u-net 10 imbalanced-data 9 sentiment-analysis 9 adversarial-attacks 9 llm 9 model-evaluation 9 logistic-regression 9 generative-ai 9 question-answering 9 ensemble-learning 9 dropout 9 adam-optimizer 9 image-generation 8 self-driving-car 8