GitHub topics: diffusion-transformer

Repositories

Pengchengpcx/FTEdit

[CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Language: Python - Size: 14.6 KB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 14 - Forks: 0

ivcylc/OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

Language: Python - Size: 2.06 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 575 - Forks: 60

VachanVY/diffusion-transformer

Pytorch and JAX Implementation of Scalable Diffusion Models with Transformers | Diffusion Transformers in Pytorch and JAX

Language: Python - Size: 56.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7 - Forks: 0

ByteDance-Seed/SeedVR

Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)

Language: Python - Size: 2.43 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 164 - Forks: 9

IceClear/SeedVR2

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Size: 1.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 251 - Forks: 13

keshik6/grafting

Exploring Diffusion Transformer Designs via Grafting

Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 18 - Forks: 1

ModelTC/HarmoniCa

[ICML 2025] This is the official PyTorch implementation of "HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration".

Language: Python - Size: 7.92 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 0

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

Language: Python - Size: 29 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 1,686 - Forks: 97

thu-ml/RIFLEx

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

Language: Python - Size: 139 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 675 - Forks: 66

Tencent-Hunyuan/HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Language: Python - Size: 71.2 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 10,212 - Forks: 908

yangluo7/CAME

The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"

Language: Python - Size: 996 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 91 - Forks: 8

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

Language: Python - Size: 706 KB - Last synced at: 26 days ago - Pushed at: 3 months ago - Stars: 542 - Forks: 57

ankitdhall/torchsmith

Torchsmith is a minimalist library that focuses on understanding generative AI by building it using primitive PyTorch operations

Language: Python - Size: 12.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

bytedance/ComfyUI_InfiniteYou

🔥 Official ComfyUI native node for InfiniteYou with FLUX

Language: Python - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 138 - Forks: 26

ML-GSAI/Scaling-Diffusion-Transformers-muP

Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".

Language: Python - Size: 5.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 0

Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python - Size: 58 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,187 - Forks: 91

Fantasy-AMAP/fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Language: Python - Size: 126 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,124 - Forks: 80

desaixie/pa_vdm

CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151

Language: Python - Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 64 - Forks: 0

milmor/diffusion-transformer

Implementation of Diffusion Transformer Model in Pytorch

Language: Python - Size: 60.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 9

Can5558/ICEdit

Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"

Language: Python - Size: 8.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

wangjiangshan0725/RF-Solver-Edit

[ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!

Language: Python - Size: 93.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 480 - Forks: 12

AdaCache-DiT/AdaCache

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Language: Python - Size: 290 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 146 - Forks: 6

HumanAIGC/omnitalker

Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

Language: JavaScript - Size: 492 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 189 - Forks: 12

bytedance/InfiniteYou

🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Language: Python - Size: 13.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,877 - Forks: 133

lucasnewman/f5-tts-swift

Implementation of F5-TTS in Swift using MLX

Language: Swift - Size: 245 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 63 - Forks: 10

Pur1zumu/RIFT-SVC

Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.

Language: Python - Size: 21.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 39 - Forks: 8

K1nght/Unified-Unlearning-w-Remain-Geometry

[NeurIPS2024 (Spotlight)] "Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement" by Zhehao Huang, Xinwen Cheng, JingHao Zheng, Haoran Wang, Zhengbao He, Tao Li, Xiaolin Huang

Language: Python - Size: 8.5 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

shallowdream204/DreamClear

[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Language: Python - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1,125 - Forks: 50

Qi-studio/DiT-VTON

DiT-VTON: Exploring Diffusion Transformer Framework for Multi-Category Virtual Try-On with Integrated Image Customization

Language: JavaScript - Size: 821 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

prathebaselva/FORA

FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.

Language: Python - Size: 16 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 38 - Forks: 2

sitamgithub-MSIT/sana-litserve

Leverage SANA's capabilities using LitServe.

Language: Python - Size: 294 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

yhwangs/TQ-DiT

TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers

Language: Python - Size: 80.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

explainingai-code/VideoGeneration-PyTorch

This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference code on Moving mnist dataset and UCF101 dataset

Language: Python - Size: 43.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

explainingai-code/DiT-PyTorch

This repo implements Diffusion Transformers(DiT) in PyTorch and provides training and inference code on CelebHQ dataset

Language: Python - Size: 46.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 5

TiankaiHang/Min-SNR-Diffusion-Training

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Language: Python - Size: 127 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 232 - Forks: 6

dirmeier/diffusion-transformer

A diffusion transformer implementation in Flax

Language: Python - Size: 95.7 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

u84819482/Nano-diffusion

Minimal DDPM/DiT-based generation of MNIST digits

Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ArchiMickey/Just-a-DiT

A repo of a modified version of Diffusion Transformer

Language: Python - Size: 6.62 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

DiT-3D/DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language: Python - Size: 373 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 158 - Forks: 9

gtebbutt/ridge

Language: Python - Size: 88.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

milmor/diffusion-transformer-keras

Implementation of Latent Diffusion Transformer Model in Tensorflow / Keras

Language: Python - Size: 67.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 0

DiffusionMamba/DiM

Official Codebase of "Scaling Diffusion Mamba for Efficient Image Generation"

Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Keywords

diffusion-transformer 42 diffusion-models 21 diffusion 7 image-generation 7 image-editing 6 dit 6 video-generation 5 pytorch 5 flow-matching 4 flux 3 text-to-image 3 transformer 3 transformers 3 python 3 deep-learning 3 rectified-flow 3 generative-model 2 diffusers 2 in-context 2 long-video-generation 2 editing-image 2 caching 2 mlx 2 text-to-speech 2 tts 2 talking-head 2 ai 2 video-restoration 2 one-step-diffusion-model 2 image-restoration 2 diffusion-model 2 mnist 2 face 2 identity-preserving 2 personalization 2 research 2 jax 2 state-space-models 1 audio-visual 1 efficiency 1 content-adaptive 1 hyperparameter-tuning 1 video-inversion 1 video-editing 1 mup 1 scaling 1 opensora 1 image-inversion 1 video-diffusion-model 1 autoregressive 1 aigc 1 generation-models 1 latte 1 latent-diffusion-models 1 post-training-quantization 1 litserve 1 lightning-ai 1 fastapi 1 artificial-intelligence 1 efficient-diffusion-training 1 faster-sampling 1 flax 1 denoising-diffusion-models 1 virtualtryon 1 3d-shape-generation 1 point-clouds 1 super-resolution 1 restoration 1 efficient-deep-learning 1 pixelart 1 steepest-descent 1 stable-diffusion 1 machine-unlearning 1 svc-model 1 svc 1 singing-voice-conversion 1 ai-voice-clone 1 mamba 1 swift 1 mlx-swift 1 real-time 1 comfyui-nodes 1 pixart 1 icml-2025 1 icml 1 feature-caching 1 acceleration 1 text-to-image-generation 1 model-grafting 1 grafting 1 architecture-research 1 pytorch-implementation 1 jax-implementation 1 vall-e 1 text-to-music-transformer 1 text-to-music 1 text-to-audio-ai 1 text-to-audio 1 music-generation 1 music-ai-architectures 1