GitHub topics: diffusion-transformer
Pengchengpcx/FTEdit
[CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Language: Python - Size: 14.6 KB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 14 - Forks: 0

ivcylc/OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
Language: Python - Size: 2.06 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 575 - Forks: 60

VachanVY/diffusion-transformer
PyTorch and JAX implementation of Scalable Diffusion Models with Transformers | Diffusion Transformers in PyTorch and JAX
Language: Python - Size: 56.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7 - Forks: 0

ByteDance-Seed/SeedVR
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
Language: Python - Size: 2.43 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 164 - Forks: 9

IceClear/SeedVR2
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Size: 1.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 251 - Forks: 13

keshik6/grafting
Exploring Diffusion Transformer Designs via Grafting
Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 18 - Forks: 1

ModelTC/HarmoniCa
[ICML 2025] This is the official PyTorch implementation of "HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration".
Language: Python - Size: 7.92 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 0

River-Zhang/ICEdit
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!
Language: Python - Size: 29 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 1,686 - Forks: 97

thu-ml/RIFLEx
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)
Language: Python - Size: 139 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 675 - Forks: 66

Tencent-Hunyuan/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language: Python - Size: 71.2 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 10,212 - Forks: 908

yangluo7/CAME
The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"
Language: Python - Size: 996 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 91 - Forks: 8

lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python - Size: 706 KB - Last synced at: 26 days ago - Pushed at: 3 months ago - Stars: 542 - Forks: 57

ankitdhall/torchsmith
Torchsmith is a minimalist library that focuses on understanding generative AI by building it using primitive PyTorch operations
Language: Python - Size: 12.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

bytedance/ComfyUI_InfiniteYou
🔥 Official ComfyUI native node for InfiniteYou with FLUX
Language: Python - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 138 - Forks: 26

ML-GSAI/Scaling-Diffusion-Transformers-muP
Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".
Language: Python - Size: 5.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 0

Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language: Python - Size: 58 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,187 - Forks: 91

Fantasy-AMAP/fantasy-talking
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Language: Python - Size: 126 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,124 - Forks: 80

desaixie/pa_vdm
CVPRW 2025 paper "Progressive Autoregressive Video Diffusion Models": https://arxiv.org/abs/2410.08151
Language: Python - Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 64 - Forks: 0

milmor/diffusion-transformer
Implementation of Diffusion Transformer Model in PyTorch
Language: Python - Size: 60.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 9

Can5558/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python - Size: 8.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

wangjiangshan0725/RF-Solver-Edit
[ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!
Language: Python - Size: 93.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 480 - Forks: 12

AdaCache-DiT/AdaCache
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Language: Python - Size: 290 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 146 - Forks: 6

HumanAIGC/omnitalker
Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
Language: JavaScript - Size: 492 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 189 - Forks: 12

bytedance/InfiniteYou
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Language: Python - Size: 13.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,877 - Forks: 133

lucasnewman/f5-tts-swift
Implementation of F5-TTS in Swift using MLX
Language: Swift - Size: 245 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 63 - Forks: 10

Pur1zumu/RIFT-SVC
Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.
Language: Python - Size: 21.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 39 - Forks: 8

K1nght/Unified-Unlearning-w-Remain-Geometry
[NeurIPS2024 (Spotlight)] "Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement" by Zhehao Huang, Xinwen Cheng, JingHao Zheng, Haoran Wang, Zhengbao He, Tao Li, Xiaolin Huang
Language: Python - Size: 8.5 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

shallowdream204/DreamClear
[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Language: Python - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1,125 - Forks: 50

Qi-studio/DiT-VTON
DiT-VTON: Exploring Diffusion Transformer Framework for Multi-Category Virtual Try-On with Integrated Image Customization
Language: JavaScript - Size: 821 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

prathebaselva/FORA
FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling.
Language: Python - Size: 16 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 38 - Forks: 2

sitamgithub-MSIT/sana-litserve
Leverage SANA's capabilities using LitServe.
Language: Python - Size: 294 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

yhwangs/TQ-DiT
TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers
Language: Python - Size: 80.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

explainingai-code/VideoGeneration-PyTorch
This repo implements a video generation model using Latent Diffusion Transformers (Latte) in PyTorch and provides training and inference code for the Moving MNIST and UCF101 datasets
Language: Python - Size: 43.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

explainingai-code/DiT-PyTorch
This repo implements Diffusion Transformers (DiT) in PyTorch and provides training and inference code for the CelebHQ dataset
Language: Python - Size: 46.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 5

TiankaiHang/Min-SNR-Diffusion-Training
[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy
Language: Python - Size: 127 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 232 - Forks: 6

dirmeier/diffusion-transformer
A diffusion transformer implementation in Flax
Language: Python - Size: 95.7 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

u84819482/Nano-diffusion
Minimal DDPM/DiT-based generation of MNIST digits
Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ArchiMickey/Just-a-DiT
A repo of a modified version of Diffusion Transformer
Language: Python - Size: 6.62 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

DiT-3D/DiT-3D
🔥🔥🔥 Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"
Language: Python - Size: 373 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 158 - Forks: 9

gtebbutt/ridge
Language: Python - Size: 88.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

milmor/diffusion-transformer-keras
Implementation of Latent Diffusion Transformer Model in TensorFlow / Keras
Language: Python - Size: 67.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 0

DiffusionMamba/DiM
Official Codebase of "Scaling Diffusion Mamba for Efficient Image Generation"
Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
