An open API service providing repository metadata for many open source software ecosystems.

Topic: "diffusion-models"

diff-usion/Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML - Size: 1.84 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 11,638 - Forks: 972

Tencent/HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Language: Python - Size: 71.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,694 - Forks: 833

Tencent/Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Language: Python - Size: 79.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 8,423 - Forks: 706

openvinotoolkit/openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

Language: C++ - Size: 844 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,132 - Forks: 2,573

FoundationVision/VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Jupyter Notebook - Size: 620 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 7,408 - Forks: 463

open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook - Size: 31.3 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 7,131 - Forks: 1,079

yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python - Size: 131 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 5,620 - Forks: 529

Fanghua-Yu/SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Language: Python - Size: 10.2 MB - Last synced at: 13 days ago - Pushed at: 9 months ago - Stars: 4,985 - Forks: 424

showlab/Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

Size: 681 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,299 - Forks: 253

bytedance/LatentSync

Taming Stable Diffusion for Lip Sync!

Language: Python - Size: 9.06 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3,589 - Forks: 526

Lightricks/LTX-Video

Official repository for LTX-Video

Language: Python - Size: 126 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 3,339 - Forks: 293

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

Size: 272 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 3,157 - Forks: 263

ali-vilab/VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language: Python - Size: 62.5 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3,099 - Forks: 275

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Size: 197 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 3,024 - Forks: 512

TingsongYu/PyTorch-Tutorial-2nd

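"PyTorch Practical Tutorial" (2nd edition). Whether you are starting from scratch, applying PyTorch to CV, NLP, and LLM projects, or moving on to advanced engineering and deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become excellent deep learning engineers.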

Language: Jupyter Notebook - Size: 84.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2,957 - Forks: 324

deepseek-ai/DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Language: Python - Size: 66.3 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 2,901 - Forks: 346

jy0205/Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Language: Python - Size: 929 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2,612 - Forks: 258

Tencent/MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Language: Python - Size: 122 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 2,308 - Forks: 200

bghira/SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Language: Python - Size: 9.85 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,226 - Forks: 213

Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python - Size: 58 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 2,180 - Forks: 91

andreas128/RePaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Language: Python - Size: 91.8 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 2,071 - Forks: 168

ChenHsing/Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

Size: 162 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 2,067 - Forks: 106

open-mmlab/mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Language: Python - Size: 26.6 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1,970 - Forks: 232

adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Language: Python - Size: 60.5 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,930 - Forks: 141

yang-song/score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Language: Jupyter Notebook - Size: 3.93 MB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 1,891 - Forks: 327

SUDO-AI-3D/zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language: Python - Size: 2.38 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1,876 - Forks: 126

siliconflow/onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Language: Jupyter Notebook - Size: 114 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 1,863 - Forks: 125

junshutang/Make-It-3D

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

Language: Python - Size: 178 MB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 1,841 - Forks: 128

amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

Size: 628 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 1,763 - Forks: 156

eloialonso/diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Language: Python - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 1,739 - Forks: 123

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python - Size: 5.35 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 1,708 - Forks: 75

wangkai930418/awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Size: 617 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1,664 - Forks: 74

LuChengTHU/dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)

Language: Python - Size: 63.5 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,655 - Forks: 125

hymie122/RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

Size: 6.49 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 1,601 - Forks: 110

guochengqian/Magic123

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Language: Jupyter Notebook - Size: 113 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,574 - Forks: 96

TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python - Size: 37 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 1,548 - Forks: 127

MaximeVandegar/Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

Language: Python - Size: 20.5 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,479 - Forks: 156

yang-song/score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Language: Jupyter Notebook - Size: 4.35 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1,476 - Forks: 206

mit-han-lab/nunchaku

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Language: Cuda - Size: 85.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,425 - Forks: 84

menyifang/MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Size: 13.5 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1,412 - Forks: 57

THUDM/ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Language: Python - Size: 4.18 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 1,369 - Forks: 71

ThuCCSLab/Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

Size: 2.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,362 - Forks: 87

bloc97/CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Language: Jupyter Notebook - Size: 62.3 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 1,331 - Forks: 88

PKU-YuanGroup/MagicTime

[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language: Python - Size: 1.08 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1,311 - Forks: 125

Zheng-Chong/CatVTON

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

Language: Python - Size: 16.3 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 1,264 - Forks: 154

wyhuai/DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Language: Python - Size: 14 MB - Last synced at: 15 days ago - Pushed at: 12 months ago - Stars: 1,246 - Forks: 92

davidADSP/Generative_Deep_Learning_2nd_Edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 1,245 - Forks: 485

gcorso/DiffDock

Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

Language: Python - Size: 211 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 1,219 - Forks: 296

Yujun-Shi/DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

Language: Python - Size: 25.3 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 1,207 - Forks: 91

Tencent/HunyuanVideo-I2V

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Language: Python - Size: 146 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1,206 - Forks: 95

muzishen/IMAGDressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

Language: Python - Size: 46.4 MB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1,201 - Forks: 104

Fantasy-Studio/Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language: Python - Size: 18 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,170 - Forks: 99

declare-lab/tango

A family of diffusion models for text-to-audio generation.

Language: Python - Size: 19.5 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1,086 - Forks: 88

opendilab/awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

Size: 634 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 1,084 - Forks: 59

a-r-r-o-w/finetrainers

Memory-optimized training library for diffusion models

Language: Python - Size: 54.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,060 - Forks: 116

OpenBMB/VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models

Language: Python - Size: 7.38 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 1,056 - Forks: 93

lukasHoel/text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language: Python - Size: 8.85 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 1,051 - Forks: 73

Lightricks/ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

Language: Python - Size: 4.45 MB - Last synced at: 2 days ago - Pushed at: 6 days ago - Stars: 1,027 - Forks: 82

PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

Size: 3.04 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 1,027 - Forks: 27

omerbt/MultiDiffusion

Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 1,027 - Forks: 59

wladradchenko/wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

Language: Python - Size: 263 MB - Last synced at: 5 days ago - Pushed at: 22 days ago - Stars: 1,006 - Forks: 102

liuyuan-pal/SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Language: Python - Size: 81.6 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 958 - Forks: 43

open-mmlab/PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos. PIA, your personalized image animation generator, uses text prompts to turn images into wonderful animations.

Language: Python - Size: 79 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 956 - Forks: 75

showlab/MotionDirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language: Python - Size: 177 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 955 - Forks: 55

3DTopia/3DTopia-XL

[CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Language: Python - Size: 72.3 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 949 - Forks: 35

cure-lab/MagicDrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Language: Python - Size: 23.5 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 937 - Forks: 48

phizaz/diffae

Official implementation of Diffusion Autoencoders

Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 910 - Forks: 144

radames/Real-Time-Latent-Consistency-Model

App showcasing multiple real-time diffusion models pipelines with Diffusers

Language: Python - Size: 396 KB - Last synced at: 4 days ago - Pushed at: 29 days ago - Stars: 888 - Forks: 104

NVlabs/ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language: Python - Size: 16.4 MB - Last synced at: 26 days ago - Pushed at: 10 months ago - Stars: 888 - Forks: 49

horseee/DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language: Python - Size: 102 MB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 883 - Forks: 43

UCSC-VLAA/story-adapter

A Training-free Iterative Framework for Long Story Visualization

Language: Python - Size: 280 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 877 - Forks: 123

eric-ai-lab/MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language: Python - Size: 61.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 854 - Forks: 52

ali-vilab/videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Language: Python - Size: 33.3 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 843 - Forks: 77

hkproj/pytorch-stable-diffusion

Stable Diffusion implemented from scratch in PyTorch

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 830 - Forks: 171

shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language: Jupyter Notebook - Size: 57.6 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 819 - Forks: 104

showlab/Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language: Python - Size: 162 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 804 - Forks: 36

mbreuss/diffusion-literature-for-robotics

Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.

Size: 220 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 803 - Forks: 41

songweige/rich-text-to-image

Rich-Text-to-Image Generation

Language: Python - Size: 41.4 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 786 - Forks: 67

lixinustc/Awesome-diffusion-model-for-image-processing

one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment

Size: 1.16 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 758 - Forks: 57

hustvl/GaussianDreamer

[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

Language: Python - Size: 74.7 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 753 - Forks: 39

Boese0601/MagicDance

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Language: Python - Size: 131 MB - Last synced at: 26 days ago - Pushed at: 10 months ago - Stars: 746 - Forks: 64

lzzcd001/MeshDiffusion

Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)

Language: Python - Size: 17.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 746 - Forks: 40

dome272/Paella

Official Implementation of Paella https://arxiv.org/abs/2211.07292v2

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 744 - Forks: 54

MyNiuuu/MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Language: Python - Size: 117 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 732 - Forks: 46

vicgalle/stable-diffusion-aesthetic-gradients

Personalization for Stable Diffusion via Aesthetic Gradients 🎨

Language: Jupyter Notebook - Size: 92.5 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 730 - Forks: 62

yuval-alaluf/Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Language: Jupyter Notebook - Size: 103 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 729 - Forks: 62

ayaanzhaque/instruct-nerf2nerf

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions (ICCV 2023)

Language: Python - Size: 3.89 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 719 - Forks: 58

ali-vilab/TeaCache

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Language: Python - Size: 22.5 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 686 - Forks: 26

OpenTexture/Paint3D

[CVPR 2024] Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model

Language: Python - Size: 38.3 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 685 - Forks: 32

ExponentialML/Text-To-Video-Finetuning 📦

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language: Python - Size: 1.82 MB - Last synced at: about 6 hours ago - Pushed at: over 1 year ago - Stars: 685 - Forks: 109

zubair-irshad/Awesome-Robotics-3D

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

Size: 730 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 680 - Forks: 35

mit-han-lab/distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language: Python - Size: 2.42 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 675 - Forks: 32

PKU-YuanGroup/ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Language: Python - Size: 13.3 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 667 - Forks: 33

Project-MONAI/GenerativeModels 📦

MONAI Generative Models makes it easy to train, evaluate, and deploy generative models and related applications

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 653 - Forks: 90

Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language: Python - Size: 961 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 646 - Forks: 87

teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 646 - Forks: 66

thu-ml/CRM

[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.

Language: Python - Size: 4.38 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 641 - Forks: 54

ChenWu98/cycle-diffusion

[ICCV 2023] A latent space for stochastic diffusion models

Language: Python - Size: 52.5 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 617 - Forks: 36

mikonvergence/DiffusionFastForward

DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

Language: Jupyter Notebook - Size: 4.09 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 601 - Forks: 54

omriav/blended-latent-diffusion

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 594 - Forks: 37

Related Topics
deep-learning: 234 - pytorch: 233 - stable-diffusion: 207 - generative-ai: 173 - diffusion: 151 - image-generation: 147 - generative-model: 142 - machine-learning: 134 - computer-vision: 124 - text-to-image: 98 - generative-models: 73 - ddpm: 72 - video-generation: 62 - python: 61 - aigc: 54 - artificial-intelligence: 52 - ai: 50 - image-editing: 46 - diffusers: 44 - diffusion-model: 34 - text-to-video: 34 - large-language-models: 33 - generative-adversarial-network: 32 - transformers: 32 - gan: 30 - 3d-generation: 29 - super-resolution: 28 - text-to-image-generation: 27 - latent-diffusion: 27 - transformer: 25 - 3d: 24 - vae: 24 - llm: 24 - huggingface: 23 - generative-art: 23 - score-based-generative-modeling: 23 - flow-matching: 23 - score-based-generative-models: 22 - text-to-3d: 22 - jax: 21 - pytorch-lightning: 21 - robotics: 21 - video-editing: 21 - pytorch-implementation: 20 - inpainting: 20 - reinforcement-learning: 20 - image-synthesis: 20 - deep-neural-networks: 19 - image-processing: 19 - neural-networks: 19 - ddim: 18 - controlnet: 18 - awesome: 17 - latent-diffusion-models: 17 - medical-imaging: 17 - eccv2024: 17 - style-transfer: 17 - graph-neural-networks: 17 - controllable-generation: 16 - score-matching: 16 - segmentation: 16 - unet: 16 - consistency-models: 16 - 3d-reconstruction: 15 - conditional-generation: 15 - denoising-diffusion: 14 - fine-tuning: 14 - image-to-3d: 14 - inverse-problems: 14 - torch: 14 - multimodal: 14 - sdxl: 14 - stochastic-differential-equations: 14 - diffusion-transformer: 14 - text2image: 14 - audio-generation: 14 - tts: 13 - genai: 13 - synthetic-data: 13 - anomaly-detection: 13 - cvpr2024: 13 - nlp: 13 - variational-autoencoder: 13 - text-to-video-generation: 13 - mri: 12 - text-generation: 12 - text-to-speech: 12 - llms: 12 - cvpr2025: 12 - awesome-list: 11 - mnist: 11 - tensorflow: 11 - clip: 11 - image-to-image-translation: 11 - image-restoration: 11 - natural-language-processing: 11 - cvpr: 11 - imitation-learning: 11 - video: 11 - speech-synthesis: 10