GitHub topics: video-diffusion-model
OutofAi/GEN3C-MODAL
GRADIO GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Language: Python - Size: 43.9 KB - Last synced at: about 22 hours ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

nv-tlabs/GEN3C
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Language: Jupyter Notebook - Size: 108 MB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 1,017 - Forks: 54

ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
Size: 162 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 2,188 - Forks: 112

km1994/AwesomeMultiModel
【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。
Size: 39.1 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 25 - Forks: 3

AILab-CVC/FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
Language: Python - Size: 81.7 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 416 - Forks: 25

FareedKhan-dev/text2video-from-scratch
A Straightforward, Step-by-Step Implementation of a Video Diffusion Model
Language: Python - Size: 4.48 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 54 - Forks: 10

oooolga/Ctrl-V
👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
Language: Python - Size: 20.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 27 - Forks: 2

MyNiuuu/AniCrafter
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models
Language: Python - Size: 89.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 80 - Forks: 3

westlake-repl/LeanVAE
[ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Language: Python - Size: 44.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 2

chikap421/catlvdm
This repository accompanies the paper "Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation"
Language: Python - Size: 48 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

MingtaoGuo/Relightable-Portrait-Animation
[CVPR 2025] High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model
Language: Python - Size: 10.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 43 - Forks: 3

SamurAIGPT/AI-Faceless-Video-Generator
Generate a video script, voice and a talking face completely with AI
Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 305 - Forks: 49

donydchen/mvsplat360
🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Language: Python - Size: 1.15 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 257 - Forks: 10

SamurAIGPT/Text-To-Video-API
Text to Video API generation documentation
Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 4

sczhou/Upscale-A-Video
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Language: Python - Size: 10.8 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1,228 - Forks: 71

SamurAIGPT/Text-To-Video-AI
Generate video from text using AI
Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 529 - Forks: 190

ant-research/LeviTor
[CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Language: Python - Size: 12.8 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 144 - Forks: 8

desaixie/pa_vdm
CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
Language: Python - Size: 1.95 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 64 - Forks: 0

lukasuz/MotionDreamer
[3DV 2025] MotionDreamer: Exploring Semantic Video Diffusion features for Zero-Shot 3D Mesh Animation
Language: Python - Size: 18.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 27 - Forks: 0

mbreuss/diffusion-literature-for-robotics
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
Size: 220 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 803 - Forks: 41

MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Language: Python - Size: 117 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 732 - Forks: 46

geonyeong-park/Spectral-Motion-Alignment
The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".
Language: Python - Size: 2.64 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 27 - Forks: 2

HyeonHo99/Video-Motion-Customization
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)
Language: Python - Size: 1.7 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 189 - Forks: 8

TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python - Size: 59.3 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 512 - Forks: 21

xie-lab-ml/IV-mixed-Sampler
[ICLR2025] IV-mixed Sampler: Using image diffusion models and video diffusion models together to improve the visual quality and the temporal coherence of video generation.
Language: Python - Size: 165 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 21 - Forks: 0

Kunhao-Liu/ViewExtrapolator
[arXiv 2024] Novel View Extrapolation with Video Diffusion Priors
Language: Python - Size: 55.6 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 101 - Forks: 3

XuweiyiChen/UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Size: 12.5 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 68 - Forks: 2

liuff19/ReconX
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
Size: 2.63 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 519 - Forks: 16

yhZhai/mcm
[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Language: Python - Size: 32 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 45 - Forks: 4

alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Language: Python - Size: 35.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 751 - Forks: 61

notjedi/sd-webui-img2vid
Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

makepixelsdance/makepixelsdance.github.io
Homepage for PixelDance. Paper -> https://arxiv.org/abs/2311.10982
Language: HTML - Size: 419 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1
