An open API service providing repository metadata for many open source software ecosystems.

GitHub / FoundationVision 1 Repository

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python - Size: 5.35 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,749 - Forks: 77

FoundationVision/VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Jupyter Notebook - Size: 620 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 7,872 - Forks: 480

FoundationVision/GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python - Size: 22.3 MB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 1,122 - Forks: 69

FoundationVision/Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Language: Python - Size: 13.5 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 564 - Forks: 43

FoundationVision/UniTok

A Unified Tokenizer for Visual Generation and Understanding

Language: Python - Size: 30.7 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 271 - Forks: 5

FoundationVision/Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Language: Python - Size: 10.1 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1,210 - Forks: 55

FoundationVision/VNext

Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral))

Language: Python - Size: 53.7 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 615 - Forks: 55

FoundationVision/Liquid

Liquid: Language Models are Scalable and Unified Multi-modal Generators

Language: Python - Size: 31.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 353 - Forks: 24

FoundationVision/OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Language: Python - Size: 68.9 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 287 - Forks: 7

FoundationVision/UniRef

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

Language: Python - Size: 14.9 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 238 - Forks: 15

FoundationVision/vaex

🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

Language: Python - Size: 57.6 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 90 - Forks: 5

FoundationVision/GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Language: Python - Size: 14.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 167 - Forks: 7

FoundationVision/Autoregressive-Models-in-Vision-Survey Fork of ChaofanTao/Autoregressive-Models-in-Vision-Survey

The paper collections for the autoregressive models in vision.

Size: 7.88 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 9 - Forks: 0

FoundationVision/FlashVideo

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Language: Python - Size: 1.45 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 384 - Forks: 24

FoundationVision/flashvideo-page

Language: HTML - Size: 586 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

FoundationVision/Goku

Goku: Generative Flow Kit for Unified Image-Video Creation

Language: JavaScript - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

FoundationVision/infinity.project

Language: HTML - Size: 36 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

FoundationVision/.github

Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0