GitHub topics: multimodal-foundation-model
mahmoodlab/MADELEINE
MADELEINE: multi-stain slide representation learning (ECCV'24)
Language: Python - Size: 22.9 MB - Last synced at: 23 days ago - Pushed at: 2 months ago - Stars: 49 - Forks: 5

MJ-Bench/MJ-Bench
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
Language: Jupyter Notebook - Size: 218 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 43 - Forks: 5

TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Language: Jupyter Notebook - Size: 73.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 144 - Forks: 5
