Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: foundation-models

m6129/VKR

Investigation of the capabilities of foundations models in the context of time series forecasting

Language: Jupyter Notebook - Size: 5.81 MB - Last synced: 33 minutes ago - Pushed: about 4 hours ago - Stars: 5 - Forks: 0

uni-medical/STU-Net

The largest pre-trained medical image segmentation model (1.4B parameters) based on the largest public dataset (>100k annotations), up until April 2023.

Language: Python - Size: 43.5 MB - Last synced: about 8 hours ago - Pushed: 1 day ago - Stars: 230 - Forks: 23

MileBench/MileBench

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

Language: Python - Size: 3.51 MB - Last synced: about 10 hours ago - Pushed: about 11 hours ago - Stars: 15 - Forks: 0

shengchaochen82/Awesome-Foundation-Models-for-Weather-and-Climate

A comprehesive survey about foundation models for weather and cliamte data understanding.

Size: 129 KB - Last synced: about 11 hours ago - Pushed: about 12 hours ago - Stars: 73 - Forks: 14

qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM

A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.

Size: 193 KB - Last synced: about 17 hours ago - Pushed: about 1 month ago - Stars: 767 - Forks: 53

uncbiag/Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

Size: 196 KB - Last synced: about 17 hours ago - Pushed: 1 day ago - Stars: 588 - Forks: 26

llm-jp/awesome-japanese-llm

日本語LLMまとめ - Overview of Japanese LLMs

Size: 6.12 MB - Last synced: about 17 hours ago - Pushed: 2 days ago - Stars: 775 - Forks: 21

amazon-science/chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language: Python - Size: 560 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 1,800 - Forks: 223

hyp1231/awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

Size: 164 KB - Last synced: about 16 hours ago - Pushed: 3 days ago - Stars: 1,029 - Forks: 77

autodistill/autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language: Python - Size: 2.49 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1,566 - Forks: 122

kaiko-ai/eva

Evaluation framework for oncology foundation models (FMs)

Language: Python - Size: 4.49 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 44 - Forks: 2

JerryX1110/awesome-segment-anything-extensions

Segment-anything related awesome extensions/projects/repos.

Size: 64.5 KB - Last synced: about 15 hours ago - Pushed: 11 months ago - Stars: 328 - Forks: 12

microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python - Size: 64.2 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 18,564 - Forks: 2,394

BeileiCui/EndoDAC

[MICCAI'2024] EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera

Language: Python - Size: 1.44 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 4 - Forks: 0

SaberaTalukder/TOTEM

The official code 👩‍💻 for - TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis

Language: Python - Size: 58.6 KB - Last synced: 4 days ago - Pushed: 3 months ago - Stars: 46 - Forks: 4

jusiro/FLAIR

FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.

Language: Python - Size: 1.35 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 56 - Forks: 5

FoundationVision/Groma

Grounded Multimodal Large Language Model with Localized Visual Tokenization

Language: Python - Size: 13.5 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 374 - Forks: 50

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python - Size: 13.4 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 16,707 - Forks: 1,796

HazyResearch/meerkat

Creative interactive views of any dataset.

Language: Python - Size: 66.5 MB - Last synced: 1 day ago - Pushed: 3 months ago - Stars: 814 - Forks: 42

alan-turing-institute/foundation-models-reading-group

Information and materials for the Turing's Foundation Models reading group.

Language: Jupyter Notebook - Size: 287 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 25 - Forks: 3

RoyalSkye/Routing-MVMoE

[ICML 2024] "MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts"

Language: Python - Size: 374 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 19 - Forks: 0

mazurowski-lab/finetune-SAM

This is an official repo for fine-tuning SAM to customized medical images.

Language: Python - Size: 1.76 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 37 - Forks: 4

westlake-repl/IDvs.MoRec

End-to-end Training for Multimodal Recommendation Systems

Language: Python - Size: 57.1 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 105 - Forks: 11

aws-samples/generative-ai-sagemaker-cdk-demo

Deploy Generative AI models from Amazon SageMaker JumpStart using AWS CDK

Language: Python - Size: 8.25 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 66 - Forks: 23

mlmed/torchxrayvision

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

Language: Jupyter Notebook - Size: 43.8 MB - Last synced: 4 days ago - Pushed: 6 days ago - Stars: 841 - Forks: 207

baaivision/EVA

EVA Series: Visual Representation Fantasies from BAAI

Language: Python - Size: 8.61 MB - Last synced: 6 days ago - Pushed: 2 months ago - Stars: 1,993 - Forks: 142

OFA-Sys/ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Language: Python - Size: 29.9 MB - Last synced: 6 days ago - Pushed: 6 months ago - Stars: 850 - Forks: 53

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Language: Python - Size: 1.28 MB - Last synced: 7 days ago - Pushed: 22 days ago - Stars: 682 - Forks: 53

CLUEbenchmark/SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

Size: 24.3 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 2,640 - Forks: 88

baaivision/Emu

Emu Series: Generative Multimodal Models from BAAI

Language: Python - Size: 46.3 MB - Last synced: 6 days ago - Pushed: 2 months ago - Stars: 1,506 - Forks: 78

som-shahlab/femr

FEMR (Framework for Electronic Medical Records) provides tooling for large-scale, self-supervised learning using electronic health records

Language: Python - Size: 41.4 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 81 - Forks: 11

mxliu/ACTION-Software-for-Functional-MRI-Analysis

Open-Source Python Software for Functional MRI Analysis

Language: Python - Size: 143 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 0 - Forks: 0

pnnl/cactus

LLM Agent that leverages cheminformatics tools to provide informed responses.

Language: Jupyter Notebook - Size: 14.5 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 2 - Forks: 0

pyxu-org/pyxu

Modular and scalable computational imaging in Python with GPU/out-of-core computing.

Language: Python - Size: 284 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 109 - Forks: 15

baaivision/Uni3D

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Language: Python - Size: 6.05 MB - Last synced: 6 days ago - Pushed: 4 months ago - Stars: 401 - Forks: 20

ai4co/awesome-fm4co

Recent research papers about Foundation Models for Combinatorial Optimization

Size: 21.5 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 39 - Forks: 3

elte-nlp/elte-nlp-course

NLP & FM Lecture Slides

Language: Jupyter Notebook - Size: 138 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 22 - Forks: 2

MMMU-Benchmark/MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language: Python - Size: 3.32 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 254 - Forks: 16

lyy1994/awesome-data-contamination

The Paper List on Data Contamination for Large Language Models Evaluation.

Size: 356 KB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 22 - Forks: 1

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python - Size: 19.2 MB - Last synced: 6 days ago - Pushed: almost 2 years ago - Stars: 152 - Forks: 17

Haiyang-W/GiT

Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language: Python - Size: 12.4 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 211 - Forks: 9

WisconsinAIVision/ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Language: Python - Size: 17.4 MB - Last synced: 16 days ago - Pushed: 18 days ago - Stars: 165 - Forks: 8

westlake-repl/MicroLens

A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).

Language: Python - Size: 62.5 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 90 - Forks: 5

jqin4749/MindVideo

Official code base for MinD-Video

Language: Python - Size: 7.78 MB - Last synced: 12 days ago - Pushed: 5 months ago - Stars: 350 - Forks: 25

ShadowXZT/DOFA-pytorch

Code for Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities

Language: Jupyter Notebook - Size: 510 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 13 - Forks: 1

zhu-xlab/DOFA

Code for Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities

Language: Jupyter Notebook - Size: 903 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 36 - Forks: 1

emadeldeen24/TSLANet

[ICML 2024] A novel, efficient approach combining convolutional operations with adaptive spectral analysis as a foundation model for different time series tasks

Language: Python - Size: 286 KB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 22 - Forks: 2

yifanzhang-pro/M-MAE

Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

Language: Python - Size: 72.3 KB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 8 - Forks: 3

yifanzhang-pro/Matrix-SSL

Official implementation of ICML 2024 paper "Matrix Information Theory for Self-supervised Learning" (https://arxiv.org/abs/2305.17326)

Language: Python - Size: 46.9 KB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 12 - Forks: 4

WING-NUS/cs6101

The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.

Language: JavaScript - Size: 24.6 MB - Last synced: 6 days ago - Pushed: 5 months ago - Stars: 37 - Forks: 44

kenza-ily/foundationmodel_segmentation

Decoding the Learned Features of Masked Autoencoders in Semantic Segmentation Tasks

Language: Python - Size: 50.1 MB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 0 - Forks: 0

scallop-lang/scallop

Framework and Language for Neurosymbolic Programming. Join Our Discord: https://discord.gg/RavzdND229

Language: Rust - Size: 7.52 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 164 - Forks: 8

ACEsuit/mace-mp

MACE-MP models

Language: Shell - Size: 13.7 KB - Last synced: 18 days ago - Pushed: 21 days ago - Stars: 24 - Forks: 3

robot-learning-freiburg/SPINO

Few-Shot Panoptic Segmentation With Foundation Models

Language: Python - Size: 4.74 MB - Last synced: 17 days ago - Pushed: about 2 months ago - Stars: 22 - Forks: 2

Luodian/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python - Size: 7.39 MB - Last synced: 22 days ago - Pushed: 3 months ago - Stars: 3,447 - Forks: 239

MR-HosseinzadehTaher/Eden

[CVPR 2024] Official PyTorch Implementation for Adam

Language: Python - Size: 16.8 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 8 - Forks: 2

NASA-IMPACT/hls-foundation-os

This repository contains examples of fine-tuning Harmonized Landsat and Sentinel-2 (HLS) Prithvi foundation model.

Language: Jupyter Notebook - Size: 3.92 MB - Last synced: 22 days ago - Pushed: 3 months ago - Stars: 257 - Forks: 66

ashiq24/CoDA-NO

Codomain attention neural operator for single to multi-physics PDE adaptation.

Language: Python - Size: 1.44 MB - Last synced: 23 days ago - Pushed: 23 days ago - Stars: 12 - Forks: 2

AlaaLab/WebCP

[ NeurIPS 2023 R0-FoMo Workshop ] Official Codebase for "Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data"

Language: Jupyter Notebook - Size: 302 KB - Last synced: 6 days ago - Pushed: about 2 months ago - Stars: 4 - Forks: 1

ashleykleynhans/llava-docker

Docker image for LLaVA: Large Language and Vision Assistant

Language: Shell - Size: 112 KB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 61 - Forks: 13

baaivision/tokenize-anything

Tokenize Anything via Prompting

Language: Jupyter Notebook - Size: 6.71 MB - Last synced: 6 days ago - Pushed: about 1 month ago - Stars: 435 - Forks: 16

MrGiovanni/ModelsGenesis

[MICCAI 2019] [MEDIA 2020] Models Genesis

Language: Jupyter Notebook - Size: 22 MB - Last synced: 6 days ago - Pushed: 3 months ago - Stars: 722 - Forks: 139

jhuapl-fomo/ralf

A lightweight library to support the development of applications using LLMs

Language: Python - Size: 440 KB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 5 - Forks: 0

OpenGVLab/PonderV2

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Language: Python - Size: 869 KB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 298 - Forks: 5

time-series-foundation-models/lag-llama

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Language: Python - Size: 208 KB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 949 - Forks: 97

deepseek-ai/DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python - Size: 12.2 MB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 1,506 - Forks: 151

mims-harvard/SPECTRA

Spectral Framework For AI Model Evaluation

Language: Roff - Size: 70 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 13 - Forks: 1

mims-harvard/UniTS

A unified time series model.

Language: Python - Size: 66.4 KB - Last synced: 27 days ago - Pushed: about 2 months ago - Stars: 290 - Forks: 32

OpenGVLab/InternVideo

Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 39.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 890 - Forks: 56

mlsquare/fedem

Library for Federated Emergence & Foundation Models

Language: Python - Size: 5.81 MB - Last synced: 27 days ago - Pushed: about 1 month ago - Stars: 9 - Forks: 1

aws-samples/foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.

Language: Jupyter Notebook - Size: 74.9 MB - Last synced: 30 days ago - Pushed: about 1 month ago - Stars: 77 - Forks: 8

UCSC-VLAA/MixCon3D

[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"

Language: Python - Size: 276 KB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 15 - Forks: 1

Azure/intelligent-app-workshop

Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamline backend workflows. Leverages Microsoft's Copilot stack, Semantic Kernel and Azure primitives to offer an engaging and comprehensive introduction to AI-infused app development and deployment

Language: Python - Size: 37.9 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 139 - Forks: 28

jianglongye/featurenerf

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023

Language: Python - Size: 0 Bytes - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 1 - Forks: 0

yasserben/CLOUDS

[CVPR 2024] Official Implementation of Collaborating Foundation models for Domain Generalized Semantic Segmentation

Language: Python - Size: 812 KB - Last synced: 16 days ago - Pushed: 5 months ago - Stars: 34 - Forks: 0

artpli/humanoid-bench Fork of carlosferrazza/humanoid-bench

My Dear Robots

Language: Python - Size: 87.9 MB - Last synced: 30 days ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

zjunlp/KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

Size: 78 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 564 - Forks: 39

IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

Size: 15 MB - Last synced: 27 days ago - Pushed: 2 months ago - Stars: 131 - Forks: 3

yifanzhang-pro/Kernel-InfoNCE

Official implementation of ICLR 2024 paper "Contrastive Learning Is Spectral Clustering On Similarity Graph" (https://arxiv.org/abs/2303.15103)

Language: Python - Size: 30.3 KB - Last synced: 30 days ago - Pushed: 3 months ago - Stars: 11 - Forks: 0

yoshall/Awesome-Trajectory-Computing

A professional list of Deep Learning and Large (Language) Models (LM, LLM, FM) for Trajectory Data Management and Mining.

Size: 25.4 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 53 - Forks: 2

yoshall/Awesome-Multimodal-Urban-Computing

A professional list on Multi-modal Data Fusion Models and Key Datasets for Urban Computing.

Size: 2.23 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 47 - Forks: 5

NVlabs/EmerNeRF

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Language: Python - Size: 2.2 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 478 - Forks: 34

tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook - Size: 190 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1,062 - Forks: 156

OpenGVLab/Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python - Size: 19.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2,636 - Forks: 212

ml6team/fondant

Production-ready data processing made easy and shareable

Language: Python - Size: 23 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 316 - Forks: 24

VisualWebBench/VisualWebBench

Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

Language: Python - Size: 2.95 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 9 - Forks: 0

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python - Size: 29.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 37,740 - Forks: 4,233

VectorInstitute/odyssey

A toolkit for developing foundation models using Electronic Health Record (EHR) data.

Language: Jupyter Notebook - Size: 4.22 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 0

gitana/sdk

Gitana SDK

Language: CSS - Size: 60.5 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

salute-developers/GigaAM

Foundational Model for Speech Recognition Tasks

Size: 1.02 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 58 - Forks: 2

mlc-ai/mlc-assistant

Chat with your documents and improve your writing using large-language models within your browser.

Language: JavaScript - Size: 103 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 18 - Forks: 3

om-ai-lab/RS5M

RS5M: a large-scale vision language dataset for remote sensing

Language: Python - Size: 44.7 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 145 - Forks: 6

mala-lab/NegPrompt

The official implementation of CVPR 24' Paper "Learning Transferable Negative Prompts for Out-of-Distribution Detection"

Language: Python - Size: 7.36 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 0

OuyangKun10/A-Large-Scale-Collection-of-Time-Series-Forecasting-Dataset

Here is a large scale collection of time series forecasting dataset, including various domains (e.g., Traffic, Electricity, Environment, Energy, Industry, Retail, Finance, Healthcare, Climate and Web).

Language: Python - Size: 30.3 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 0

thaisaraujom/machine-learning

Projects and summaries for the Machine Learning [PPGEEC2318] course at UFRN, taught by Professor Ivanovitch Silva.

Size: 20.5 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

microsoft/DPSDA

[ICLR 2024] Generating DP Synthetic Data without Training

Language: Python - Size: 40 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 58 - Forks: 4

mbzuai-oryx/groundingLMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].

Language: Python - Size: 109 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 536 - Forks: 25

OpenRobotLab/PointLLM

[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds

Language: Python - Size: 3.49 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 368 - Forks: 13

wzongyu/LLM-and-Multimodal-Paper-List

A paper list about large language models and multimodal models (Diffusion, VLM). From foundations to applications. It is only used to record papers for my personal needs.

Size: 103 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 25 - Forks: 2

aws-samples/inference-audiocraft-musicgen-on-amazon-sagemaker

Deploy Audiocraft Musicgen on Amazon SageMaker using SageMaker Endpoints for Async Inference.

Language: Jupyter Notebook - Size: 893 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 1

Related Keywords
foundation-models 150 large-language-models 35 deep-learning 27 llm 25 machine-learning 24 multimodal 22 vision-language-model 12 representation-learning 12 self-supervised-learning 11 artificial-intelligence 10 generative-ai 10 vision-transformer 10 pytorch 9 chatgpt 9 gpt-4 9 computer-vision 8 nlp 8 vision-and-language 8 llama 8 transfer-learning 7 prompt-engineering 7 evaluation 7 llms 7 medical-imaging 7 forecasting 6 natural-language-processing 6 remote-sensing 6 fine-tuning 6 time-series 6 python 6 contrastive-learning 5 transformers 5 awesome-list 5 transformer 5 image-classification 5 object-detection 5 ai 5 pre-trained-model 5 instruction-tuning 5 llama2 5 openai 4 generative-model 4 video-understanding 4 embodied-ai 4 large-multimodal-models 4 benchmark 4 multimodal-deep-learning 4 visual-question-answering 4 zero-shot-learning 4 visual-language-learning 4 llava 4 pretraining 4 zero-shot-classification 4 gpt 4 stable-diffusion 4 chatbot 4 masked-autoencoder 3 paper-list 3 multimodal-large-language-models 3 clip 3 time-series-forecasting 3 huggingface 3 multimodal-learning 3 neurips-2023 3 awesome 3 federated-learning 3 semantic-segmentation 3 mllm 3 vision-language-pretraining 3 robotics 3 aws 3 pre-training 3 timeseries 3 classification 3 language-models 3 large-language-model 3 visual-prompting 3 vision-language 3 amazon 2 medical 2 3d 2 finetuning 2 amazon-sagemaker 2 text-recommendation 2 cnn 2 llm-recommendation 2 image-recommendation 2 multi-modal 2 nerf 2 autonomous-driving 2 sam 2 change-detection 2 neural-combinatorial-optimization 2 combinatorial-optimization 2 diffusion-models 2 ml 2 unified-model 2 multi-modality 2 non-contrastive-learning 2 llama-2 2