An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-image-retrieval"

alibaba/EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language: Python - Size: 19.9 MB - Last synced at: 17 days ago - Pushed at: 7 months ago - Stars: 2,138 - Forks: 257

NVlabs/ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language: Python - Size: 16.4 MB - Last synced at: 20 days ago - Pushed at: 12 months ago - Stars: 913 - Forks: 49

360CVGroup/FG-CLIP

New generation of CLIP with fine grained discrimination capability, ICML2025

Language: Python - Size: 5.59 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 212 - Forks: 8

xiaoyuan1996/retrievalSystem

The back-end of cross-modal retrieval system,wihch will contain services such as semantic location .etc

Language: Python - Size: 103 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 64 - Forks: 12

BIGBALLON/GME-Search

A multimodal image search engine built on the GME model, capable of handling diverse input types. Whether you're querying with text, images, or both, provides powerful and flexible image retrieval under arbitrary inputs. Perfect for research and demos.

Language: Python - Size: 12.7 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 39 - Forks: 4

sdc17/CrossGET

[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.

Language: Python - Size: 11.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 26 - Forks: 0

KimRass/CLIP

PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

Language: Python - Size: 18.3 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0

HTAnh2003/LLM_Powered_Video_Search

The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.

Language: Jupyter Notebook - Size: 5.29 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

AIoT-Lab-BKAI/PIMA

PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

Language: Jupyter Notebook - Size: 5.2 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

MayssaJaz/Text2Image-Search

A search engine, operating on the foundation of the OpenAI Clip Model to retrieve images corresponding to textual queries.

Language: Jupyter Notebook - Size: 523 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lorenzo-stacchio/Digimon_Dataset

Digimon Dataset for MultiModal Machine Learning

Language: Python - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0