GitHub topics: colpali
illuin-tech/colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Language: Python - Size: 796 KB - Last synced at: about 1 hour ago - Pushed at: about 15 hours ago - Stars: 1,772 - Forks: 151

morphik-org/morphik-core
Open source multi-modal RAG for building AI apps over private knowledge.
Language: Python - Size: 116 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,648 - Forks: 82

liweiphys/layra
LAYRA is a ready-to-use visual RAG system with a complete web UI built with Next.js and FastAPI, preserving document layout, tables, paragraphs, and graphical elements without any structural fragmentation.
Language: TypeScript - Size: 2.61 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 427 - Forks: 42

illuin-tech/vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
Language: Python - Size: 2.97 MB - Last synced at: 8 days ago - Pushed at: 17 days ago - Stars: 197 - Forks: 24

AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Language: Python - Size: 1.94 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 774 - Forks: 81

tonywu71/colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳
Size: 10.4 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 269 - Forks: 17

StarlightSearch/EmbedAnything
Production-ready Inference, Ingestion and Indexing built in Rust 🦀
Language: Rust - Size: 36.7 MB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 506 - Forks: 45

adithya-s-k/VARAG
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
Language: Python - Size: 9.48 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 444 - Forks: 42

s-emanuilov/litepali
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
Language: Python - Size: 691 KB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 46 - Forks: 1

hyun-yang/MyColPali
The PyQt6 application using ColPali and OpenAI to show Efficient Document Retrieval with Vision Language Models
Language: Python - Size: 313 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

pranshuchaurasia/image-indexing-and-retrival-with-qdrant
The repo provides the code for Qdrant for efficient image indexing and retrieval using models such as ColPali, ColQwen, and VDR-2B-Multi-V1, enhancing multimodal search capabilities across various applications.
Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DataFog/vlm-api
REST API for computing cross-modal similarity between images and text using the ColPaLI vision-language model
Language: Python - Size: 2.53 MB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1

Softlandia-Ltd/vision-is-all-you-need
Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo
Language: TypeScript - Size: 228 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 53 - Forks: 9

bastienpo/slides-rag
Language: Python - Size: 110 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ArchismwanChatterjee/OCR-and-Document-Search-Web-Application-Prototype
OCR and Document Search Web Application
Language: Jupyter Notebook - Size: 462 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
