Topic: "retrieval"
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python - Size: 43.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,810 - Forks: 462

apache/lucenenet
Apache Lucene.NET
Language: C# - Size: 174 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 2,334 - Forks: 645

qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Language: Python - Size: 3.16 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2,331 - Forks: 150

intel/intel-extension-for-transformers ๐ฆ
โก Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsโก
Language: Python - Size: 585 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 2,169 - Forks: 215

memodb-io/memobase
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context โ perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.
Language: Python - Size: 16.7 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2,068 - Forks: 147

beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Language: Python - Size: 38.9 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 1,930 - Forks: 217

shervinea/mit-15-003-data-science-tools
Study guides for MIT's 15.003 Data Science Tools
Size: 8.94 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 1,842 - Forks: 365

superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Language: Jupyter Notebook - Size: 138 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,328 - Forks: 99

xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Language: Python - Size: 2.03 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 1,294 - Forks: 76

VectifyAI/PageIndex
๐๐ง PageIndex: Document Index for Reasoning-based RAG
Language: Python - Size: 22 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 1,249 - Forks: 117

parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Language: Python - Size: 816 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 1,146 - Forks: 159

tensorlakeai/indexify
A realtime serving engine for Data-Intensive Generative AI Applications
Language: Rust - Size: 125 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,048 - Forks: 135

ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language: Python - Size: 1.61 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 929 - Forks: 126

lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Language: Python - Size: 186 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 872 - Forks: 108

Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 871 - Forks: 53

epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System
Language: C++ - Size: 1.09 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 860 - Forks: 40

NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Language: Python - Size: 3.83 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 860 - Forks: 48

AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Language: Python - Size: 1.94 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 820 - Forks: 92

OpenBMB/VisRAG
Parsing-free RAG supported by VLMs
Language: Python - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 725 - Forks: 57

michaelthwan/searchGPT
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Language: Python - Size: 1.17 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 693 - Forks: 71

ContextualAI/gritlm
Generative Representational Instruction Tuning
Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 668 - Forks: 48

shamangary/awesome-local-global-descriptor
My personal note about local and global descriptor
Size: 2.47 MB - Last synced at: 12 days ago - Pushed at: almost 5 years ago - Stars: 649 - Forks: 95

lucidrains/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Language: Python - Size: 34.2 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 634 - Forks: 46

Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking. Re-write of qdrant/fastembed.
Language: Rust - Size: 619 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 580 - Forks: 78

redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Language: Python - Size: 2.95 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 546 - Forks: 71

EdoardoBotta/RQ-VAE-Recommender
[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
Language: Python - Size: 146 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 508 - Forks: 63

DataScienceUIBK/Rankify
๐ฅ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation ๐ฅ. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.
Language: Python - Size: 28.3 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 501 - Forks: 36

BUAADreamer/EasyRAG
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps ๅฝ้ ๆๆ่ต 2024 ๅญฃๅๆนๆก
Language: Python - Size: 30.3 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 481 - Forks: 59

SapienzaNLP/relik
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Language: Python - Size: 791 KB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 448 - Forks: 33

AkariAsai/learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Language: Python - Size: 337 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 433 - Forks: 64

KarelDO/xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Language: Python - Size: 45.4 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 423 - Forks: 25

Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Language: HTML - Size: 1.5 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 378 - Forks: 25

raphaelsty/cherche
Neural Search
Language: Python - Size: 41.6 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 332 - Forks: 14

LongmaoTeamTf/deep_recommenders
Deep Recommenders
Language: Python - Size: 2.3 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 327 - Forks: 108

arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
Language: Python - Size: 18.9 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 324 - Forks: 41

tonywu71/colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. ๐จ๐ปโ๐ณ
Size: 8.23 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 315 - Forks: 24

illuin-tech/vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
Language: Python - Size: 2.99 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 235 - Forks: 31

chao1224/MoleculeSTM
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
Language: Python - Size: 39.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 228 - Forks: 21

naver/bergen
Benchmarking library for RAG
Language: Jupyter Notebook - Size: 139 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 214 - Forks: 23

meinardmueller/libfmp
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Language: Python - Size: 7.04 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 210 - Forks: 19

zou-group/avatar
(NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning
Language: Python - Size: 13.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 204 - Forks: 19

FasterDecoding/REST
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Language: C - Size: 1.06 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 202 - Forks: 14

firecrawl/rag-arena
Open-source RAG evaluation through users' feedback
Language: TypeScript - Size: 18.7 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 200 - Forks: 29

jxmorris12/cde
code for training & evaluating Contextual Document Embedding models
Language: Python - Size: 1.67 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 191 - Forks: 11

m-bain/CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Language: Python - Size: 22 MB - Last synced at: 20 days ago - Pushed at: almost 3 years ago - Stars: 183 - Forks: 27

ARM-DOE/ACT
Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets
Language: Python - Size: 285 MB - Last synced at: about 14 hours ago - Pushed at: 4 days ago - Stars: 168 - Forks: 40

rom1504/image_embeddings
Using efficientnet to provide embeddings for retrieval
Language: Jupyter Notebook - Size: 16.2 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 157 - Forks: 32

luyug/COIL
NAACL2021 - COIL Contextualized Lexical Retriever
Language: Python - Size: 91.8 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 153 - Forks: 28

TIGER-AI-Lab/UniIR
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
Language: Python - Size: 53.6 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 152 - Forks: 13

protyposis/Aurio
Audio Fingerprinting & Retrieval for .NET
Language: C# - Size: 8.09 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 151 - Forks: 29

chao1224/ChatDrug
LLM for Drug Editing, ICLR 2024
Language: Python - Size: 4.48 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 149 - Forks: 7

zeroentropy-ai/zchunk
A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
Language: Python - Size: 57.6 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 148 - Forks: 8

zjunlp/OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
Language: Python - Size: 842 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 147 - Forks: 14

simaiden/Clothing-Detection
Language: Python - Size: 151 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 147 - Forks: 45

Anush008/fastembed-js
Library to generate vector embeddings in NodeJS
Language: TypeScript - Size: 1.07 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 141 - Forks: 12

Reason-Wang/ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
Language: Python - Size: 722 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 133 - Forks: 13

denser-org/denser-chat
Chat with PDF files with source highlights
Language: Python - Size: 31 MB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 123 - Forks: 12

DRSY/MoTIS
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
Language: Swift - Size: 16.5 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 123 - Forks: 10

xlang-ai/BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Language: Python - Size: 12.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 121 - Forks: 12

protyposis/AudioAlign
Audio Synchronization and Analysis Tool
Language: C# - Size: 613 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 114 - Forks: 14

gmftbyGMFTBY/OpenDialog
An Open-Source Package for Chinese Open-domain Conversational Chatbot (ไธญๆ้ฒ่ๅฏน่ฏ็ณป็ป๏ผไธ้ฎ้จ็ฝฒๅพฎไฟก้ฒ่ๆบๅจไบบ)
Language: Python - Size: 1.48 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 108 - Forks: 20

luyug/GC-DPR
Train Dense Passage Retriever (DPR) with a single GPU
Language: Python - Size: 94.7 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 18

CoIR-team/coir
(ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.
Language: Python - Size: 2.51 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 102 - Forks: 9

icodingc/ImageRetrieval-tf
ๅบไบtensorflow & tf-servering & flask ็ๅพๅๆฃ็ดข
Language: JavaScript - Size: 11.1 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 102 - Forks: 29

PkuRainBow/HDC.caffe
Complete Code for "Hard-Aware-Deeply-Cascaded-Embedding"
Language: Python - Size: 89.3 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 98 - Forks: 35

Dylancer1998/awesome-visual-localization-papers
The relocalization task aims to estimate the 6-DoF pose of a novel (unseen) frame in the coordinate system given by the prior model of the world.
Size: 55.7 KB - Last synced at: 11 days ago - Pushed at: almost 3 years ago - Stars: 93 - Forks: 8

dataplayer12/Fly-LSH
An implementation of efficient LSH inspired by fruit fly brain
Language: Python - Size: 214 KB - Last synced at: 25 days ago - Pushed at: over 6 years ago - Stars: 87 - Forks: 27

ai4protein/VenusREM
๐งฌ Augmenting zero-shot mutant prediction by retrieval-based logits fusion. (ISMB/ECCB 2025)
Language: Python - Size: 359 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 84 - Forks: 8

noagarcia/keras_rmac
RMAC implementation in Keras
Language: Python - Size: 2.25 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 82 - Forks: 28

HITsz-TMG/KaLM-Embedding
Code for KaLM-Embedding models
Language: Python - Size: 319 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 81 - Forks: 6

Anush008/fastembed-go
Go implementation of @qdrant/fastembed.
Language: Go - Size: 3.22 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 81 - Forks: 6

feymanpriv/pymetric
pytorch metric learning tools and pycls
Language: Python - Size: 960 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 8

lucidrains/marge-pytorch
Implementation of Marge, Pre-training via Paraphrasing, in Pytorch
Language: Python - Size: 166 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 76 - Forks: 11

axflow/original-demo-ui
Demo UI for the axgen library
Language: TypeScript - Size: 899 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 75 - Forks: 12

TmaxEdu/KorDPR
This repo Implements "Dense Passage Retrieval for Open-Domain Question Answering" using Korean Dataset
Language: Python - Size: 71.3 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 75 - Forks: 13

Langboat/mengzi-retrieval-lm
An experimental implementation of the retrieval-enhanced language model
Language: Python - Size: 189 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 74 - Forks: 5

neulab/retomaton
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
Language: Python - Size: 6.62 MB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 73 - Forks: 4

lucidrains/retrieval-augmented-ddpm
Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch
Language: Python - Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 64 - Forks: 5

LongxingTan/open-retrievals
All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers
Language: Python - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 63 - Forks: 13

voyage-ai/voyageai-python
Voyage AI Official Python Library
Language: Python - Size: 189 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 63 - Forks: 10

mingjm3/image_retrieval_system
A image retrieval program implemented by VLAD algorithm (paper: https://ieeexplore.ieee.org/document/5540039). Can train you own image set and build your own image retrieval system.
Language: C++ - Size: 476 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 63 - Forks: 0

ducha-aiki/google-retrieval-challenge-2019-fastai-starter
fast.ai starter kit for Google Landmark Retrieval 2019 challenge
Language: Jupyter Notebook - Size: 2.74 MB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 61 - Forks: 19

aimagelab/safe-clip
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
Language: Python - Size: 17.5 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 0

vitrivr/cineast
Cineast is a multi-feature content-based mulitmedia retrieval engine. It is capable of retrieving images, audio- and video sequences as well as 3d models based on edge or color sketches, textual descriptions and example objects.
Language: Java - Size: 19.2 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 49

UDLF/UDLF
An Unsupervised Distance Learning Framework for Multimedia Retrieval
Language: C++ - Size: 8.46 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 57 - Forks: 11

algoprog/Quin
An easy to use framework for large-scale fact-checking and question answering
Language: Python - Size: 51.8 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 57 - Forks: 7

Confusezius/CVPR2020_PADS
(CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet mining with Reinforcement Learning.
Language: Python - Size: 193 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 57 - Forks: 9

imatge-upc/salbow
Saliency Weighted Convolutional Features for Instance Search
Language: Python - Size: 1.47 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 55 - Forks: 6

ArkanDash/Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads
Language: Python - Size: 5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 54 - Forks: 18

yanbeic/VAL
Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning
Language: Python - Size: 3.83 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 54 - Forks: 10

teddylee777/openai-api-kr
OpenAI ๊ณต์ Document, Cookbook, ๊ทธ ๋ฐ์ ์ค์ฉ ์์ ๋ฅผ ๋ฐํ์ผ๋ก ์์ฑํ ํ๊ตญ์ด ํํ ๋ฆฌ์ผ์ ๋๋ค. ๋ณธ ํํ ๋ฆฌ์ผ์ ํตํด Python OpenAI API ๋ฅผ ๋ ์ฝ๊ณ ํจ๊ณผ์ ์ผ๋ก ์ฌ์ฉํ๋ ๋ฐฉ๋ฒ์ ๋ฐฐ์ธ ์ ์์ต๋๋ค.
Language: Jupyter Notebook - Size: 39.2 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 25

fmaglia/keras_rmac_plus
Keras implementation of R-MAC+ descriptors
Language: Python - Size: 29.3 KB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 53 - Forks: 15

ioanacroi/qb-norm
Cross Modal Retrieval with Querybank Normalisation
Language: Python - Size: 42 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 48 - Forks: 2

BIGBALLON/UME-Search
Toward Universal Multimodal Embedding
Language: Python - Size: 712 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 44 - Forks: 4

umbertogriffo/Trie
A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.
Language: Java - Size: 34.6 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 44 - Forks: 12

lorenzhs/BuRR
Bumped Ribbon Retrieval and Approximate Membership Query
Language: C++ - Size: 121 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 43 - Forks: 6

Ryota-Kawamura/LangChain-Chat-with-Your-Data
Start building practical applications that allow you to interact with data using LangChain and LLMs.
Language: Jupyter Notebook - Size: 71.8 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 43 - Forks: 43

AstraBert/diRAGnosis
Diagnose the performance of your RAG๐ฉบ
Language: Python - Size: 214 KB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 42 - Forks: 3

orionw/FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Language: Python - Size: 81.2 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 0

zjunlp/RAP
[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction
Language: Python - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 42 - Forks: 3
