An open API service providing repository metadata for many open source software ecosystems.

Topic: "retrieval"

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 34.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,418 - Forks: 373

apache/lucenenet

Apache Lucene.NET

Language: C# - Size: 170 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 2,294 - Forks: 647

intel/intel-extension-for-transformers ๐Ÿ“ฆ

โšก Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsโšก

Language: Python - Size: 585 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,133 - Forks: 211

qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Language: Python - Size: 3.02 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,940 - Forks: 128

shervinea/mit-15-003-data-science-tools

Study guides for MIT's 15.003 Data Science Tools

Size: 8.94 MB - Last synced at: 13 days ago - Pushed at: over 4 years ago - Stars: 1,825 - Forks: 366

beir-cellar/beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Language: Python - Size: 38.9 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 1,771 - Forks: 204

parthsarthi03/raptor

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Language: Python - Size: 816 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 1,146 - Forks: 159

xhluca/bm25s

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Language: Python - Size: 2.03 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,105 - Forks: 63

memodb-io/memobase

Profile-Based Long-Term Memory for AI Applications

Language: Python - Size: 8.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,055 - Forks: 69

superlinked/superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

Language: Jupyter Notebook - Size: 110 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,023 - Forks: 73

tensorlakeai/indexify

A realtime serving engine for Data-Intensive Generative AI Applications

Language: Rust - Size: 123 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 988 - Forks: 125

ArrowLuo/CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language: Python - Size: 1.61 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 929 - Forks: 126

Muennighoff/sgpt

SGPT: GPT Sentence Embeddings for Semantic Search

Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 863 - Forks: 54

lucidrains/RETRO-pytorch

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Language: Python - Size: 186 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 861 - Forks: 107

NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Language: Python - Size: 3.83 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 854 - Forks: 47

epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System

Language: C++ - Size: 1010 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 851 - Forks: 41

AnswerDotAI/byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Language: Python - Size: 1.94 MB - Last synced at: about 13 hours ago - Pushed at: 3 months ago - Stars: 774 - Forks: 81

michaelthwan/searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

Language: Python - Size: 1.17 MB - Last synced at: about 4 hours ago - Pushed at: 8 months ago - Stars: 693 - Forks: 71

shamangary/awesome-local-global-descriptor

My personal note about local and global descriptor

Size: 2.47 MB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 648 - Forks: 95

lucidrains/memorizing-transformers-pytorch

Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch

Language: Python - Size: 34.2 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 633 - Forks: 47

ContextualAI/gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 608 - Forks: 42

redis-developer/ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

Language: Python - Size: 2.95 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 543 - Forks: 68

Anush008/fastembed-rs

Rust library for generating vector embeddings, reranking locally

Language: Rust - Size: 598 KB - Last synced at: about 8 hours ago - Pushed at: 9 days ago - Stars: 479 - Forks: 64

AkariAsai/learning_to_retrieve_reasoning_paths

The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".

Language: Python - Size: 337 KB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 432 - Forks: 63

DataScienceUIBK/Rankify

๐Ÿ”ฅ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation ๐Ÿ”ฅ. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.

Language: Python - Size: 5.11 MB - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 420 - Forks: 34

KarelDO/xmc.dspy

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Language: Python - Size: 45.4 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 420 - Forks: 26

SapienzaNLP/relik

Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)

Language: Python - Size: 908 KB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 407 - Forks: 31

Aquila-Network/aquila

An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.

Language: HTML - Size: 1.5 MB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 376 - Forks: 25

raphaelsty/cherche

Neural Search

Language: Python - Size: 41.6 MB - Last synced at: 16 days ago - Pushed at: 11 months ago - Stars: 328 - Forks: 15

LongmaoTeamTf/deep_recommenders

Deep Recommenders

Language: Python - Size: 2.3 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 327 - Forks: 108

tonywu71/colpali-cookbooks

Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. ๐Ÿ‘จ๐Ÿปโ€๐Ÿณ

Size: 10.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 269 - Forks: 17

arcee-ai/DALM

Domain Adapted Language Modeling Toolkit - E2E RAG

Language: Python - Size: 18.9 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 263 - Forks: 32

chao1224/MoleculeSTM

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)

Language: Python - Size: 39.9 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 224 - Forks: 21

meinardmueller/libfmp

libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)

Language: Python - Size: 7.04 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 207 - Forks: 18

FasterDecoding/REST

REST: Retrieval-Based Speculative Decoding, NAACL 2024

Language: C - Size: 1.06 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 199 - Forks: 12

illuin-tech/vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Language: Python - Size: 2.97 MB - Last synced at: 2 days ago - Pushed at: 11 days ago - Stars: 197 - Forks: 24

naver/bergen

Benchmarking library for RAG

Language: Jupyter Notebook - Size: 139 MB - Last synced at: 8 days ago - Pushed at: 13 days ago - Stars: 190 - Forks: 20

zou-group/avatar

AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)

Language: Python - Size: 13.4 MB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 187 - Forks: 17

jxmorris12/cde

code for training & evaluating Contextual Document Embedding models

Language: Python - Size: 1.62 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 180 - Forks: 11

mendableai/rag-arena

Open-source RAG evaluation through users' feedback

Language: TypeScript - Size: 18.7 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 180 - Forks: 19

m-bain/CondensedMovies

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Language: Python - Size: 22 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 175 - Forks: 28

ARM-DOE/ACT

Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets

Language: Python - Size: 286 MB - Last synced at: about 5 hours ago - Pushed at: 3 days ago - Stars: 159 - Forks: 38

rom1504/image_embeddings

Using efficientnet to provide embeddings for retrieval

Language: Jupyter Notebook - Size: 16.2 MB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 157 - Forks: 31

TIGER-AI-Lab/VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]

Language: Python - Size: 8.92 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 156 - Forks: 6

chao1224/ChatDrug

LLM for Drug Editing, ICLR 2024

Language: Python - Size: 4.48 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 150 - Forks: 7

luyug/COIL

NAACL2021 - COIL Contextualized Lexical Retriever

Language: Python - Size: 91.8 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 149 - Forks: 28

zeroentropy-ai/zchunk

A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.

Language: Python - Size: 57.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 148 - Forks: 8

simaiden/Clothing-Detection

Language: Python - Size: 151 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 147 - Forks: 45

zjunlp/OneGen

[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.

Language: Python - Size: 842 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 137 - Forks: 15

EdoardoBotta/RQ-VAE-Recommender

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Language: Python - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 135 - Forks: 16

Reason-Wang/ToolGen

[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"

Language: Python - Size: 722 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 133 - Forks: 13

denser-org/denser-chat

Chat with PDF files with source highlights

Language: Python - Size: 31 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 123 - Forks: 12

DRSY/MoTIS

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

Language: Swift - Size: 16.5 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 121 - Forks: 10

Anush008/fastembed-js

Library to generate vector embeddings in NodeJS

Language: TypeScript - Size: 1.07 MB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 117 - Forks: 9

protyposis/Aurio

Audio Fingerprinting & Retrieval for .NET

Language: C# - Size: 8.09 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 115 - Forks: 25

protyposis/AudioAlign

Audio Synchronization and Analysis Tool

Language: C# - Size: 613 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 114 - Forks: 14

gmftbyGMFTBY/OpenDialog

An Open-Source Package for Chinese Open-domain Conversational Chatbot (ไธญๆ–‡้—ฒ่Šๅฏน่ฏ็ณป็ปŸ๏ผŒไธ€้”ฎ้ƒจ็ฝฒๅพฎไฟก้—ฒ่Šๆœบๅ™จไบบ)

Language: Python - Size: 1.48 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 108 - Forks: 20

luyug/GC-DPR

Train Dense Passage Retriever (DPR) with a single GPU

Language: Python - Size: 94.7 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 105 - Forks: 18

icodingc/ImageRetrieval-tf

ๅŸบไบŽtensorflow & tf-servering & flask ็š„ๅ›พๅƒๆฃ€็ดข

Language: JavaScript - Size: 11.1 MB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 102 - Forks: 29

PkuRainBow/HDC.caffe

Complete Code for "Hard-Aware-Deeply-Cascaded-Embedding"

Language: Python - Size: 89.3 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 98 - Forks: 35

xlang-ai/BRIGHT

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Language: Python - Size: 12.4 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 96 - Forks: 9

Dylancer1998/awesome-visual-localization-papers

The relocalization task aims to estimate the 6-DoF pose of a novel (unseen) frame in the coordinate system given by the prior model of the world.

Size: 55.7 KB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 93 - Forks: 8

dataplayer12/Fly-LSH

An implementation of efficient LSH inspired by fruit fly brain

Language: Python - Size: 214 KB - Last synced at: 18 days ago - Pushed at: over 6 years ago - Stars: 88 - Forks: 27

noagarcia/keras_rmac

RMAC implementation in Keras

Language: Python - Size: 2.25 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 82 - Forks: 28

feymanpriv/pymetric

pytorch metric learning tools and pycls

Language: Python - Size: 960 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 78 - Forks: 8

lucidrains/marge-pytorch

Implementation of Marge, Pre-training via Paraphrasing, in Pytorch

Language: Python - Size: 166 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 75 - Forks: 11

HITsz-TMG/KaLM-Embedding

Code for KaLM-Embedding models

Language: Python - Size: 319 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 74 - Forks: 6

Langboat/mengzi-retrieval-lm

An experimental implementation of the retrieval-enhanced language model

Language: Python - Size: 189 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 74 - Forks: 5

CoIR-team/coir

A Comprehensive Benchmark for Code Information Retrieval.

Language: Python - Size: 2.45 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 73 - Forks: 8

axflow/original-demo-ui

Demo UI for the axgen library

Language: TypeScript - Size: 899 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 12

neulab/retomaton

PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)

Language: Python - Size: 6.62 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 71 - Forks: 4

Anush008/fastembed-go

Go implementation of @qdrant/fastembed.

Language: Go - Size: 3.22 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 70 - Forks: 4

TIGER-AI-Lab/UniIR

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"

Language: Python - Size: 53.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 69 - Forks: 11

lucidrains/retrieval-augmented-ddpm

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

Language: Python - Size: 4.88 KB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 64 - Forks: 5

mingjm3/image_retrieval_system

A image retrieval program implemented by VLAD algorithm (paper: https://ieeexplore.ieee.org/document/5540039). Can train you own image set and build your own image retrieval system.

Language: C++ - Size: 476 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 63 - Forks: 0

ducha-aiki/google-retrieval-challenge-2019-fastai-starter

fast.ai starter kit for Google Landmark Retrieval 2019 challenge

Language: Jupyter Notebook - Size: 2.74 MB - Last synced at: 3 days ago - Pushed at: about 6 years ago - Stars: 61 - Forks: 19

aimagelab/safe-clip

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024

Language: Python - Size: 17.5 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 58 - Forks: 0

vitrivr/cineast

Cineast is a multi-feature content-based mulitmedia retrieval engine. It is capable of retrieving images, audio- and video sequences as well as 3d models based on edge or color sketches, textual descriptions and example objects.

Language: Java - Size: 19.2 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 58 - Forks: 49

UDLF/UDLF

An Unsupervised Distance Learning Framework for Multimedia Retrieval

Language: C++ - Size: 8.46 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 57 - Forks: 11

algoprog/Quin

An easy to use framework for large-scale fact-checking and question answering

Language: Python - Size: 51.8 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 57 - Forks: 7

Confusezius/CVPR2020_PADS

(CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet mining with Reinforcement Learning.

Language: Python - Size: 193 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 57 - Forks: 9

LongxingTan/open-retrievals

All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers

Language: Python - Size: 1.36 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 55 - Forks: 12

imatge-upc/salbow

Saliency Weighted Convolutional Features for Instance Search

Language: Python - Size: 1.47 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 55 - Forks: 6

yanbeic/VAL

Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning

Language: Python - Size: 3.83 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 10

fmaglia/keras_rmac_plus

Keras implementation of R-MAC+ descriptors

Language: Python - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 53 - Forks: 15

ai4protein/VenusREM

๐Ÿงฌ Augmenting zero-shot mutant prediction by retrieval-based logits fusion. (ISMB/ECCB 2025)

Language: Python - Size: 359 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 51 - Forks: 8

teddylee777/openai-api-kr

OpenAI ๊ณต์‹ Document, Cookbook, ๊ทธ ๋ฐ–์˜ ์‹ค์šฉ ์˜ˆ์ œ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์ž‘์„ฑํ•œ ํ•œ๊ตญ์–ด ํŠœํ† ๋ฆฌ์–ผ์ž…๋‹ˆ๋‹ค. ๋ณธ ํŠœํ† ๋ฆฌ์–ผ์„ ํ†ตํ•ด Python OpenAI API ๋ฅผ ๋” ์‰ฝ๊ณ  ํšจ๊ณผ์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ฐฐ์šธ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Language: Jupyter Notebook - Size: 39.2 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 49 - Forks: 24

ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

Language: Python - Size: 5.01 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 48 - Forks: 18

ioanacroi/qb-norm

Cross Modal Retrieval with Querybank Normalisation

Language: Python - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 48 - Forks: 2

lorenzhs/BuRR

Bumped Ribbon Retrieval and Approximate Membership Query

Language: C++ - Size: 121 KB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 43 - Forks: 6

Ryota-Kawamura/LangChain-Chat-with-Your-Data

Start building practical applications that allow you to interact with data using LangChain and LLMs.

Language: Jupyter Notebook - Size: 71.8 MB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 42

voyage-ai/voyageai-python

Voyage AI Official Python Library

Language: Python - Size: 178 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 42 - Forks: 5

orionw/FollowIR

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Language: Python - Size: 81.2 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 42 - Forks: 0

WHU-USI3DV/PatchAugNet

PatchAugNet: Patch feature augmentation-based heterogeneous point cloud place recognition in large-scale street scenes

Language: Python - Size: 109 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 41 - Forks: 0

vitrivr/cottontaildb

Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.

Language: Kotlin - Size: 14.3 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 40 - Forks: 20

ahmdtaha/constrained_attention_filter

(ECCV2020) Tensorflow implementation of A Generic Visualization Approach for Convolutional Neural Networks

Language: Python - Size: 62.1 MB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 40 - Forks: 8

zhaoxin111/imageRetrieval

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 40 - Forks: 27

zjunlp/RAP

[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction

Language: Python - Size: 17.1 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 39 - Forks: 3

ahmdtaha/tf_retrieval_baseline

A Tensorflow retrieval (space embedding) baseline. Metric learning baseline on CUB and Stanford Online Products.

Language: Python - Size: 881 KB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 39 - Forks: 6

palladian/palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Language: Java - Size: 274 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 38 - Forks: 10