An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: vector-similarity

milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Language: Go - Size: 219 MB - Last synced at: about 9 hours ago - Pushed at: about 12 hours ago - Stars: 34,235 - Forks: 3,162

rapidsai/cuvs

cuVS - a library for vector search and clustering on the GPU

Language: Cuda - Size: 7.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 382 - Forks: 96

rapidsai/raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

Language: Cuda - Size: 15.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 870 - Forks: 205

a-tokyo/ai-zero-shot-classifier

🧠 leverage advanced AI embeddings to perform multilingual zero-shot text classification. Whether you're dealing with unlabelled data or seeking to classify text against dynamic and user-defined labels, this library provides a seamless and efficient solution.

Language: TypeScript - Size: 1.25 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 0

nitaiaharoni1/vector-storage

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

Language: TypeScript - Size: 175 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 220 - Forks: 38

VQLite/VQLite

VQLite - Simple and Lightweight Vector Search Engine based on Google ScaNN

Language: Go - Size: 89.8 KB - Last synced at: 21 days ago - Pushed at: 9 months ago - Stars: 90 - Forks: 6

RelevanceAI/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

Size: 11.6 MB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 558 - Forks: 57

christivn/mebox

🗃️✨ Mebox is an open-source alternative to OpenAI's file_search tool, designed to efficiently process, store, and retrieve file-based information using Supabase and open source embeddings.

Language: HTML - Size: 8.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

lperezmo/embeddings-extraction

Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings for use in context-augmented LLM queries.

Language: Python - Size: 15 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 13 - Forks: 4

timeless-residents/handson-pinecone

FastAPI-based vector similarity search API using Pinecone for efficient vector database operations

Language: Python - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

guillermoscript/repo-assistant

AI Github assistant for your repo. Your proactive GitHub bot that auto-detects duplicates using OpenAI embeddings and Supabase magic!

Language: TypeScript - Size: 452 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 33 - Forks: 3

Huffon/sentence-similarity 📦

This repository contains various ways to calculate sentence vector similarity using NLP models

Language: Python - Size: 215 KB - Last synced at: 4 days ago - Pushed at: about 5 years ago - Stars: 199 - Forks: 34

timescale/vector-cookbook

Timescale Vector Cookbook. A collection of recipes to build applications with LLMs using PostgreSQL and Timescale Vector.

Language: Jupyter Notebook - Size: 8.52 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 111 - Forks: 40

taki0112/Vector_Similarity

Python, Java implementation of TS-SS called from "A Hybrid Geometric Approach for Measuring Similarity Level Among Documents and Document Clustering"

Language: Python - Size: 1.81 MB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 297 - Forks: 44

srbhr/website-for-resume-matcher 📦

OLD Resume Matcher Website (Not used anymore)

Language: Astro - Size: 4.82 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 9

Florents-Tselai/vasco

Maximal Information Coefficient (MIC) Extension for Postgres

Language: C - Size: 5.23 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 33 - Forks: 2

matsjfunke/rag-from-scratch

implemented vector similarity algorithms to understand their inner workings, used local embeddding models

Language: Python - Size: 11.9 MB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

bariscamli/Vector-Search-with-FAISS

Vector search using embeddings, FAISS and Product Quantization with custom index & KMeans

Language: Python - Size: 424 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

shotit/shotit-frontend

The frontend of shotit, with full documentation.

Language: MDX - Size: 63.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

easonlai/chatbot_with_pdf_streamlit

This code example shows how to make a chatbot for semantic search over documents using Streamlit, LangChain, and various vector databases. The chatbot lets users ask questions and get answers from a document collection. The code is in Python and can be customized for different scenarios and data.

Language: Jupyter Notebook - Size: 6.57 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 5

shotit/shotit

Shotit is a screenshot-to-video search engine tailored for TV & Film, blazing-fast and compute-efficient.

Size: 4.19 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 3

shotit/shotit-api

The ultimate brain of Shotit, in charge of task coordination.

Language: JavaScript - Size: 1.82 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

shotit/.github

The README profile of Shotit.

Size: 2.25 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

vector-ai/vectorai

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

Language: Python - Size: 27 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 305 - Forks: 37

serp-ai/V3CTRON-vector-database-embedding-neural-search-retrieval-chatgpt-plugin Fork of openai/chatgpt-retrieval-plugin

V3CTRON | Vector Embeddings Data Retrieval | ChatGPT Plugin

Language: Python - Size: 6.61 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 7

EsraaMadi/similarity-search-weaviate

Text/Image search for similar products

Language: Python - Size: 68.9 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 1

robpetrosino/c0vEM5oxUa6ndKp8

Apziva AI Residency Program 2024 -- Project 3

Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

dvsh243/Seekr

in-memory fuzzy matching

Language: Python - Size: 28.7 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

erhant/halo2-vectordb

Verifiable vector similarity queries PoC with Halo2.

Language: Rust - Size: 220 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

struct-chat/embedding

Vector Embedding Server in under 100 lines of code

Language: Python - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 2

ChriStingo/Distributed-computation-of-Approximate-Nearest-Neighbors

Implementation and analysis of various algorithms, libraries and systems, distributed and not, for Approximate Nearest Neighbors searches

Language: Python - Size: 34.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ChetanXpro/chat-with-pdf

This is a web app where user can talk with there pdf , just need to run few scripts to ingest there pdf, and then with web interface they can talk with pdf

Language: TypeScript - Size: 194 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Macarena-Chang/DataHive

Your Data as Knowledge Base + AI. ChatBot

Language: Python - Size: 187 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

shotit/shotit-sorter

Sort the search results of Shotit to increase the correctness of Top1 result by using Keras and Faiss.

Language: Python - Size: 7.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

shotit/shotit-meta-service

Provide meta information and utility for shotit, for example, image proxy, cast and poster etc.

Language: JavaScript - Size: 55.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

shotit/shotit-worker

Four core workers of shotit: watcher, hasher, loader and searcher.

Language: JavaScript - Size: 30.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

shotit/shotit-media

Media broker for serving video preview for shotit

Language: JavaScript - Size: 890 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

sayakpaul/near-dup-parser

Holds code for near-duplicate image parser using optimized image classifiers.

Language: Jupyter Notebook - Size: 6.32 MB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

ma-an-jong/diagnostic

Language: Python - Size: 15.2 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

wikp/faiss-rest-api

REST API for facebook's faiss

Language: Rust - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

Related Keywords
vector-similarity 40 vector-search 23 vector 12 approximate-nearest-neighbor-search 12 faiss 12 vector-database 12 nearest-neighbor-search 12 anns 11 embedding-similarity 11 python 10 search 9 search-engine 9 javascript 9 distributed 9 image-search 9 nodejs 8 machine-learning 8 liresolr 8 video 8 video-search 8 visual-search 8 screenshot 7 node 7 openai 6 llm 6 embeddings 6 semantic-search 5 natural-language-processing 4 transformers 4 similarity-search 4 tensorflow 4 vector-similarity-search 4 vector-similarity-database 4 ai 4 artificial-intelligence 4 embedding-vectors 3 vector-search-engine 3 pinecone 3 langchain 3 rag 3 vector-store 3 deep-learning 3 clustering 3 bot 2 typescript 2 word-embeddings 2 supabase 2 gpt-3 2 information-retrieval 2 streamlit 2 embedding-database 2 vector-similarity-search-engine 2 huggingface 2 encodings 2 gpu 2 neighborhood-methods 2 statistics 2 pytorch 2 sparse 2 rust 2 nlp 2 ollama 2 react 2 nearest-neighbors 2 vue 2 cosine-similarity 2 keras 2 cuda 2 distance 2 fastapi 2 open-source 2 gpt-4 2 large-language-models 2 v3ctron 1 semantic-search-engine 1 neural-search-engine 1 vector-database-embedding 1 vector-embedding-database 1 vector-database-search 1 vector-embeddings 1 rest-api 1 tensorrt 1 sentence-transformers 1 azure 1 azure-cognitive-search 1 azure-openai 1 chroma 1 document-search 1 embedding-models 1 gpt-35-turbo 1 langchain-python 1 compare-vectors 1 neural-networks 1 vector-analytics 1 chatgpt-plugins 1 chatgpt-retrieval 1 neural-search 1 ann-algorithm 1 disease-prediction 1 lsh 1