An open API service providing repository metadata for many open source software ecosystems.

Topic: "semantic-search"

microsoft/generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI ๐Ÿ”— https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook - Size: 125 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 78,372 - Forks: 40,640

meilisearch/meilisearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

Language: Rust - Size: 69.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 50,474 - Forks: 1,992

khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Language: Python - Size: 109 MB - Last synced at: 39 minutes ago - Pushed at: about 4 hours ago - Stars: 28,763 - Forks: 1,608

typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch โšก ๐Ÿ” โœจ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

Language: C++ - Size: 12.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 22,793 - Forks: 714

deepset-ai/haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python - Size: 48.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 20,293 - Forks: 2,132

arc53/DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.

Language: TypeScript - Size: 81.1 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 15,560 - Forks: 1,659

weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native databaseโ€‹.

Language: Go - Size: 964 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 13,117 - Forks: 926

neuml/txtai

๐Ÿ’ก All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python - Size: 52 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 10,705 - Forks: 679

zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language: Python - Size: 22.3 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 7,511 - Forks: 533

lancedb/lancedb

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

Language: Python - Size: 17.4 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 6,210 - Forks: 453

superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

Language: Python - Size: 73.2 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 5,032 - Forks: 492

marqo-ai/marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Language: Python - Size: 79.5 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4,821 - Forks: 202

docarray/docarray

Represent, send, store and search multimodal data

Language: Python - Size: 242 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 3,042 - Forks: 233

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python - Size: 83.4 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 3,022 - Forks: 374

gmpetrov/databerry

The no-code platform for building custom LLM Agents

Size: 73.2 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 2,932 - Forks: 429

pinecone-io/examples

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

Language: Jupyter Notebook - Size: 314 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2,870 - Forks: 1,050

mazzzystar/Queryable

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Language: Swift - Size: 1.93 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 2,830 - Forks: 435

filipecalegario/awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

Size: 1.56 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,778 - Forks: 460

unum-cloud/usearch

Fast Open-Source Search & Clustering engine ร— for Vectors & ๐Ÿ”œ Strings ร— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram ๐Ÿ”

Language: C++ - Size: 4.22 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 2,654 - Forks: 175

freedmand/semantra

Multi-tool for semantic search

Language: Python - Size: 9.01 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 2,597 - Forks: 153

rom1504/clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language: Jupyter Notebook - Size: 3.75 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 2,536 - Forks: 222

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 34.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,418 - Forks: 373

microsoft/kernel-memory

RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

Language: C# - Size: 25.6 MB - Last synced at: 11 days ago - Pushed at: 28 days ago - Stars: 1,882 - Forks: 355

NotJoeMartinez/yt-fts

YouTube Full Text Search - Search all of a YouTube channel from the command line

Language: Python - Size: 367 KB - Last synced at: about 2 hours ago - Pushed at: 7 months ago - Stars: 1,685 - Forks: 85

IntelLabs/fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language: Python - Size: 20.4 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 1,513 - Forks: 139

frutik/awesome-search

Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

Language: HTML - Size: 1.26 MB - Last synced at: 9 days ago - Pushed at: 22 days ago - Stars: 1,432 - Forks: 124

gnes-ai/gnes ๐Ÿ“ฆ

GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.

Language: Python - Size: 52 MB - Last synced at: 26 days ago - Pushed at: over 5 years ago - Stars: 1,267 - Forks: 209

aws-samples/aws-genai-llm-chatbot

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS

Language: TypeScript - Size: 82.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,221 - Forks: 371

lotus-data/lotus

LOTUS: A semantic query engine for fast and easy LLM-powered data processing

Language: Python - Size: 1.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,155 - Forks: 100

unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ๐Ÿ”œ video, up to 5x faster than OpenAI CLIP and LLaVA ๐Ÿ–ผ๏ธ & ๐Ÿ–‹๏ธ

Language: Python - Size: 669 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,116 - Forks: 63

model-zoo/shift-ctrl-f ๐Ÿ“ฆ

๐Ÿ”Ž Search the information available on a webpage using natural language instead of an exact string match.

Language: JavaScript - Size: 87.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1,105 - Forks: 42

superlinked/superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

Language: Jupyter Notebook - Size: 110 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,023 - Forks: 73

Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Language: Python - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 1,014 - Forks: 61

Muennighoff/sgpt

SGPT: GPT Sentence Embeddings for Semantic Search

Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 863 - Forks: 54

PrithivirajDamodaran/FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language: Python - Size: 2.47 MB - Last synced at: about 2 hours ago - Pushed at: 5 months ago - Stars: 786 - Forks: 55

hayabhay/frogbase ๐Ÿ“ฆ

Transform audio-visual content into navigable knowledge.

Language: Python - Size: 1.22 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 785 - Forks: 95

IntelLabs/RAG-FiT

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Language: Python - Size: 925 KB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 737 - Forks: 56

primeqa/primeqa

The prime repository for state-of-the-art Multilingual Question Answering research and development.

Language: Python - Size: 51 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 733 - Forks: 57

koursaros-ai/nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)

Language: Python - Size: 14.1 MB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 678 - Forks: 69

cocoindex-io/cocoindex

ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.

Language: Rust - Size: 3.58 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 579 - Forks: 42

aryn-ai/sycamore

๐Ÿ Sycamore is an LLM-powered search and analytics platform for unstructured data.

Language: Python - Size: 99.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 506 - Forks: 59

hamelsmu/code_search

Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deepย Learning"

Language: Jupyter Notebook - Size: 73.6 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 488 - Forks: 137

qdrant/mcp-server-qdrant

An official Qdrant Model Context Protocol (MCP) server implementation

Language: Python - Size: 313 KB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 463 - Forks: 52

jina-ai/examples ๐Ÿ“ฆ

Jina examples and demos to help you get started

Language: Python - Size: 189 MB - Last synced at: about 18 hours ago - Pushed at: over 3 years ago - Stars: 459 - Forks: 142

kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

Language: Go - Size: 714 KB - Last synced at: about 20 hours ago - Pushed at: about 1 month ago - Stars: 430 - Forks: 13

nixiesearch/nixiesearch

Hybrid search engine, combining best features of text and semantic search worlds

Language: Scala - Size: 13 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 413 - Forks: 9

alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

Language: Scala - Size: 139 MB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 379 - Forks: 49

JohnGiorgi/DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Language: Python - Size: 702 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 379 - Forks: 33

raphaelsty/neural-cherche

Neural Search

Language: Python - Size: 3.1 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 354 - Forks: 18

Agrover112/awesome-semantic-search

A curated list of awesome resources related to Semantic Search๐Ÿ”Ž and Semantic Similarity tasks.

Size: 371 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 353 - Forks: 29

askaitools/askaitools-community-edition

A cutting-edge search engine project tailored specifically for the AI product

Language: TypeScript - Size: 742 KB - Last synced at: 28 days ago - Pushed at: 9 months ago - Stars: 347 - Forks: 29

raphaelsty/cherche

Neural Search

Language: Python - Size: 41.6 MB - Last synced at: 16 days ago - Pushed at: 11 months ago - Stars: 328 - Forks: 15

deepset-ai/haystack-tutorials

Here you can find all the Tutorials for Haystack ๐Ÿ““

Language: Jupyter Notebook - Size: 5.17 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 315 - Forks: 107

vector-ai/vectorai

Vector AI โ€” A platform for building vector based applications. Encode, query and analyse data using vectors.

Language: Python - Size: 27 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 305 - Forks: 37

alondmnt/joplin-plugin-jarvis

Joplin (note-taking) assistant running a very intelligent system (GPT, Claude, Gemini, Ollama, Hugging Face)

Language: TypeScript - Size: 3.43 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 275 - Forks: 25

do-me/SemanticFinder

SemanticFinder - frontend-only live semantic search with transformers.js

Language: JavaScript - Size: 30.7 MB - Last synced at: 8 days ago - Pushed at: 25 days ago - Stars: 268 - Forks: 18

DiceTechJobs/ConceptualSearch

Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs

Language: Jupyter Notebook - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 257 - Forks: 59

zilliztech/akcio

Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses vector databases to fetch relevant documents to enhance the quality and relevance of the output.

Language: Python - Size: 1.56 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 255 - Forks: 39

treygrainger/ai-powered-search

The codebase for the book "AI-Powered Search" (Manning Publications, 2024)

Language: Jupyter Notebook - Size: 65.3 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 254 - Forks: 63

ZachNagengast/similarity-search-kit

๐Ÿ”Ž SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.

Language: Swift - Size: 175 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 253 - Forks: 24

intelligentnode/IntelliNode

Access the latest AI models like ChatGPT, LLaMA, Deepseek, Diffusion, Hugging face, and beyond through a unified prompt layer and performance evaluation

Language: JavaScript - Size: 10 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 250 - Forks: 16

Hellisotherpeople/CX_DB8

a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)

Language: Python - Size: 6.21 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 226 - Forks: 26

pinecone-io/pinecone-ts-client

The official TypeScript/Node client for the Pinecone vector database

Language: TypeScript - Size: 2 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 222 - Forks: 40

maxent-ai/ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

Language: Jupyter Notebook - Size: 32.4 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 222 - Forks: 11

nitaiaharoni1/vector-storage

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

Language: TypeScript - Size: 175 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 220 - Forks: 38

Mihaiii/semantic-autocomplete

A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.

Language: JavaScript - Size: 4.9 MB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 216 - Forks: 4

AmenRa/retriv

A Python Search Engine for Humans ๐Ÿฅธ

Language: Python - Size: 372 KB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 213 - Forks: 25

fzliu/radient

Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.

Language: Python - Size: 65.4 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 207 - Forks: 7

Ravn-Tech/HyperTag

HyperTag - Intuitive Knowledge Management WebApp & CLI for Humans using Deep Learning & Tags

Language: Python - Size: 1 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 196 - Forks: 14

sjy-dv/coltt

Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.

Language: Go - Size: 55.2 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 184 - Forks: 1

doobidoo/mcp-memory-service

MCP server providing semantic memory and persistent storage capabilities for Claude using ChromaDB and sentence transformers.

Language: Python - Size: 766 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 167 - Forks: 27

kuutsav/information-retrieval ๐Ÿ“ฆ

Neural information retrieval / Semantic search / Bi-encoders

Language: Jupyter Notebook - Size: 6.69 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 167 - Forks: 21

DmitryKey/bert-solr-search

Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 163 - Forks: 32

md-experiments/elastic_transformers

Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

Language: Jupyter Notebook - Size: 447 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 159 - Forks: 28

augustwester/searchthearxiv

The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.

Language: Python - Size: 124 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 145 - Forks: 14

rom1504/awesome-semantic-search

Semantic search with embeddings: index anything

Size: 14.6 KB - Last synced at: 11 days ago - Pushed at: about 3 years ago - Stars: 139 - Forks: 7

dmotz/emdash

๐Ÿ“š๐Ÿง™โ€โ™‚๏ธ Wisdom indexer โ€” use AI to organize text snippets so you can actually remember & learn from what you read

Language: Elm - Size: 4.76 MB - Last synced at: 35 minutes ago - Pushed at: 15 days ago - Stars: 138 - Forks: 9

patricktrainer/duckdb-embedding-search

Fast similarity search using DuckDB

Language: Python - Size: 5.97 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 128 - Forks: 3

yeldarby/nycerebro

Language: TypeScript - Size: 2.33 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 124 - Forks: 12

DRSY/MoTIS

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

Language: Swift - Size: 16.5 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 121 - Forks: 10

ashvardanian/SwiftSemanticSearch

Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera capture using USearch & UForm AI Swift SDKs for Apple devices ๐Ÿ

Language: Swift - Size: 623 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 118 - Forks: 7

deepset-ai/haystack-demos

Fully working applications that demonstrate how to use Haystack to implement various use cases

Language: Python - Size: 13.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 117 - Forks: 24

nomic-ai/semantic-search-app-template

Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI

Language: Python - Size: 32.2 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 115 - Forks: 26

TheMind-AI/fluid-db

Fluid Database

Language: Python - Size: 1.45 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 114 - Forks: 8

transitive-bullshit/bens-bites-ai-search

AI search for all the best resources in AI โ€“ย powered by Ben's Bites ๐Ÿ’ฏ

Language: TypeScript - Size: 3.82 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 114 - Forks: 19

ChatFAQ/ChatFAQ

Open-source ecosystem for building AI-powered conversational solutions using RAG, agents, FSMs, and LLMs.

Language: Python - Size: 10.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 108 - Forks: 9

foxminchan/LawKnowledge

A legal knowledge search and Q&A application based on Vietnam's Legal Code and legal document database โš–๏ธ

Language: TypeScript - Size: 174 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 107 - Forks: 8

0xDebabrata/citrus

(distributed) vector database

Language: Python - Size: 935 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 104 - Forks: 13

sinanuozdemir/oreilly-retrieval-augmented-gen-ai

See how to augment LLMs with real-time data for dynamic, context-aware apps - Rag + Agents + GraphRAG.

Language: Jupyter Notebook - Size: 18 MB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 93 - Forks: 51

ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

Language: Python - Size: 243 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 20

acheong08/vectordb ๐Ÿ“ฆ

A simple vector database: Text encoding, semantic search, document storage

Language: Go - Size: 145 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 88 - Forks: 6

mikeroyal/NLP-Guide

Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

Language: Python - Size: 315 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 85 - Forks: 16

DiceTechJobs/VectorsInSearch

Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015

Language: Python - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 82 - Forks: 15

colonelwatch/abstracts-search

Semantic search engine indexing 110 million academic publications

Language: Python - Size: 143 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 80 - Forks: 4

weaviate/typescript-client

Official Weaviate TypeScript Client

Language: TypeScript - Size: 4.24 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 78 - Forks: 25

CLUEbenchmark/QBQTC

QBQTC: ๅคง่ง„ๆจกๆœ็ดขๅŒน้…ๆ•ฐๆฎ้›†

Language: Python - Size: 10.6 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 78 - Forks: 9

haven-jeon/LegalQA

Korean LegalQA using SentenceKoBART

Language: Python - Size: 140 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 77 - Forks: 26

sazonovanton/SirChatalot

SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tools and semantic search in vector DB.

Language: Python - Size: 674 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 72 - Forks: 13

unmonoqueteclea/voilib

๐ŸŽง Podcast Search Engine. Try it now for free or run your own instance.

Language: Python - Size: 6.61 MB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 5

ibm-self-serve-assets/Blended-RAG

Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers

Language: Jupyter Notebook - Size: 6.72 MB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 65 - Forks: 4