Topic: "semantic-search"
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI ๐ https://microsoft.github.io/generative-ai-for-beginners/
Language: Jupyter Notebook - Size: 125 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 78,372 - Forks: 40,640

meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
Language: Rust - Size: 69.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 50,474 - Forks: 1,992

khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Language: Python - Size: 109 MB - Last synced at: 39 minutes ago - Pushed at: about 4 hours ago - Stars: 28,763 - Forks: 1,608

typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch โก ๐ โจ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
Language: C++ - Size: 12.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 22,793 - Forks: 714

deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language: Python - Size: 48.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 20,293 - Forks: 2,132

arc53/DocsGPT
DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.
Language: TypeScript - Size: 81.1 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 15,560 - Forks: 1,659

weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native databaseโ.
Language: Go - Size: 964 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 13,117 - Forks: 926

neuml/txtai
๐ก All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Language: Python - Size: 52 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 10,705 - Forks: 679

zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Language: Python - Size: 22.3 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 7,511 - Forks: 533

lancedb/lancedb
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
Language: Python - Size: 17.4 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 6,210 - Forks: 453

superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
Language: Python - Size: 73.2 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 5,032 - Forks: 492

marqo-ai/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Language: Python - Size: 79.5 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4,821 - Forks: 202

docarray/docarray
Represent, send, store and search multimodal data
Language: Python - Size: 242 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 3,042 - Forks: 233

ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
Language: Python - Size: 83.4 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 3,022 - Forks: 374

gmpetrov/databerry
The no-code platform for building custom LLM Agents
Size: 73.2 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 2,932 - Forks: 429

pinecone-io/examples
Jupyter Notebooks to help you get hands-on with Pinecone vector databases
Language: Jupyter Notebook - Size: 314 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2,870 - Forks: 1,050

mazzzystar/Queryable
Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.
Language: Swift - Size: 1.93 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 2,830 - Forks: 435

filipecalegario/awesome-generative-ai
A curated list of Generative AI tools, works, models, and references
Size: 1.56 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,778 - Forks: 460

unum-cloud/usearch
Fast Open-Source Search & Clustering engine ร for Vectors & ๐ Strings ร in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram ๐
Language: C++ - Size: 4.22 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 2,654 - Forks: 175

freedmand/semantra
Multi-tool for semantic search
Language: Python - Size: 9.01 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 2,597 - Forks: 153

rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language: Jupyter Notebook - Size: 3.75 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 2,536 - Forks: 222

embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python - Size: 34.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,418 - Forks: 373

microsoft/kernel-memory
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
Language: C# - Size: 25.6 MB - Last synced at: 11 days ago - Pushed at: 28 days ago - Stars: 1,882 - Forks: 355

NotJoeMartinez/yt-fts
YouTube Full Text Search - Search all of a YouTube channel from the command line
Language: Python - Size: 367 KB - Last synced at: about 2 hours ago - Pushed at: 7 months ago - Stars: 1,685 - Forks: 85

IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
Language: Python - Size: 20.4 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 1,513 - Forks: 139

frutik/awesome-search
Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness
Language: HTML - Size: 1.26 MB - Last synced at: 9 days ago - Pushed at: 22 days ago - Stars: 1,432 - Forks: 124

gnes-ai/gnes ๐ฆ
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
Language: Python - Size: 52 MB - Last synced at: 26 days ago - Pushed at: over 5 years ago - Stars: 1,267 - Forks: 209

aws-samples/aws-genai-llm-chatbot
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS
Language: TypeScript - Size: 82.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,221 - Forks: 371

lotus-data/lotus
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Language: Python - Size: 1.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,155 - Forks: 100

unum-cloud/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ๐ video, up to 5x faster than OpenAI CLIP and LLaVA ๐ผ๏ธ & ๐๏ธ
Language: Python - Size: 669 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,116 - Forks: 63

model-zoo/shift-ctrl-f ๐ฆ
๐ Search the information available on a webpage using natural language instead of an exact string match.
Language: JavaScript - Size: 87.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1,105 - Forks: 42

superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Language: Jupyter Notebook - Size: 110 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,023 - Forks: 73

Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Language: Python - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 1,014 - Forks: 61

Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 863 - Forks: 54

PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
Language: Python - Size: 2.47 MB - Last synced at: about 2 hours ago - Pushed at: 5 months ago - Stars: 786 - Forks: 55

hayabhay/frogbase ๐ฆ
Transform audio-visual content into navigable knowledge.
Language: Python - Size: 1.22 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 785 - Forks: 95

IntelLabs/RAG-FiT
Framework for enhancing LLMs for RAG tasks using fine-tuning.
Language: Python - Size: 925 KB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 737 - Forks: 56

primeqa/primeqa
The prime repository for state-of-the-art Multilingual Question Answering research and development.
Language: Python - Size: 51 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 733 - Forks: 57

koursaros-ai/nboost
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)
Language: Python - Size: 14.1 MB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 678 - Forks: 69

cocoindex-io/cocoindex
ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.
Language: Rust - Size: 3.58 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 579 - Forks: 42

aryn-ai/sycamore
๐ Sycamore is an LLM-powered search and analytics platform for unstructured data.
Language: Python - Size: 99.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 506 - Forks: 59

hamelsmu/code_search
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deepย Learning"
Language: Jupyter Notebook - Size: 73.6 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 488 - Forks: 137

qdrant/mcp-server-qdrant
An official Qdrant Model Context Protocol (MCP) server implementation
Language: Python - Size: 313 KB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 463 - Forks: 52

jina-ai/examples ๐ฆ
Jina examples and demos to help you get started
Language: Python - Size: 189 MB - Last synced at: about 18 hours ago - Pushed at: over 3 years ago - Stars: 459 - Forks: 142

kelindar/search
Go library for embedded vector search and semantic embeddings using llama.cpp
Language: Go - Size: 714 KB - Last synced at: about 20 hours ago - Pushed at: about 1 month ago - Stars: 430 - Forks: 13

nixiesearch/nixiesearch
Hybrid search engine, combining best features of text and semantic search worlds
Language: Scala - Size: 13 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 413 - Forks: 9

alexklibisz/elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
Language: Scala - Size: 139 MB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 379 - Forks: 49

JohnGiorgi/DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
Language: Python - Size: 702 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 379 - Forks: 33

raphaelsty/neural-cherche
Neural Search
Language: Python - Size: 3.1 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 354 - Forks: 18

Agrover112/awesome-semantic-search
A curated list of awesome resources related to Semantic Search๐ and Semantic Similarity tasks.
Size: 371 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 353 - Forks: 29

askaitools/askaitools-community-edition
A cutting-edge search engine project tailored specifically for the AI product
Language: TypeScript - Size: 742 KB - Last synced at: 28 days ago - Pushed at: 9 months ago - Stars: 347 - Forks: 29

raphaelsty/cherche
Neural Search
Language: Python - Size: 41.6 MB - Last synced at: 16 days ago - Pushed at: 11 months ago - Stars: 328 - Forks: 15

deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack ๐
Language: Jupyter Notebook - Size: 5.17 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 315 - Forks: 107

vector-ai/vectorai
Vector AI โ A platform for building vector based applications. Encode, query and analyse data using vectors.
Language: Python - Size: 27 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 305 - Forks: 37

alondmnt/joplin-plugin-jarvis
Joplin (note-taking) assistant running a very intelligent system (GPT, Claude, Gemini, Ollama, Hugging Face)
Language: TypeScript - Size: 3.43 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 275 - Forks: 25

do-me/SemanticFinder
SemanticFinder - frontend-only live semantic search with transformers.js
Language: JavaScript - Size: 30.7 MB - Last synced at: 8 days ago - Pushed at: 25 days ago - Stars: 268 - Forks: 18

DiceTechJobs/ConceptualSearch
Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Language: Jupyter Notebook - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 257 - Forks: 59

zilliztech/akcio
Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses vector databases to fetch relevant documents to enhance the quality and relevance of the output.
Language: Python - Size: 1.56 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 255 - Forks: 39

treygrainger/ai-powered-search
The codebase for the book "AI-Powered Search" (Manning Publications, 2024)
Language: Jupyter Notebook - Size: 65.3 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 254 - Forks: 63

ZachNagengast/similarity-search-kit
๐ SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.
Language: Swift - Size: 175 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 253 - Forks: 24

intelligentnode/IntelliNode
Access the latest AI models like ChatGPT, LLaMA, Deepseek, Diffusion, Hugging face, and beyond through a unified prompt layer and performance evaluation
Language: JavaScript - Size: 10 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 250 - Forks: 16

Hellisotherpeople/CX_DB8
a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Language: Python - Size: 6.21 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 226 - Forks: 26

pinecone-io/pinecone-ts-client
The official TypeScript/Node client for the Pinecone vector database
Language: TypeScript - Size: 2 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 222 - Forks: 40

maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Language: Jupyter Notebook - Size: 32.4 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 222 - Forks: 11

nitaiaharoni1/vector-storage
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.
Language: TypeScript - Size: 175 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 220 - Forks: 38

Mihaiii/semantic-autocomplete
A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.
Language: JavaScript - Size: 4.9 MB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 216 - Forks: 4

AmenRa/retriv
A Python Search Engine for Humans ๐ฅธ
Language: Python - Size: 372 KB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 213 - Forks: 25

fzliu/radient
Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.
Language: Python - Size: 65.4 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 207 - Forks: 7

Ravn-Tech/HyperTag
HyperTag - Intuitive Knowledge Management WebApp & CLI for Humans using Deep Learning & Tags
Language: Python - Size: 1 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 196 - Forks: 14

sjy-dv/coltt
Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.
Language: Go - Size: 55.2 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 184 - Forks: 1

doobidoo/mcp-memory-service
MCP server providing semantic memory and persistent storage capabilities for Claude using ChromaDB and sentence transformers.
Language: Python - Size: 766 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 167 - Forks: 27

kuutsav/information-retrieval ๐ฆ
Neural information retrieval / Semantic search / Bi-encoders
Language: Jupyter Notebook - Size: 6.69 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 167 - Forks: 21

DmitryKey/bert-solr-search
Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU
Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 163 - Forks: 32

md-experiments/elastic_transformers
Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Language: Jupyter Notebook - Size: 447 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 159 - Forks: 28

augustwester/searchthearxiv
The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.
Language: Python - Size: 124 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 145 - Forks: 14

rom1504/awesome-semantic-search
Semantic search with embeddings: index anything
Size: 14.6 KB - Last synced at: 11 days ago - Pushed at: about 3 years ago - Stars: 139 - Forks: 7

dmotz/emdash
๐๐งโโ๏ธ Wisdom indexer โ use AI to organize text snippets so you can actually remember & learn from what you read
Language: Elm - Size: 4.76 MB - Last synced at: 35 minutes ago - Pushed at: 15 days ago - Stars: 138 - Forks: 9

patricktrainer/duckdb-embedding-search
Fast similarity search using DuckDB
Language: Python - Size: 5.97 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 128 - Forks: 3

yeldarby/nycerebro
Language: TypeScript - Size: 2.33 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 124 - Forks: 12

DRSY/MoTIS
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
Language: Swift - Size: 16.5 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 121 - Forks: 10

ashvardanian/SwiftSemanticSearch
Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera capture using USearch & UForm AI Swift SDKs for Apple devices ๐
Language: Swift - Size: 623 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 118 - Forks: 7

deepset-ai/haystack-demos
Fully working applications that demonstrate how to use Haystack to implement various use cases
Language: Python - Size: 13.3 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 117 - Forks: 24

nomic-ai/semantic-search-app-template
Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI
Language: Python - Size: 32.2 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 115 - Forks: 26

TheMind-AI/fluid-db
Fluid Database
Language: Python - Size: 1.45 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 114 - Forks: 8

transitive-bullshit/bens-bites-ai-search
AI search for all the best resources in AI โย powered by Ben's Bites ๐ฏ
Language: TypeScript - Size: 3.82 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 114 - Forks: 19

ChatFAQ/ChatFAQ
Open-source ecosystem for building AI-powered conversational solutions using RAG, agents, FSMs, and LLMs.
Language: Python - Size: 10.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 108 - Forks: 9

foxminchan/LawKnowledge
A legal knowledge search and Q&A application based on Vietnam's Legal Code and legal document database โ๏ธ
Language: TypeScript - Size: 174 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 107 - Forks: 8

0xDebabrata/citrus
(distributed) vector database
Language: Python - Size: 935 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 104 - Forks: 13

sinanuozdemir/oreilly-retrieval-augmented-gen-ai
See how to augment LLMs with real-time data for dynamic, context-aware apps - Rag + Agents + GraphRAG.
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 93 - Forks: 51

ddangelov/RESTful-Top2Vec
Expose a Top2Vec model with a REST API.
Language: Python - Size: 243 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 20

acheong08/vectordb ๐ฆ
A simple vector database: Text encoding, semantic search, document storage
Language: Go - Size: 145 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 88 - Forks: 6

mikeroyal/NLP-Guide
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
Language: Python - Size: 315 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 85 - Forks: 16

DiceTechJobs/VectorsInSearch
Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Language: Python - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 82 - Forks: 15

colonelwatch/abstracts-search
Semantic search engine indexing 110 million academic publications
Language: Python - Size: 143 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 80 - Forks: 4

weaviate/typescript-client
Official Weaviate TypeScript Client
Language: TypeScript - Size: 4.24 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 78 - Forks: 25

CLUEbenchmark/QBQTC
QBQTC: ๅคง่งๆจกๆ็ดขๅน้ ๆฐๆฎ้
Language: Python - Size: 10.6 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 78 - Forks: 9

haven-jeon/LegalQA
Korean LegalQA using SentenceKoBART
Language: Python - Size: 140 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 77 - Forks: 26

sazonovanton/SirChatalot
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tools and semantic search in vector DB.
Language: Python - Size: 674 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 72 - Forks: 13

unmonoqueteclea/voilib
๐ง Podcast Search Engine. Try it now for free or run your own instance.
Language: Python - Size: 6.61 MB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 5

ibm-self-serve-assets/Blended-RAG
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers
Language: Jupyter Notebook - Size: 6.72 MB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 65 - Forks: 4
