Topic: "document-retrieval"
chroma-core/chroma
the AI-native open-source embedding database
Language: Rust - Size: 515 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 19,793 - Forks: 1,610

vearch/vearch
Distributed vector search for AI-native applications
Language: Go - Size: 35.9 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 2,168 - Forks: 339

Mintplex-Labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
Language: TypeScript - Size: 12.6 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,860 - Forks: 292

redis-developer/redis-arXiv-search
Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.
Language: Python - Size: 1000 KB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 144 - Forks: 24

grafana/vectorapi 📦
pgvector + embeddings API
Language: Python - Size: 559 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 4

Amitha353/Machine-Learning-Foundation-Case-Study
Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 19 - Forks: 9

vTuanpham/Vietnamese_QA_System
Vietnamese long form question answering system with documents retrieval.
Language: Python - Size: 444 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 6

HennyJie/GNN-DocRetrieval
Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation
Language: Python - Size: 189 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 1

Syed007Hassan/Document-Querying-With-VectorDB
Document Querying with LLMs - Google PaLM API: Semantic Search With LLM Embeddings
Language: Python - Size: 395 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

manan-paneri-99/Vector-Space-based-Document-Retrieval-system
Retrieves the top 10 documents from the Wikipedia corpus for a user inputted free-text query
Language: Python - Size: 4.05 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 2

aniketwdubey/chatpdf
This project is a Document Retrieval application that utilizes Retrieval-Augmented Generation (RAG) techniques to enable users to interact with uploaded PDF documents. By leveraging a Large Language Model (LLM), users can ask questions about the content of the documents and receive accurate answers based on the information retrieved.
Language: Jupyter Notebook - Size: 961 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 6 - Forks: 1

marcomoldovan/hierarchical-language-modeling
We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequently on a document level and performing masked token prediction.
Language: Jupyter Notebook - Size: 6.83 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

DebanjanSarkar/askdoc
The Intelligent "ASKDOC" project combines the power of Langchain, Azure, OpenAI models, and Python to deliver an intelligent question-answering system, that scans your PDF documents and answer queries based on its contents. It can be queried using Human Natural Language.
Language: Python - Size: 520 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

boudinfl/redefining-absent-keyphrases
Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"
Language: Python - Size: 147 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

SubhangiSati/LangChat-Explorer
"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."
Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

maxsagt/lambda-instructor
Run text embeddings with Instructor-Large on AWS Lambda.
Language: Shell - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

shrebox/Information-Retrieval
Compilation of Information Retrieval codes.
Language: Jupyter Notebook - Size: 89.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 1

ndtands/Information-Retrieval
Language: Python - Size: 67.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

anaramirli/snlp-information-retrieval
A two-stage information retrieval model using baseline TF-IDF model and refined BM25.
Language: Python - Size: 6.03 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

unendschlossen2/chatbot_jade_hs_planspiel
An RAG-Chatbot developed for a business-oriented-game at the JADE HOCHSCHULE
Language: Python - Size: 153 KB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

MohammedNasserAhmed/CodeXpert
CodeXpert: A cutting-edge AI-powered code analysis tool leveraging CodeLlama, FAISS, and HuggingFace for efficient code understanding, explanation, and optimization. 🚀✨
Language: Python - Size: 589 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

wlzhao22/mirlecture
course slides for Multimedia Information Retrieval
Language: TeX - Size: 160 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Jiho-YesNLP/text-summ-for-doc-retrieval
Neural text summarization for document retrieval
Language: Python - Size: 619 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

xuan25/COM3110-Document-Retrieval-Assignment
Language: Python - Size: 1.13 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

spyros-briakos/Document-Retrieval-and-Question-Answering-with-BERT
Initially implement Document-Retrieval-System with SBERT embeddings and evaluate it in CORD-19 dataset. Afterwards, fine tune BERT model with SQuAD.v2 dataset so as to evaluate it in Question Answering task.
Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

Md-Emon-Hasan/Retrieval-Augmented-Generation-RAG
RAG enhances LLMs by retrieving relevant external knowledge before generating responses, improving accuracy and reducing hallucinations.
Language: Jupyter Notebook - Size: 569 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

VerisimilitudeX/pathologyandprotein
Language: Jupyter Notebook - Size: 599 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

heydido/DocumentQnA
A Document QnA bot
Language: Jupyter Notebook - Size: 849 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

timothyckl/iota
a minimal local embedding database.
Language: Python - Size: 668 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

DanieleMorotti/Argument-retrieval-for-comparative-questions
Neural Language Processing (NLP) project (AY 2022/2023)
Language: Jupyter Notebook - Size: 1.37 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

AGiannoutsos/COVID19-document-retrieval-with-BERT
This project is about developing a document retrieval system to return titles and the context of scientific papers containing the answer to a given user question
Language: Jupyter Notebook - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

R-Aravind/document-retrieval
Language: Jupyter Notebook - Size: 54.9 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

anne27/Information-Retrieval
An implementation of basic IR techniques from scratch.
Language: Python - Size: 27.8 MB - Last synced at: 11 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

tehmas/document-retrieval-system
A program to construct and read an inverted index.
Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 2

snoai/magi-markdown
MAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instructions, and explicit document relationships, creating a seamless bridge between human-readable content and LLM/agent processing. Perfect for RAG,KAG
Language: TypeScript - Size: 512 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Erfanafshar/document-search-engine
This Java project builds a search engine using information retrieval techniques like TF-IDF and cosine similarity.
Language: Java - Size: 1.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AadityaRajGupta/AetherCare_Platform
AetherCare is an AI-powered healthcare platform that leverages Generative AI to assist users with medical inquiries, symptom-based disease prediction, hospital location services, and a knowledge repository for healthcare education. This project aims to enhance accessibility to healthcare information.
Language: HTML - Size: 13.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AadityaRajGupta/AetherCare_ChatBot
This repository contains a healthcare-based chatbot project that integrates advanced generative AI techniques with document retrieval for answering medical queries. It leverages vector-based search for relevant information retrieval and uses transformer-based models for generating responses.
Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lh0x00/embs
embs is a Python toolkit for retrieving documents (via Docsifer), generating embeddings (via Lightweight Embeddings API), and ranking texts with an optional caching system.
Language: Python - Size: 112 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hbujakow/NLP2024Z_PDF_Assistant
Language: Jupyter Notebook - Size: 2.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

huacenxu/Embedding-Models-for-AI-Retrieval
This project develops a domain-specific embedding model to enhance document retrieval in AI-powered search systems. It incorporates techniques like synthetic data generation, model fine-tuning, and vector search using FAISS, evaluated with MRR@5 for performance.
Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

prasannaghimiree/AI-Powered-Documentation-Retrieval-Assistant-for-Open-Source-Projects
An AI tool that uses the Ollama Mistral LLM for understanding and summarizing code, with Ollama all-MiniLM-v3 embeddings stored in a ChromaDB vector database. It provides automatic documentation summaries, answers questions, and generates contribution guides to simplify onboarding and boost productivity for open-source developers
Language: Python - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jdlflr/sense_aware_query_expansion
An out-of-the-box, corpus-agnostic query expansion tool for lexical retrieval systems.
Language: Python - Size: 16.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

HamidrezaGholamrezaei/LLM_LangChain_ChatBot
LLM_LangChain_ChatBot is a contextual document retrieval chatbot that leverages LangChain to process user queries and generate accurate responses based on the content of retrieved documents. Ideal for applications requiring precise information retrieval and context-aware interactions.
Language: Jupyter Notebook - Size: 84 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

hzionn/Information-Retrieval
A basic and intuitive Python module for (Vector Space) IR system. (Focuses on simplicity and understandability)
Language: Python - Size: 8.17 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

hzionn/Documents-Retrieval
Evaluations among different Retrieval Models
Language: Perl - Size: 1.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

saadtariq-ds/langchain_chat_with_data
Dive into LangChain, a powerful platform that lets you interact with your data like never before. This guide offers insights on its unique capabilities, helping you tap into your data in conversational ways.
Language: Jupyter Notebook - Size: 2.26 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

WD-Leong/NLP-BERT-Retrieval
Document Retrieval using BERT.
Language: Python - Size: 69.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

blaze7451/Project-JaQUAd-QA-System
Extractive QA system using JaQUAd dataset
Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

IsuruBoyagane15/sinhala-song-lyrics-search-engine
Elasticsearch based song lyrics search engine for Sinhala.
Language: Python - Size: 928 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

n1ghtf4l1/bookish-waddle
Implement Document Retrieval techniques to find who is closest to whom based on wikipedia data.
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

angelosps/Document-Retrieval-System
🗂️ A document retrieval system on CORD-19 dataset
Language: Jupyter Notebook - Size: 139 KB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

JeremyLeiLiu/ProxLogPRF
ProxLogPRF: A Proximity-based Log-logistic Feedback Model for Pseudo-relevance Feedback
Language: Java - Size: 20.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

rajatb115/Document-Reranking
Assignment on Document Reranking
Language: Python - Size: 345 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

IsuruBoyagane15/vue4logs-parser
Automatic structuring of textual computer system logs using document retrieval.
Language: Python - Size: 73.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

zmf0507/IRIS
This repositiory implements various concepts and algorithms of Information Retrieval such as document classification, document retrieval, positional and logical text queries, Rocchio algorithm, retrieval evaluation metric etc.
Language: Jupyter Notebook - Size: 925 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

arpytanshu/latent-semantic-indexing
This Latent Semantic Indexing [ LSI ] model collects, parses, and stores documents to facilitate fast and accurate information retrieval through queries.
Language: Python - Size: 3.45 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

phreakyphoenix/Graphlab-ML-Projects
These individiual projects are part of the coursework for the Coursera University of Washington course. I've also tried some fun new experiments with the data. Have fun checking them out !!
Language: Jupyter Notebook - Size: 218 MB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Magho/Document-Retrieval
Document retrieval from Wikipedia data using graph-lab
Language: Jupyter Notebook - Size: 55.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Jicol95/Vector-Space-Retrieval
Using Vector Space Retreival to return documents related to a search query
Language: Python - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

annieyan/clustering
Kmeans, Kmeans++, Gaussian Mixtures
Language: Python - Size: 229 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0
