An open API service providing repository metadata for many open source software ecosystems.

Topic: "document-retrieval"

chroma-core/chroma

the AI-native open-source embedding database

Language: Rust - Size: 515 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 19,793 - Forks: 1,610

vearch/vearch

Distributed vector search for AI-native applications

Language: Go - Size: 35.9 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 2,168 - Forks: 339

Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

Language: TypeScript - Size: 12.6 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 1,860 - Forks: 292

redis-developer/redis-arXiv-search

Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.

Language: Python - Size: 1000 KB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 144 - Forks: 24

grafana/vectorapi 📦

pgvector + embeddings API

Language: Python - Size: 559 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 4

Amitha353/Machine-Learning-Foundation-Case-Study

Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 19 - Forks: 9

vTuanpham/Vietnamese_QA_System

Vietnamese long form question answering system with documents retrieval.

Language: Python - Size: 444 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 6

HennyJie/GNN-DocRetrieval

Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation

Language: Python - Size: 189 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 1

Syed007Hassan/Document-Querying-With-VectorDB

Document Querying with LLMs - Google PaLM API: Semantic Search With LLM Embeddings

Language: Python - Size: 395 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

manan-paneri-99/Vector-Space-based-Document-Retrieval-system

Retrieves the top 10 documents from the Wikipedia corpus for a user inputted free-text query

Language: Python - Size: 4.05 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 2

aniketwdubey/chatpdf

This project is a Document Retrieval application that utilizes Retrieval-Augmented Generation (RAG) techniques to enable users to interact with uploaded PDF documents. By leveraging a Large Language Model (LLM), users can ask questions about the content of the documents and receive accurate answers based on the information retrieved.

Language: Jupyter Notebook - Size: 961 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 6 - Forks: 1

marcomoldovan/hierarchical-language-modeling

We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequently on a document level and performing masked token prediction.

Language: Jupyter Notebook - Size: 6.83 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

DebanjanSarkar/askdoc

The Intelligent "ASKDOC" project combines the power of Langchain, Azure, OpenAI models, and Python to deliver an intelligent question-answering system, that scans your PDF documents and answer queries based on its contents. It can be queried using Human Natural Language.

Language: Python - Size: 520 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

boudinfl/redefining-absent-keyphrases

Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"

Language: Python - Size: 147 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

SubhangiSati/LangChat-Explorer

"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."

Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

maxsagt/lambda-instructor

Run text embeddings with Instructor-Large on AWS Lambda.

Language: Shell - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

shrebox/Information-Retrieval

Compilation of Information Retrieval codes.

Language: Jupyter Notebook - Size: 89.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 1

ndtands/Information-Retrieval

Language: Python - Size: 67.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

anaramirli/snlp-information-retrieval

A two-stage information retrieval model using baseline TF-IDF model and refined BM25.

Language: Python - Size: 6.03 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

unendschlossen2/chatbot_jade_hs_planspiel

An RAG-Chatbot developed for a business-oriented-game at the JADE HOCHSCHULE

Language: Python - Size: 153 KB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

MohammedNasserAhmed/CodeXpert

CodeXpert: A cutting-edge AI-powered code analysis tool leveraging CodeLlama, FAISS, and HuggingFace for efficient code understanding, explanation, and optimization. 🚀✨

Language: Python - Size: 589 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

wlzhao22/mirlecture

course slides for Multimedia Information Retrieval

Language: TeX - Size: 160 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Jiho-YesNLP/text-summ-for-doc-retrieval

Neural text summarization for document retrieval

Language: Python - Size: 619 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

xuan25/COM3110-Document-Retrieval-Assignment

Language: Python - Size: 1.13 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

spyros-briakos/Document-Retrieval-and-Question-Answering-with-BERT

Initially implement Document-Retrieval-System with SBERT embeddings and evaluate it in CORD-19 dataset. Afterwards, fine tune BERT model with SQuAD.v2 dataset so as to evaluate it in Question Answering task.

Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

Md-Emon-Hasan/Retrieval-Augmented-Generation-RAG

RAG enhances LLMs by retrieving relevant external knowledge before generating responses, improving accuracy and reducing hallucinations.

Language: Jupyter Notebook - Size: 569 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

VerisimilitudeX/pathologyandprotein

Language: Jupyter Notebook - Size: 599 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

heydido/DocumentQnA

A Document QnA bot

Language: Jupyter Notebook - Size: 849 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

timothyckl/iota

a minimal local embedding database.

Language: Python - Size: 668 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

DanieleMorotti/Argument-retrieval-for-comparative-questions

Neural Language Processing (NLP) project (AY 2022/2023)

Language: Jupyter Notebook - Size: 1.37 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

AGiannoutsos/COVID19-document-retrieval-with-BERT

This project is about developing a document retrieval system to return titles and the context of scientific papers containing the answer to a given user question

Language: Jupyter Notebook - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

R-Aravind/document-retrieval

Language: Jupyter Notebook - Size: 54.9 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

anne27/Information-Retrieval

An implementation of basic IR techniques from scratch.

Language: Python - Size: 27.8 MB - Last synced at: 11 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

tehmas/document-retrieval-system

A program to construct and read an inverted index.

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 2

snoai/magi-markdown

MAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instructions, and explicit document relationships, creating a seamless bridge between human-readable content and LLM/agent processing. Perfect for RAG,KAG

Language: TypeScript - Size: 512 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Erfanafshar/document-search-engine

This Java project builds a search engine using information retrieval techniques like TF-IDF and cosine similarity.

Language: Java - Size: 1.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AadityaRajGupta/AetherCare_Platform

AetherCare is an AI-powered healthcare platform that leverages Generative AI to assist users with medical inquiries, symptom-based disease prediction, hospital location services, and a knowledge repository for healthcare education. This project aims to enhance accessibility to healthcare information.

Language: HTML - Size: 13.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AadityaRajGupta/AetherCare_ChatBot

This repository contains a healthcare-based chatbot project that integrates advanced generative AI techniques with document retrieval for answering medical queries. It leverages vector-based search for relevant information retrieval and uses transformer-based models for generating responses.

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lh0x00/embs

embs is a Python toolkit for retrieving documents (via Docsifer), generating embeddings (via Lightweight Embeddings API), and ranking texts with an optional caching system.

Language: Python - Size: 112 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hbujakow/NLP2024Z_PDF_Assistant

Language: Jupyter Notebook - Size: 2.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

huacenxu/Embedding-Models-for-AI-Retrieval

This project develops a domain-specific embedding model to enhance document retrieval in AI-powered search systems. It incorporates techniques like synthetic data generation, model fine-tuning, and vector search using FAISS, evaluated with MRR@5 for performance.

Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

prasannaghimiree/AI-Powered-Documentation-Retrieval-Assistant-for-Open-Source-Projects

An AI tool that uses the Ollama Mistral LLM for understanding and summarizing code, with Ollama all-MiniLM-v3 embeddings stored in a ChromaDB vector database. It provides automatic documentation summaries, answers questions, and generates contribution guides to simplify onboarding and boost productivity for open-source developers

Language: Python - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jdlflr/sense_aware_query_expansion

An out-of-the-box, corpus-agnostic query expansion tool for lexical retrieval systems.

Language: Python - Size: 16.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

HamidrezaGholamrezaei/LLM_LangChain_ChatBot

LLM_LangChain_ChatBot is a contextual document retrieval chatbot that leverages LangChain to process user queries and generate accurate responses based on the content of retrieved documents. Ideal for applications requiring precise information retrieval and context-aware interactions.

Language: Jupyter Notebook - Size: 84 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

hzionn/Information-Retrieval

A basic and intuitive Python module for (Vector Space) IR system. (Focuses on simplicity and understandability)

Language: Python - Size: 8.17 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

hzionn/Documents-Retrieval

Evaluations among different Retrieval Models

Language: Perl - Size: 1.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

saadtariq-ds/langchain_chat_with_data

Dive into LangChain, a powerful platform that lets you interact with your data like never before. This guide offers insights on its unique capabilities, helping you tap into your data in conversational ways.

Language: Jupyter Notebook - Size: 2.26 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

WD-Leong/NLP-BERT-Retrieval

Document Retrieval using BERT.

Language: Python - Size: 69.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

blaze7451/Project-JaQUAd-QA-System

Extractive QA system using JaQUAd dataset

Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

IsuruBoyagane15/sinhala-song-lyrics-search-engine

Elasticsearch based song lyrics search engine for Sinhala.

Language: Python - Size: 928 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

n1ghtf4l1/bookish-waddle

Implement Document Retrieval techniques to find who is closest to whom based on wikipedia data.

Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

angelosps/Document-Retrieval-System

🗂️ A document retrieval system on CORD-19 dataset

Language: Jupyter Notebook - Size: 139 KB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

JeremyLeiLiu/ProxLogPRF

ProxLogPRF: A Proximity-based Log-logistic Feedback Model for Pseudo-relevance Feedback

Language: Java - Size: 20.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

rajatb115/Document-Reranking

Assignment on Document Reranking

Language: Python - Size: 345 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

IsuruBoyagane15/vue4logs-parser

Automatic structuring of textual computer system logs using document retrieval.

Language: Python - Size: 73.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

zmf0507/IRIS

This repositiory implements various concepts and algorithms of Information Retrieval such as document classification, document retrieval, positional and logical text queries, Rocchio algorithm, retrieval evaluation metric etc.

Language: Jupyter Notebook - Size: 925 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

arpytanshu/latent-semantic-indexing

This Latent Semantic Indexing [ LSI ] model collects, parses, and stores documents to facilitate fast and accurate information retrieval through queries.

Language: Python - Size: 3.45 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

phreakyphoenix/Graphlab-ML-Projects

These individiual projects are part of the coursework for the Coursera University of Washington course. I've also tried some fun new experiments with the data. Have fun checking them out !!

Language: Jupyter Notebook - Size: 218 MB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Magho/Document-Retrieval

Document retrieval from Wikipedia data using graph-lab

Language: Jupyter Notebook - Size: 55.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Jicol95/Vector-Space-Retrieval

Using Vector Space Retreival to return documents related to a search query

Language: Python - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

annieyan/clustering

Kmeans, Kmeans++, Gaussian Mixtures

Language: Python - Size: 229 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

Related Topics
information-retrieval 13 machine-learning 13 embeddings 11 nlp 8 rag 8 python 7 tf-idf 7 vector-database 7 deep-learning 6 natural-language-processing 6 langchain 6 llms 6 vector-search 6 faiss 6 retrieval-augmented-generation 5 question-answering 5 langchain-python 5 huggingface 4 vector-space-model 4 chatbot 4 clustering 4 pinecone 4 ai-chatbot 4 openai 3 python3 3 flask 3 bert 3 generative-ai 3 transformers 3 cord-19-dataset 3 llm 3 chatbots 3 sentence-embeddings 3 language-model 2 chat-application 2 chroma 2 bm25 2 tfidf 2 song-recommender 2 pytorch 2 sentiment-analysis 2 sbert 2 regression 2 embedding-models 2 okapi-bm25 2 transformer 2 evaluation-metrics 2 turicreate 2 healthcare 2 sentence-transformers 2 natural-language-understanding 2 ai-agents 2 ai-native 2 semantic-search 2 ai 2 classification 2 chatapp 1 chatgpt 1 rag-pipeline 1 retrieval-qa 1 chat-app 1 svd 1 query-expansion 1 text-embedding 1 text-summarization 1 precision-medicine 1 bert-model 1 api 1 covid19 1 pdf-document-processor 1 q-and-a-bot 1 search-interface 1 covid-19 1 convolutional-neural-networks 1 cse416 1 universityofwashington 1 pgvector 1 absent-keyphrases 1 digital-library 1 keyphrase-generation 1 retrieval-effectiveness 1 chromadb 1 custom-llm 1 huggingface-rag 1 knowledge-augmented-llm 1 bert-embeddings 1 knowledge-graph 1 langchain-rag 1 llm-applications 1 llm-retrieval 1 multi-modal-rag 1 prompt-engineering 1 word-sense-disambiguation 1 pdf-document-query 1 fastapi 1 large-language-models 1 ai-native-database 1 cloud-native 1 hybrid-search 1 vectors 1