Topic: "text-retrieval"
HanXinzi-AI/awesome-NLP-resources
a collection of NLP projects&tools. 自然语言处理方向项目和工具集合。
Size: 17.2 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 209 - Forks: 33

arian-askari/ChatGPT-RetrievalQA-CIKM2023
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.
Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 142 - Forks: 7

jiepujiang/LuceneTutorial
A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).
Language: Java - Size: 372 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 72 - Forks: 17

ElmiraGhorbani/chatgpt-long-term-memory
The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of simultaneous users and external sources.
Language: Python - Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 56 - Forks: 3

wjpoom/SPEC
[CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"
Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 43 - Forks: 0

miccunifi/Cross-the-Gap
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
Language: Python - Size: 23.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 36 - Forks: 0

GoodAI/goodai-ltm
A Python library for long-term memory in language models. Improve conversational scenarios and create autonomous learning agents with enhanced context.
Language: Python - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 36 - Forks: 12

lxucs/commoncrawl-warc-retrieval
Python tools to retrieve text from CommonCrawl WARC files based on cdx index.
Language: Python - Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 3

usnistgov/trec-browser
Metadata browser of TREC
Language: Jupyter Notebook - Size: 103 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9 - Forks: 2

anhquan075/CS336-legal-text-retrieval
CS336 Final Project - Vietnamese Legal Text Retrieval
Language: Python - Size: 157 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

soulteary/text-retrieval-example
Let's talk about text retrieval.
Language: Go - Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

jarvis0/image-search
🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.
Language: Jupyter Notebook - Size: 55.9 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

SavanK/FakeNewsChallenge
Combating fake news problem
Language: Java - Size: 40.9 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 0

nhtlongcs/elastic-search-docker
This is a docker compose file for running elastic search in a docker container. It is based on the official elastic search docker image
Language: Python - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

Bladefidz/data-mining
Fundamental of Data Mining: Study case and implementations.
Language: Jupyter Notebook - Size: 243 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 2

MChatzakis/DIS-TextRetrieval
A Text Retrieval Approach Using BM25+ and BERT Reranking
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

rohinb2/hqbot
A simple bot for HQ trivia that uses OpenCV.
Language: Python - Size: 175 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

mhasnat/LPM_CityDB
This repository provides the materials to experiment with the CityDB dataset for License Plate Matching (LPM)
Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tkhan11/NLP-based-Text-Retrieval
Natural Language Processing Based Text Retrieval System in Python
Language: Python - Size: 5.42 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

hymn-ing/text-retrieval-by-posting-list
Language: C++ - Size: 43.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Farhaj499/RAG_with_ChromaDB
This project implements an Extractive Question Answering (EQA) system that extracts answers from a set of downloaded text files based on user queries.
Language: Jupyter Notebook - Size: 177 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Mounir-charef/sentiment-analysis-binary-rating
This project utilizes machine learning and deep learning techniques to perform sentiment analysis on text reviews, automatically categorizing them as positive or negative. It provides valuable insights into user opinions and emotions expressed in textual data.
Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

gorjanradevski/macchina
Codebase for "Self-supervised context-aware Covid-19 document exploration through atlas grounding" as well as links to the tools mentioned in the paper. Work done within ESAT-PSI at KU Leuven.
Language: Python - Size: 553 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AdithyaSanyal/Voice-based-Personal-Assistant
A voice based personal assistant which has different functionalities right from voice based text, image retrieval, a chatbot to a text summarizer and an automatic question generator. Made by amalgamating different concepts of NLP and Machine Learning together
Language: Python - Size: 124 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

JalajVora/Text-Analytics-with-Multi-Class-and-Imbalanced-Learning
Genre Identification task along with Text Analytics with Multi-Class and Imbalanced Learning on Gutenberg Corpus
Language: HTML - Size: 166 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sarthak268/Multimedia-Computing-and-Applications
This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.
Language: Python - Size: 3.68 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

trungdangtapcode/Flashcard-Recommendation-Extension
Flashcard recommendation based on BEIT and TF score and can be used as normal flashcard extension
Language: TypeScript - Size: 1.06 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vedaant00/uhsr
UHSR (Unified Hyperbolic Spectral Retrieval) is a next-generation hybrid text retrieval framework that combines BM25 (Lexical Search) with FAISS/Pinecone (Semantic Search), enhanced by Spectral Re-Ranking & AI-Powered Reranking. It supports multiple similarity metrics, provides interpretable normalized scores, & is designed for scalability & speed.
Language: Python - Size: 484 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bnvulpe/code-extractor
Transforming images into code at a click. Upload a photo or screenshot and copy the code to your script in seconds!
Language: HTML - Size: 295 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sourceduty/Geo-Historic_Word_Valuation
🔤 A new original text mining and information retrieval method.
Size: 12.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

iiTzMohit/YTQueryBot
YTQueryBot is a web app that answers questions about YouTube videos. It uses Streamlit for the UI, LangChain for transcript processing, and OpenAI for generating responses from video data.
Language: Python - Size: 29.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

LeonardoSaccotelli/Numerical-Methods-For-Computer-Science
Basic and advanced linear algebra and numerical problems, numerical algorithms, and techniques with multiple applications in the field of Computer Science.
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

rootguillen/Patent-Search-System-with-Gradio
Developed by Gyudong HAN, Counsellor, WIPO ([email protected]). Developed this system with reference to the general text retrieval system which was uploaded together with the video clip named "LangChain Retrieval QA Over Multiple Files with ChromaDB". I only added the implementation of Gradio for its UI.
Language: Jupyter Notebook - Size: 35.2 KB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

hriaz17/hw1-python
A template repository for the Python version of Homework 1 from CSC 483-583: Text Retrieval and Web Search taught by Prof. Mihai Surdeanu & Haris Riaz, Spring 2024
Language: Python - Size: 351 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hriaz17/hw1-java
A template repository for the Java version of Homework 1 from CSC 483-583: Text Retrieval and Web Search taught by Mihai Surdeanu & Haris Riaz, Spring 2024
Language: Java - Size: 358 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VedangW/upr-kilt
Unsupervised Passage Retrieval for Question Answering, Fact Checking, and Entity Linking on the KILT benchmark using the T5 language model series.
Language: Python - Size: 82 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

avojak/oise
Open IRC Search Engine
Language: Java - Size: 166 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Adarsh-sophos/Smart-Library
Identifying Books on Library Shelves using Supervised Deep Learning.
Language: Jupyter Notebook - Size: 469 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

DavidCrgh/RIT_Progra1_2018
Buscador de man pages con modelo vectorial y BM25.
Language: C# - Size: 97.7 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

chen0040/java-text-retrieval
Text retrieval framework that implements vector space model and language model
Language: Shell - Size: 47.9 KB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1
