An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-retrieval"

HanXinzi-AI/awesome-NLP-resources

A collection of NLP projects and tools.

Size: 17.2 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 209 - Forks: 33

arian-askari/ChatGPT-RetrievalQA-CIKM2023

A dataset for training and evaluating question answering retrieval models on ChatGPT responses, with the option of training and evaluating on real human responses.

Language: Jupyter Notebook - Size: 24.9 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 142 - Forks: 7

jiepujiang/LuceneTutorial

A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).

Language: Java - Size: 372 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 72 - Forks: 17

ElmiraGhorbani/chatgpt-long-term-memory

The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of simultaneous users and external sources.

Language: Python - Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 56 - Forks: 3

wjpoom/SPEC

[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 43 - Forks: 0

miccunifi/Cross-the-Gap

[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

Language: Python - Size: 23.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 36 - Forks: 0

GoodAI/goodai-ltm

A Python library for long-term memory in language models. Improve conversational scenarios and create autonomous learning agents with enhanced context.

Language: Python - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 36 - Forks: 12

lxucs/commoncrawl-warc-retrieval

Python tools to retrieve text from CommonCrawl WARC files based on cdx index.

Language: Python - Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 3

usnistgov/trec-browser

Metadata browser for TREC

Language: Jupyter Notebook - Size: 103 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9 - Forks: 2

anhquan075/CS336-legal-text-retrieval

CS336 Final Project - Vietnamese Legal Text Retrieval

Language: Python - Size: 157 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

soulteary/text-retrieval-example

Let's talk about text retrieval.

Language: Go - Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

jarvis0/image-search

🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.

Language: Jupyter Notebook - Size: 55.9 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

SavanK/FakeNewsChallenge

Combating the fake news problem

Language: Java - Size: 40.9 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 0

nhtlongcs/elastic-search-docker

A Docker Compose file for running Elasticsearch in a Docker container, based on the official Elasticsearch Docker image.

Language: Python - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

Bladefidz/data-mining

Fundamentals of Data Mining: case studies and implementations.

Language: Jupyter Notebook - Size: 243 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 2

MChatzakis/DIS-TextRetrieval

A Text Retrieval Approach Using BM25+ and BERT Reranking

Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

rohinb2/hqbot

A simple bot for HQ trivia that uses OpenCV.

Language: Python - Size: 175 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

mhasnat/LPM_CityDB

This repository provides the materials to experiment with the CityDB dataset for License Plate Matching (LPM)

Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tkhan11/NLP-based-Text-Retrieval

Natural Language Processing Based Text Retrieval System in Python

Language: Python - Size: 5.42 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

hymn-ing/text-retrieval-by-posting-list

Language: C++ - Size: 43.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Farhaj499/RAG_with_ChromaDB

This project implements an Extractive Question Answering (EQA) system that extracts answers from a set of downloaded text files based on user queries.

Language: Jupyter Notebook - Size: 177 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Mounir-charef/sentiment-analysis-binary-rating

This project utilizes machine learning and deep learning techniques to perform sentiment analysis on text reviews, automatically categorizing them as positive or negative. It provides valuable insights into user opinions and emotions expressed in textual data.

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

gorjanradevski/macchina

Codebase for "Self-supervised context-aware Covid-19 document exploration through atlas grounding" as well as links to the tools mentioned in the paper. Work done within ESAT-PSI at KU Leuven.

Language: Python - Size: 553 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AdithyaSanyal/Voice-based-Personal-Assistant

A voice-based personal assistant with a range of functions: voice-based text and image retrieval, a chatbot, a text summarizer, and an automatic question generator. Built by combining concepts from NLP and machine learning.

Language: Python - Size: 124 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

JalajVora/Text-Analytics-with-Multi-Class-and-Imbalanced-Learning

Genre identification task, along with text analytics using multi-class and imbalanced learning, on the Gutenberg Corpus

Language: HTML - Size: 166 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sarthak268/Multimedia-Computing-and-Applications

This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.

Language: Python - Size: 3.68 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

trungdangtapcode/Flashcard-Recommendation-Extension

Flashcard recommendation based on BEIT and TF score; can also be used as a normal flashcard extension

Language: TypeScript - Size: 1.06 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vedaant00/uhsr

UHSR (Unified Hyperbolic Spectral Retrieval) is a next-generation hybrid text retrieval framework that combines BM25 (Lexical Search) with FAISS/Pinecone (Semantic Search), enhanced by Spectral Re-Ranking & AI-Powered Reranking. It supports multiple similarity metrics, provides interpretable normalized scores, & is designed for scalability & speed.

Language: Python - Size: 484 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bnvulpe/code-extractor

Transforming images into code at a click. Upload a photo or screenshot and copy the code to your script in seconds!

Language: HTML - Size: 295 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sourceduty/Geo-Historic_Word_Valuation

🔤 A new original text mining and information retrieval method.

Size: 12.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

iiTzMohit/YTQueryBot

YTQueryBot is a web app that answers questions about YouTube videos. It uses Streamlit for the UI, LangChain for transcript processing, and OpenAI for generating responses from video data.

Language: Python - Size: 29.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

LeonardoSaccotelli/Numerical-Methods-For-Computer-Science

Basic and advanced linear algebra and numerical problems, numerical algorithms, and techniques with multiple applications in the field of Computer Science.

Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

rootguillen/Patent-Search-System-with-Gradio

Developed by Gyudong HAN, Counsellor, WIPO. This system was developed with reference to the general text retrieval system that was uploaded together with the video clip named "LangChain Retrieval QA Over Multiple Files with ChromaDB". I only added the Gradio implementation for its UI.

Language: Jupyter Notebook - Size: 35.2 KB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

hriaz17/hw1-python

A template repository for the Python version of Homework 1 from CSC 483-583: Text Retrieval and Web Search taught by Prof. Mihai Surdeanu & Haris Riaz, Spring 2024

Language: Python - Size: 351 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hriaz17/hw1-java

A template repository for the Java version of Homework 1 from CSC 483-583: Text Retrieval and Web Search taught by Mihai Surdeanu & Haris Riaz, Spring 2024

Language: Java - Size: 358 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VedangW/upr-kilt

Unsupervised Passage Retrieval for Question Answering, Fact Checking, and Entity Linking on the KILT benchmark using the T5 language model series.

Language: Python - Size: 82 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

avojak/oise

Open IRC Search Engine

Language: Java - Size: 166 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Adarsh-sophos/Smart-Library

Identifying Books on Library Shelves using Supervised Deep Learning.

Language: Jupyter Notebook - Size: 469 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

DavidCrgh/RIT_Progra1_2018

Man page search engine using the vector space model and BM25.

Language: C# - Size: 97.7 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

chen0040/java-text-retrieval

Text retrieval framework that implements vector space model and language model

Language: Shell - Size: 47.9 KB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

Related Topics
information-retrieval (9), python (8), machine-learning (7), text-mining (5), deep-learning (5), image-retrieval (4), semantic-search (4), sentiment-analysis (3), python3 (3), text-summarization (3), language-model (3), nlp-machine-learning (3), search-engine (3), nlp (3), precision-recall (2), vector-database (2), chatgpt (2), faiss (2), text (2), search (2), langchain (2), text-classification (2), text-matching (2), chatbot (2), vision-language-model (2), vision-language (2), chromadb (2), clip (2), multimodal (2), openai (2), elastic-search (2), database (2), text-embedding (2), tf-idf (2), similarity-search (2), gpt-3 (2), university-of-arizona (2), lucene (2), word-classification (1), word-classifier (1), word-sentiment (1), word-sorting (1), word-value (1), chatgpt-api (1), context (1), datastore (1), embedding-similarity (1), embeddings (1), semantic-similarity (1), text-analytics (1), compostional (1), computer-vision (1), cvpr2024 (1), fine-grained (1), language (1), robustness (1), vision (1), question-generation (1), speech-to-text (1), bm25-plus (1), document-reranking (1), sentence-embeddings (1), data-mining (1), openai-api (1), vector-space-model (1), classification (1), geo-historic (1), information (1), ordering (1), sentiment (1), sorting (1), text-info (1), contrastive-learning (1), iclr2025 (1), image-classification (1), inter-modal (1), intra-modal (1), intra-modal-misalignment (1), modality-gap (1), modality-inversion (1), oti (1), ovi (1), siglip (1), textual-inversion (1), visual-inversion (1), vlm (1), ai (1), chatgpt-information-retrieval (1), chatgpt-ir (1), data-augmentation (1), dataset (1), gpt2 (1), gpt3 (1), information-retrieval-chatgpt (1), ir (1), ir-chatgpt (1), sequence-to-sequence (1), artificial-intelligence (1), data-science (1), gpt-35-turbo (1)