Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: information-retrieval

gopala-kr/summary

summaries of all the papers I read

Size: 215 MB - Last synced: about 3 hours ago - Pushed: about 4 hours ago - Stars: 126 - Forks: 38

felladrin/MiniSearch

Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

Language: TypeScript - Size: 25 MB - Last synced: about 1 hour ago - Pushed: about 5 hours ago - Stars: 68 - Forks: 7

FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python - Size: 19.8 MB - Last synced: about 7 hours ago - Pushed: about 8 hours ago - Stars: 5,086 - Forks: 345

aplz/aplz.github.io

personal page

Language: HTML - Size: 19.4 MB - Last synced: about 8 hours ago - Pushed: about 9 hours ago - Stars: 0 - Forks: 0

YunaBraska/semver-info-action

Cleans, parses, and compares semantic versions, providing essential insights into versioning, stability, and compatibility, making software release management a breeze!

Language: TypeScript - Size: 25.6 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 2 - Forks: 0

kreeben/resin

Vector space index based search engine that's available as an HTTP service or as an embedded library.

Language: C# - Size: 63.3 MB - Last synced: about 11 hours ago - Pushed: about 11 hours ago - Stars: 563 - Forks: 39

weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.

Language: Go - Size: 961 MB - Last synced: about 10 hours ago - Pushed: about 11 hours ago - Stars: 9,633 - Forks: 639

miccunifi/CIRCO

[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset

Language: Python - Size: 568 KB - Last synced: about 13 hours ago - Pushed: about 14 hours ago - Stars: 36 - Forks: 1

shaoxiongji/knowledge-graphs

A collection of research on knowledge graphs

Language: JavaScript - Size: 199 KB - Last synced: about 8 hours ago - Pushed: over 1 year ago - Stars: 1,612 - Forks: 287

aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

Language: Python - Size: 54.2 MB - Last synced: about 4 hours ago - Pushed: about 24 hours ago - Stars: 166 - Forks: 16

N-G-Asker/TasteRank

TasteRank: Personalized Image Search and Recommendation. This research project proposes an AI-based method for scoring photos on relevance to user interests. TasteRank leverages language and vision models, including Mistral LLMs and OpenAI’s CLIP, and applies multimodal machine-learning techniques.

Language: Jupyter Notebook - Size: 1.76 MB - Last synced: about 18 hours ago - Pushed: about 19 hours ago - Stars: 0 - Forks: 0

nicolay-r/nicolay-r

Size: 30.3 KB - Last synced: about 23 hours ago - Pushed: about 24 hours ago - Stars: 0 - Forks: 0

castorini/anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Language: Java - Size: 88.3 MB - Last synced: about 24 hours ago - Pushed: 1 day ago - Stars: 982 - Forks: 413

ashvardanian/SimSIMD

Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐

Language: C - Size: 685 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 727 - Forks: 35

resentful1/Noxious-Stealer

Simple Discord Stealer made in python

Language: Python - Size: 49.8 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 1 - Forks: 0

jzhoubu/VDR

VDR: Vocabulary Disentangled Retrieval (ICLR2024)

Language: Python - Size: 4.21 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 30 - Forks: 0

naiveHobo/InvoiceNet

Deep neural network to extract intelligent information from invoice documents.

Language: Python - Size: 43.9 MB - Last synced: 1 day ago - Pushed: 6 days ago - Stars: 2,395 - Forks: 381

hscells/pybool_ir

Toolkit for domain-specific information retrieval experimentation

Language: Python - Size: 1.4 MB - Last synced: about 12 hours ago - Pushed: 1 day ago - Stars: 17 - Forks: 2

muazhari/research-assistant-infrastructure

Research Assistant Infrastructure

Language: Dockerfile - Size: 102 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 1 - Forks: 0

enstit/Steflix

Recommender System project that uses Weighted Matrix Factorisation to learn user and item embeddings from a (sparse) feedback matrix, and uses them to make user-specific suggestions

Language: Python - Size: 1.29 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 0 - Forks: 0

luisrodriguesphd/resume-worth

This repository offers the ResumeWorth app, aimed at helping individuals find their true market value through AI-powered analysis of their resumes. It combines advanced AI with market data to provide personalized salary ranges, job matches, and resume-job match explanations, so users can optimize their earning potential.

Language: Jupyter Notebook - Size: 689 KB - Last synced: about 22 hours ago - Pushed: 1 day ago - Stars: 3 - Forks: 0

YunaBraska/git-info-action

Instant insights into the latest changes and commits. Provides valuable outputs such as ticket number detection, breaking changes, latest branch & commit & tag information, variety of programming languages and conventions.

Language: TypeScript - Size: 26.9 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 3 - Forks: 1

YunaBraska/java-info-action

Fast Maven/Gradle parser. This dynamic GitHub action automatically detects and extracts crucial information such as Java version, project version, and encoding. It also provides essential build commands and properties to make your development process more independent, efficient and streamlined.

Language: TypeScript - Size: 36.7 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 2 - Forks: 1

kajallochab/Link-Relevance-Ranking

Implementing HITS and PageRank Algorithms

Language: Python - Size: 13.7 KB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 0 - Forks: 0

Agrover112/awesome-semantic-search

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

Size: 371 KB - Last synced: about 1 hour ago - Pushed: 5 months ago - Stars: 321 - Forks: 28

Aquila-Network/aquila

An easy-to-use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.

Language: HTML - Size: 1.5 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 374 - Forks: 26

sharmilathirumalai/TF-IDF

IR implemented using the TF-IDF method

Language: Java - Size: 10.7 MB - Last synced: 2 days ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

dmeoli/CranSearchEngine

Search Engine for the Cranfield Collection

Language: Java - Size: 4.71 MB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

relari-ai/continuous-eval

Open-Source Evaluation for GenAI Application Pipelines

Language: Python - Size: 1.88 MB - Last synced: 2 days ago - Pushed: 9 days ago - Stars: 318 - Forks: 13

edoardottt/csprecon

Discover new target domains using Content Security Policy

Language: Go - Size: 6.21 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 315 - Forks: 36

edoardottt/scilla

Information Gathering tool - DNS / Subdomains / Ports / Directories enumeration

Language: Go - Size: 31.4 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 747 - Forks: 96

NEOS-AI/Neosearch

AI-based search engine done right

Language: HTML - Size: 72.8 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 5 - Forks: 0

danyocummings/depth

An OSINT Multi-tool

Language: Python - Size: 8.79 KB - Last synced: 2 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

pisa-engine/pisa

PISA: Performant Indexes and Search for Academia

Language: C++ - Size: 35.4 MB - Last synced: 2 days ago - Pushed: 21 days ago - Stars: 863 - Forks: 61

henrypp/errorlookup

Simple tool for retrieving information about Windows error codes.

Language: C - Size: 2.76 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 209 - Forks: 40

snap-stanford/stark

Official Code of "STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases"

Language: Python - Size: 5.84 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 197 - Forks: 20

BirdsAreFlyingCameras/BirdGlance

A simple Python script that takes a URL and gathers helpful information for web development and penetration testing

Language: Python - Size: 586 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

anakin87/content-collection

Collection of content I have created and shared over time

Language: Python - Size: 104 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

NikosBakalis/Document_Similarity

University project: a program that shows the percentage similarity between the documents you import

Language: Python - Size: 422 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 1 - Forks: 0

xlang-ai/instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language: Python - Size: 170 MB - Last synced: 3 days ago - Pushed: 16 days ago - Stars: 1,714 - Forks: 125

mhbashari/awesome-persian-nlp-ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Size: 192 KB - Last synced: about 8 hours ago - Pushed: 6 months ago - Stars: 700 - Forks: 113

kajallochab/CulinaryCanvas

Culinary Canvas is an Information Retrieval System course project aimed at creating an advanced recipe search algorithm. The project utilizes techniques such as topic modeling, keyword extraction, and similarity scoring to provide users with relevant recipes based on their input.

Language: Jupyter Notebook - Size: 44.9 KB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 1 - Forks: 0

codetalker7/ColBERT.jl

Efficient late-interaction retrieval systems in Julia!

Language: Julia - Size: 98.6 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

NTMC-Community/awesome-neural-models-for-semantic-match

A curated list of papers dedicated to neural text (semantic) matching.

Language: HTML - Size: 158 KB - Last synced: 4 days ago - Pushed: 5 months ago - Stars: 770 - Forks: 125

KittyKatt/screenFetch

Fetches system/theme information in terminal for Linux desktop screenshots.

Language: Shell - Size: 4.37 MB - Last synced: 4 days ago - Pushed: about 1 month ago - Stars: 3,745 - Forks: 443

catalyst-team/catalyst

Accelerated deep learning R&D

Language: Python - Size: 52.6 MB - Last synced: about 15 hours ago - Pushed: about 2 months ago - Stars: 3,230 - Forks: 385

NoHaxito/deploys-top

Search & compare free and paid providers. Find the best option for your needs quickly and easily!

Language: TypeScript - Size: 1.03 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 3 - Forks: 1

QiushiSun/DaSE-Information-Retrieval-2021

DaSE-Information-Retrieval-2021

Language: Jupyter Notebook - Size: 38.3 MB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0

momegas/megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵

Language: Python - Size: 141 KB - Last synced: 3 days ago - Pushed: 11 months ago - Stars: 335 - Forks: 36

nicolo-urbani/FactFinder

Fact Finder - a Fact Search Engine

Language: Jupyter Notebook - Size: 7.63 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python - Size: 154 MB - Last synced: 4 days ago - Pushed: about 1 month ago - Stars: 22,052 - Forks: 2,939

Blake-Madden/OleanderStemmingLibrary

Porter stemming library (C++)

Language: C++ - Size: 1.07 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 47 - Forks: 25

vladislavpyatnitskiy/financial.data.scraping

Repository with capabilities to get data

Language: R - Size: 236 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0

navraj213/FiND

Scripts and packages for the FiND website

Language: HTML - Size: 326 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0

ict-bigdatalab/awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

Size: 437 KB - Last synced: 1 day ago - Pushed: 4 months ago - Stars: 596 - Forks: 44

microsoft/rag-experiment-accelerator

The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and the RAG pattern.

Language: Python - Size: 3.84 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 76 - Forks: 19

KID-22/LLM-IR-Bias-Fairness-Survey

This is the repo for the survey of Bias and Fairness in IR with LLMs.

Size: 829 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 15 - Forks: 1

KonradSzafer/hugging-face-qa-bot

Open source Hugging Face Question Answering Bot to aid users in developing and troubleshooting ML solutions.

Language: Python - Size: 255 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 40 - Forks: 5

rahulrajpl/netizenship

A command-line #OSINT tool to find the online presence of a username on popular social media websites like Facebook, Instagram, Twitter, etc.

Language: Python - Size: 7.12 MB - Last synced: 2 days ago - Pushed: over 1 year ago - Stars: 45 - Forks: 12

cyberboysumanjay/GaanaAPI

Unofficial Gaana API

Language: Python - Size: 17.6 KB - Last synced: 5 days ago - Pushed: over 3 years ago - Stars: 106 - Forks: 57

kevspa/proginfo 📦

A console app that provides useful information about various programming languages (in progress)

Language: Nim - Size: 18.9 MB - Last synced: 5 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

ContextualAI/gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook - Size: 4.8 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 390 - Forks: 26

Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

Language: JavaScript - Size: 3.39 MB - Last synced: 4 days ago - Pushed: about 1 year ago - Stars: 699 - Forks: 50

oaqa/FlexNeuART

Flexible classic and NeurAl Retrieval Toolkit

Language: Java - Size: 36.3 MB - Last synced: 2 days ago - Pushed: 23 days ago - Stars: 208 - Forks: 33

usnistgov/KAIROS

Scoring and analysis software for the evaluation of Knowledge Directed Artificial Intelligence Reasoning Over Schemas (KAIROS)

Language: Python - Size: 273 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 5 - Forks: 1

thepushkarp/nalcos

Search Git commits in natural language

Language: Python - Size: 408 KB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 53 - Forks: 8

naver/splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language: Python - Size: 3.1 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 644 - Forks: 77

Twenkid/Vsy-Jack-Of-All-Trades-AGI-Bulgarian-Internet-Archive-And-Search-Engine

Artificial General Intelligence Infrastructure of "The Sacred Computer" AGI Institute: Custom Intelligent Selective Internet Archiving and Exploration/Crawling; Information Retrieval, Media Monitoring, Search Engine, Smart DB, Data Preservation, Knowledge Extraction, Dataset Creation, Building and Testing AI Generative Models, Experiments, etc.

Language: Jupyter Notebook - Size: 13.2 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 5 - Forks: 0

GeneDx/txt2hpo

Python library for extracting HPO encoded phenotypes from text

Language: Python - Size: 44.7 MB - Last synced: 6 days ago - Pushed: 17 days ago - Stars: 22 - Forks: 4

apache/solr-sandbox

Apache Solr open-source search software plugin modules sandbox

Language: Java - Size: 13.9 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 7 - Forks: 10

georgms/information-retrieval

Introduction to Information Retrieval

Language: JavaScript - Size: 90.6 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 4

weezymatt/Retrieval-with-Wordle

Discover a clever strategy for mastering Wordle! Our project dives into various Information Retrieval techniques to efficiently guess the daily word—all aiming for Wordle domination!

Language: Python - Size: 118 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 0 - Forks: 0

guillaC/SQLiteDiskExplorer

SQLiteDiskExplorer enables you to explore, catalog, and batch extract SQLite files from disks and removable media.

Language: C# - Size: 386 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 11 - Forks: 0

AmenRa/ranx

⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍

Language: Python - Size: 34.5 MB - Last synced: 7 days ago - Pushed: 11 days ago - Stars: 348 - Forks: 21

terrier-org/terrier-core

Terrier IR Platform

Language: Java - Size: 7.48 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 241 - Forks: 63

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 2.08 MB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 1,303 - Forks: 127

oroszgy/awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

Size: 110 KB - Last synced: 6 days ago - Pushed: 6 months ago - Stars: 207 - Forks: 18

gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language: OpenEdge ABL - Size: 384 MB - Last synced: 4 days ago - Pushed: over 3 years ago - Stars: 503 - Forks: 149

QpxDesign/quail-api

As seen in TREC 2023, a QA-First, Hallucination-Lite, Multi-LM Summarizer

Language: Python - Size: 19.5 KB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0

NicholasMamo/multiplex-plot

Multiplex: visualizations that tell stories—A Python library to create and annotate beautiful network graph visualizations, text visualizations and more.

Language: Python - Size: 94.2 MB - Last synced: 8 days ago - Pushed: over 1 year ago - Stars: 104 - Forks: 15

penguineer/cleanURI-webui

WebUI for the cleanURI service.

Language: JavaScript - Size: 3.27 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 1

danielme85/simple-server-info

Get CPU information and load. Memory and storage/volume usage and information. Made with efficiency and simplicity in mind.

Language: PHP - Size: 38.1 KB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0

heinrichreimer/grimjack

🤺 Argument retrieval using axiomatic re-ranking and query reformulation.

Language: TeX - Size: 3.13 MB - Last synced: 8 days ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

penguineer/cleanURI-apigateway

This is the API gateway for the cleanURI service.

Language: Java - Size: 79.1 KB - Last synced: 8 days ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

penguineer/cleanURI

URL reduction and meta-data enrichment.

Size: 12.7 KB - Last synced: 8 days ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

quan-to/go-vsm

Vector Space Model implementation in Go

Language: Go - Size: 33.2 KB - Last synced: 8 days ago - Pushed: almost 4 years ago - Stars: 10 - Forks: 1

halegreen/IR_system_form_scratch

Building an IR&IE system from scratch.

Language: Java - Size: 284 KB - Last synced: 8 days ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

rafalposwiata/pl-mteb

PL-MTEB: Polish Massive Text Embedding Benchmark

Language: Python - Size: 177 KB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 5 - Forks: 0

dhdaines/serafim

SystÈme de Recherche Adélois pour Fouiller dans les Informations Municipales (Adélois search system for digging through municipal information)

Language: TypeScript - Size: 79.6 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

yaph/gh-commit-locations

Scripts used for analyzing GitHub commit locations to create a map visualization

Language: Python - Size: 179 KB - Last synced: 8 days ago - Pushed: almost 12 years ago - Stars: 3 - Forks: 4

piskvorky/gensim

Topic Modelling for Humans

Language: Python - Size: 101 MB - Last synced: 8 days ago - Pushed: 15 days ago - Stars: 15,255 - Forks: 4,345

DerYeger/gir-wt-2021-2022 📦

Solutions for the GIR course of the TU Wien from the WT 2021/22.

Language: Python - Size: 819 MB - Last synced: 8 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

ainsleyclark/nlp

NLP (Natural Language Processing) API via the pke (Python Keyphrase Extraction) engine.

Language: Python - Size: 48.8 KB - Last synced: 8 days ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1

razrez/nuget-master

VS Code plugin for NuGet package recommendations based on a textual description of the project you are developing.

Language: Jupyter Notebook - Size: 848 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

uutils/platform-info

A cross-platform way to get information about your machine

Language: Rust - Size: 171 KB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 77 - Forks: 23

webis-de/ir_axioms

↕️ Intuitive axiomatic retrieval experimentation.

Language: Python - Size: 1.43 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 21 - Forks: 1

PRITHIVSAKTHIUR/Medical-Term-Article-Search

HealthCare-Informatics-MediSearch

Language: Python - Size: 136 KB - Last synced: 7 days ago - Pushed: 9 days ago - Stars: 4 - Forks: 0

AmenRa/retriv

A Python Search Engine for Humans 🥸

Language: Python - Size: 372 KB - Last synced: 8 days ago - Pushed: 17 days ago - Stars: 156 - Forks: 18

Kekkodf/WBB-QueryObfuscation

Repository of the Paper "Words Blending Boxes. Obfuscating Queries in Information Retrieval using Differential Privacy."

Language: Python - Size: 14.3 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0

castorini/pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language: Python - Size: 7.45 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 1,460 - Forks: 326

Related Keywords
information-retrieval (2,050), python (317), search-engine (281), nlp (242), natural-language-processing (198), machine-learning (193), tf-idf (133), deep-learning (95), java (88), python3 (84), information-extraction (77), inverted-index (77), search (70), lucene (68), question-answering (68), bm25 (64), vector-space-model (59), elasticsearch (57), indexing (56), text-mining (55), recommender-system (50), cosine-similarity (47), data-mining (47), nltk (45), ir (43), crawler (43), semantic-search (42), data-science (41), text-classification (38), bert (37), language-model (37), pytorch (37), nlp-machine-learning (36), information-gathering (35), pagerank (34), solr (33), llm (33), flask (31), boolean-retrieval (30), artificial-intelligence (30), transformers (29), ranking (29), clustering (29), tfidf (28), dataset (26), word2vec (26), large-language-models (26), neural-network (25), knowledge-graph (24), information (23), text-processing (23), ai (23), sentiment-analysis (22), learning-to-rank (21), retrieval-augmented-generation (21), rag (20), search-algorithm (20), stemming (20), chatbot (20), query-expansion (20), django (19), wikipedia (19), computer-vision (18), docker (18), trec (18), classification (18), evaluation (17), pandas (17), dense-retrieval (17), tokenization (16), retrieval (16), embeddings (16), keyword-extraction (16), vector-search (16), summarization (16), recommendation-system (16), osint (16), topic-modeling (16), golang (15), research (15), stemmer (15), query (15), information-retrieval-engine (14), webscraping (14), crawling (14), linux (14), twitter (14), javascript (14), chatgpt (14), jupyter-notebook (13), pagerank-algorithm (13), nodejs (13), word-embeddings (13), web-scraping (13), html (13), neural-search (13), preprocessing (13), ranking-algorithm (13), collaborative-filtering (13), react (13)