An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-search"

lucaong/minisearch

Tiny and powerful JavaScript full-text search engine for browser and Node

Language: TypeScript - Size: 5.29 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 5,276 - Forks: 145

toolgood/ToolGood.Words

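A high-performance component for detecting and filtering sensitive words (illegal words/profanity), with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese character to pinyin conversion, fuzzy search, and other features.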

Language: JavaScript - Size: 102 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 4,927 - Forks: 865

zombodb/zombodb

Making Postgres and Elasticsearch work together like it's 2023

Language: PLpgSQL - Size: 381 MB - Last synced at: 21 days ago - Pushed at: 4 months ago - Stars: 4,737 - Forks: 214

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python - Size: 83.4 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 3,045 - Forks: 373

unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Language: C++ - Size: 4.28 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2,766 - Forks: 188

vrothberg/vgrep

a user-friendly pager for grep

Language: Go - Size: 7.53 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 680 - Forks: 41

ad-freiburg/qlever

Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.

Language: C++ - Size: 195 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 581 - Forks: 79

cjinhuo/text-search-engine

A text search engine that supports mixed Chinese and English fuzzy search.

Language: TypeScript - Size: 3.93 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 445 - Forks: 17

taleinat/fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

Language: Python - Size: 997 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 319 - Forks: 25

skim-rs/fuzzy-matcher

Fuzzy Matching Library for Rust

Language: Rust - Size: 66.4 KB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 280 - Forks: 19

weareprestatech/hotpdf

hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six

Language: Python - Size: 16.5 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 194 - Forks: 9

mikegoatly/lifti

A lightweight full text indexer for .NET

Language: C# - Size: 74.5 MB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 190 - Forks: 9

unruledboy/WuManber

WuManber text search/matching implementation using c#

Language: C# - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 152 - Forks: 27

Nukesor/sticker-finder 📦

⚡ A Telegram bot for searching all the stickers (just like @gif).

Language: Python - Size: 45.1 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 97 - Forks: 21

ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

Language: Python - Size: 243 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 90 - Forks: 20

dav009/flash

Golang Keyword extraction/replacement Datastructure using Tries instead of regexes

Language: Go - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 89 - Forks: 6

DocSavage/appengine-search 📦

Simple full text search demo for Google App Engine

Language: Python - Size: 165 KB - Last synced at: 2 months ago - Pushed at: almost 16 years ago - Stars: 78 - Forks: 13

petar-dambovaliev/aho-corasick

efficient string matching in Golang via the aho-corasick algorithm.

Language: Go - Size: 35.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 74 - Forks: 11

oracle/soda-for-java

SODA (Simple Oracle Document Access) for Java is an Oracle library for writing Java apps that work with JSON (and not only JSON!) in the Oracle Database. SODA allows your Java app to use the Oracle Database as a NoSQL document store.

Language: Java - Size: 19.4 MB - Last synced at: 23 days ago - Pushed at: about 2 months ago - Stars: 68 - Forks: 23

lggruspe/slipbox

A static site generator for Zettelkasten notes

Language: Python - Size: 7.72 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 64 - Forks: 5

Schlechtwetterfront/fuzzy-rs

Fuzzy matching algorithm based on Sublime Text's string search.

Language: Rust - Size: 45.9 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 60 - Forks: 3

xndien2004/LLM_Powered_Video_Search

[SOICT 2024] LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System

Language: Jupyter Notebook - Size: 5.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 53 - Forks: 1

zayedrais/DocumentSearchEngine

Document Search Engine project with TF-IDF and Google Universal Sentence Encoder model

Language: Jupyter Notebook - Size: 28.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 24

Lipairui/textgo

Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

Language: Python - Size: 532 KB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 45 - Forks: 3

pilotpirxie/coresearch

🔎 .NET Core cross-platform, in-memory, full text search library for building search engines. Made to learn C#.

Language: C# - Size: 133 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 44 - Forks: 27

tensorchord/vechord

Turn PostgreSQL into your search engine in a Pythonic way.

Language: Python - Size: 315 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 43 - Forks: 4

jankovicsandras/plpgsql_bm25

BM25 search implemented in PL/pgSQL

Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 42 - Forks: 0

jermp/autocomplete

Efficient and effective query auto-completion in C++.

Language: C++ - Size: 1.59 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 6

alessiosavi/GoGPUtils

Enhance productivity and avoid reinventing the wheel every time you start a Go project

Language: Go - Size: 1.95 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 35 - Forks: 6

flolu/mongo-search

Fuzzy Text Search And Autocompletion With MongoDB And Node.js

Language: TypeScript - Size: 554 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 9

taleinat/levenshtein-search

A Javascript library for fuzzy substring search.

Language: JavaScript - Size: 1.05 MB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 27 - Forks: 1

iceniveth/firestore-search

This source code accompanies my Medium post about Firebase Firestore text search and pagination.

Language: HTML - Size: 329 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 9

wroberts/fsed

Aho-Corasick string replacement utility

Language: sed - Size: 503 KB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 6

KeremZaman/semantic-sh

semantic-sh is a SimHash implementation that detects and groups similar texts by harnessing the power of word vectors and transformer-based language models (BERT).

Language: Python - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 3

jankovicsandras/bm25opt

faster BM25 search algorithms in Python

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 20 - Forks: 1

CGJennings/fjs-string-matching

Official sample code for the very fast Franek-Jennings-Smyth (FJS) full text string search algorithm

Language: Java - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 19 - Forks: 4

mugiseyebrows/mugi-grep

Gui grep (and sed) with sessions and history

Language: C++ - Size: 351 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 4

VladPVS/FastSearchLibraryNETCore

A multithreaded library for quickly finding files or directories using different search criteria and for searching text files by their content.

Language: C# - Size: 154 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 11 - Forks: 6

chr1st1ank/narrow-down

Fast fuzzy text search

Language: Python - Size: 21.2 MB - Last synced at: 25 days ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 1

icflorescu/postgresql-tsearch-utils

A collection of files and patterns to improve PostgreSQL text search

Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: almost 9 years ago - Stars: 11 - Forks: 2

CityofEdmonton/text_depot 📦

Search and analyze your text data

Language: R - Size: 17 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 10 - Forks: 1

kyr0/clientside-search

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

Language: TypeScript - Size: 1.58 MB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 0

rishpandey/snippeteer

A text snippet manager you always needed.

Language: HTML - Size: 4.63 MB - Last synced at: 27 days ago - Pushed at: almost 7 years ago - Stars: 10 - Forks: 2

unitycoder/FindInFiles

Easy & Free Tool for Searching Text in Files

Language: C# - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 3

zilliz-bootcamp/text_search

This project uses Milvus and Bert to build a text search system.

Language: TypeScript - Size: 2.56 MB - Last synced at: 23 days ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 3

willbchang/alfred-search-selection

One hotkey: Search selected text/image, open url(s) from selected text.

Language: Ruby - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 0

sadit/TextSearch.jl

Searching methods and models for textual data; it was designed to work with SimilaritySearch.jl

Language: Julia - Size: 2.82 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 5

JuliusBahr/SimpleSimilarity

A framework for semantic text search

Language: Swift - Size: 1.72 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 7 - Forks: 0

starnowski/posjsonhelper

The Posjsonhelper library is an open-source project that adds support for Hibernate queries for Postgresql JSON functions and full-text search.

Language: Java - Size: 1.63 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

pH-7/Free-Text-Search

This full-text search allows users to search real estate properties (listed on Daft.ie) by just speaking or typing what they want. The engine then parses and translates the words and returns the correct data.

Language: PHP - Size: 558 KB - Last synced at: 2 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 2

valginer0/WinFindGrep

Multi-directory text search and replace utility for Windows with grep-like functionality

Language: C# - Size: 146 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 1

JohnPapad/Mini-Search-Engine

A Mini Search Engine in C++, using an inverted index and a trie.

Language: C++ - Size: 2.62 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

danielgatis/search_in_files

This is a tool for searching text in files.

Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 7 days ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 2

dominikbraun/haystack

A fast & simple text search across files.

Language: Rust - Size: 222 KB - Last synced at: 7 days ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

Paulanerus/TextExplorer

A tool designed for the exploration, analysis, and comparison of textual data variants.

Language: Kotlin - Size: 590 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3 - Forks: 0

Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text

A video search engine combining OCR, ASR, CLIP, Image Captioning, Object & Color Detection. It enables accurate retrieval based on text, speech, images, objects, and colors in video content.

Size: 20.9 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

Tgentil/PDF-text-Search

This is a Python script that searches for a specific text within all the PDF files in a folder.

Language: Python - Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

astrowonk/mastodon_archive_reader

Read a mastodon archive and create a sqlite3 database of archived post content

Language: Python - Size: 18.6 KB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

GarbageMountain/seaq

Plug and Play ES6 Text Search

Language: TypeScript - Size: 2.3 MB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

ShihabYasin/simple-django-full-text-searching

Basic and Full-text Search in Django

Language: Python - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

jaytimm/textsearch

Tools for corpus search in context.

Language: R - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

jdvala/kawadi

kawadi is my collection of tools that I need frequently

Language: Python - Size: 2.85 MB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

ox-vgg/vgg_text_search

(MIRROR) Text search engine that runs on a local service. Includes a pipeline for preprocessing a user-defined image dataset.

Language: Python - Size: 48.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

AlexandrSachkov/LineCatcher

Line Catcher (LC) is an advanced programmable log parsing tool aimed to simplify and save time on viewing, searching, and parsing large text files

Language: C++ - Size: 1.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 1

rsusik/bfsi

Bloom Filter based Semi-Index (BFSI)

Language: C++ - Size: 35.2 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

planaria/cb_tree

A crit-bit tree implementation for C++

Language: C++ - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 2

isappit/ifile

"Documents Search Engine" based on Lucene for indexing and searching many types of documents

Language: PHP - Size: 4.39 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

blackwinter/pms

Poor Man's Search

Language: Ruby - Size: 492 KB - Last synced at: 19 days ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

dishmint/LexicalCases

Extract substrings matching a lexical pattern

Language: Mathematica - Size: 60.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

marqo-ai/marqo-ecommerce-embeddings

State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP.

Language: Python - Size: 1.36 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

tomzox/trowser

Trowser is a browser for large line-oriented text files, implemented in 3 alternate programming languages: Tcl/Tk, Python and C++/Qt

Language: C++ - Size: 2.18 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

divyanshu-rawat/checkout-your-cart

Users of this application will be able to browse through products, add products to their cart, and even check out using the stripe API.

Language: JavaScript - Size: 8.73 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 4

patsie75/awk-fuzsearch

Fuzzy text search engine in awk

Language: Awk - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

ChrisVilches/Partial-Text-Search

A JavaScript library that finds string patterns in a collection of documents. It efficiently finds matches even if the words in each document do not begin with the query pattern.

Language: JavaScript - Size: 967 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

MarcoLucidi01/privately-search-selected

adds menu item to Firefox to privately search selected text

Language: JavaScript - Size: 32.2 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

TeamworkGuy2/FileManipulator

CLI tool for searching, removing, and replacing text in files

Language: Java - Size: 83 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

koudura/project-psaximo

final repository for fornax search engine library. [see psaximo.net and true.fornax.net for details.]

Language: HTML - Size: 102 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

junphine/ignite Fork of apache/ignite

An extension of Apache Ignite that adds features: MongoDB protocol, Gremlin protocol, vector search, full-text search, graph compute

Language: Java - Size: 469 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 1 - Forks: 0

bernardosulzbach/bigword

An application to find words in a dictionary from a multiset of letters

Language: C++ - Size: 159 KB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 2

remram44/ngram-search

Ngram-based indexing of strings into a binary file

Language: Rust - Size: 88.9 KB - Last synced at: about 2 hours ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

haritonov2/minigrep_rust

`minigrep_rust` is a simple Rust command-line tool for finding lines that contain a specific substring in text files.

Language: Rust - Size: 1000 Bytes - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

garawaa/ripgrep-gui

A desktop file search tool that uses ripgrep. Implemented using pyside6

Language: Python - Size: 20.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

raiyanyahya/fast-text-search

☄️ A go package to do extremely fast concurrent text search across files and folders.

Language: Go - Size: 2.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

abhinav-nath/ecommerce-search

Language: Java - Size: 553 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 2

amiralv82/Grep

This Rust-based Text Search Tool empowers users to efficiently search for specific patterns within files and directories. Whether you need to find text within documents, explore directories, or perform advanced search operations, this versatile tool has you covered.

Language: Rust - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

MythicManiac/b64-regex

Build regex patterns to search through base64 encoded text.

Language: Python - Size: 185 KB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

rsusik/cf2

Approximate pattern matching with Counting Filter on q-grams using SSE instructions (CF2)

Language: C++ - Size: 11.7 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

frank-connolly/text-analyser-challenge

Coding challenge

Language: Java - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fabiothiroki/spring-pokemon-textsearch

A spring boot application implementing full-text search using Postgresql

Language: Java - Size: 2.41 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

Dimama/text_search_labs

Laboratory work on text search course

Language: Python - Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

AndySoundy/Search

A multi-threaded command line tool that lets you search for file names, text in files, or file types.

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

rsusik/magetdc

MAG on ETDC

Language: C++ - Size: 87.9 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

rsusik/maga

MAG for approximate pattern matching (MAGA)

Language: C++ - Size: 697 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

rsusik/mag

Multi AOSO on q-Grams (MAG). One of the fastest multi-pattern matching algorithms.

Language: C++ - Size: 3.77 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

arindamnayak/solr-tutorial

Why and How Solr

Language: XSLT - Size: 478 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

ichtrojan/Text-Search-Using-Algolia

A text search integration using Algolia and Laravel

Language: PHP - Size: 784 KB - Last synced at: 14 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

za-arthur/pg_textparser

Fast parser for text search of PostgreSQL

Language: Lex - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

visalchea/find-in-files

Find text in file(s) of folder(s)

Language: C# - Size: 125 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

valginer0/WinFindGrepWebsite

A website for a multi-directory grep-like tool for Windows with GUI, WinFindGrep

Size: 759 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

lvce-editor/text-search-worker

Text Search Worker

Language: TypeScript - Size: 3.95 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0