Topic: "text-search"
lucaong/minisearch
Tiny and powerful JavaScript full-text search engine for browser and Node
Language: TypeScript - Size: 5.29 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 5,276 - Forks: 145

toolgood/ToolGood.Words
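A high-performance sensitive-word (illegal words/profanity) detection and filtering component, with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese-character-to-Pinyin conversion, fuzzy search, and other features.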
Language: JavaScript - Size: 102 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 4,927 - Forks: 865

zombodb/zombodb
Making Postgres and Elasticsearch work together like it's 2023
Language: PLpgSQL - Size: 381 MB - Last synced at: 21 days ago - Pushed at: 4 months ago - Stars: 4,737 - Forks: 214

ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
Language: Python - Size: 83.4 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 3,045 - Forks: 373

unum-cloud/usearch
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Language: C++ - Size: 4.28 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2,766 - Forks: 188

vrothberg/vgrep
a user-friendly pager for grep
Language: Go - Size: 7.53 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 680 - Forks: 41

ad-freiburg/qlever
Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
Language: C++ - Size: 195 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 581 - Forks: 79

cjinhuo/text-search-engine
A text search engine that supports mixed Chinese and English fuzzy search.
Language: TypeScript - Size: 3.93 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 445 - Forks: 17

taleinat/fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
Language: Python - Size: 997 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 319 - Forks: 25

skim-rs/fuzzy-matcher
Fuzzy Matching Library for Rust
Language: Rust - Size: 66.4 KB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 280 - Forks: 19

weareprestatech/hotpdf
hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six
Language: Python - Size: 16.5 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 194 - Forks: 9

mikegoatly/lifti
A lightweight full text indexer for .NET
Language: C# - Size: 74.5 MB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 190 - Forks: 9

unruledboy/WuManber
WuManber text search/matching implementation using C#
Language: C# - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 152 - Forks: 27

Nukesor/sticker-finder 📦
⚡ A Telegram bot for searching all the stickers (just like @gif).
Language: Python - Size: 45.1 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 97 - Forks: 21

ddangelov/RESTful-Top2Vec
Expose a Top2Vec model with a REST API.
Language: Python - Size: 243 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 90 - Forks: 20

dav009/flash
Golang Keyword extraction/replacement Datastructure using Tries instead of regexes
Language: Go - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 89 - Forks: 6

DocSavage/appengine-search 📦
Simple full text search demo for Google App Engine
Language: Python - Size: 165 KB - Last synced at: 2 months ago - Pushed at: almost 16 years ago - Stars: 78 - Forks: 13

petar-dambovaliev/aho-corasick
efficient string matching in Golang via the aho-corasick algorithm.
Language: Go - Size: 35.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 74 - Forks: 11

oracle/soda-for-java
SODA (Simple Oracle Document Access) for Java is an Oracle library for writing Java apps that work with JSON (and not only JSON!) in the Oracle Database. SODA allows your Java app to use the Oracle Database as a NoSQL document store.
Language: Java - Size: 19.4 MB - Last synced at: 23 days ago - Pushed at: about 2 months ago - Stars: 68 - Forks: 23

lggruspe/slipbox
A static site generator for Zettelkasten notes
Language: Python - Size: 7.72 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 64 - Forks: 5

Schlechtwetterfront/fuzzy-rs
Fuzzy matching algorithm based on Sublime Text's string search.
Language: Rust - Size: 45.9 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 60 - Forks: 3

xndien2004/LLM_Powered_Video_Search
[SOICT 2024] LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System
Language: Jupyter Notebook - Size: 5.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 53 - Forks: 1

zayedrais/DocumentSearchEngine
Document Search Engine project with TF-IDF and Google universal sentence encoder model
Language: Jupyter Notebook - Size: 28.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 24

Lipairui/textgo
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Language: Python - Size: 532 KB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 45 - Forks: 3

pilotpirxie/coresearch
🔎 .NET Core cross-platform, in-memory, full text search library for building search engines. Made to learn C#.
Language: C# - Size: 133 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 44 - Forks: 27

tensorchord/vechord
Turn PostgreSQL into your search engine in a Pythonic way.
Language: Python - Size: 315 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 43 - Forks: 4

jankovicsandras/plpgsql_bm25
BM25 search implemented in PL/pgSQL
Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 42 - Forks: 0

jermp/autocomplete
Efficient and effective query auto-completion in C++.
Language: C++ - Size: 1.59 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 6

alessiosavi/GoGPUtils
Enhance productivity and avoid reinventing the wheel every time you start a Go project
Language: Go - Size: 1.95 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 35 - Forks: 6

flolu/mongo-search
Fuzzy Text Search And Autocompletion With MongoDB And Node.js
Language: TypeScript - Size: 554 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 9

taleinat/levenshtein-search
A JavaScript library for fuzzy substring search.
Language: JavaScript - Size: 1.05 MB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 27 - Forks: 1

iceniveth/firestore-search
This source code is a reference for my post on Medium about Firebase Firestore Text Search and Pagination.
Language: HTML - Size: 329 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 9

wroberts/fsed
Aho-Corasick string replacement utility
Language: sed - Size: 503 KB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 6

KeremZaman/semantic-sh
semantic-sh is a SimHash implementation to detect and group similar texts by harnessing the power of word vectors and transformer-based language models (BERT).
Language: Python - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 3

jankovicsandras/bm25opt
faster BM25 search algorithms in Python
Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 20 - Forks: 1

CGJennings/fjs-string-matching
Official sample code for the very fast Franek-Jennings-Smyth (FJS) full text string search algorithm
Language: Java - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 19 - Forks: 4

mugiseyebrows/mugi-grep
Gui grep (and sed) with sessions and history
Language: C++ - Size: 351 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 4

VladPVS/FastSearchLibraryNETCore
A multithreaded library for quickly finding files or directories using different search criteria and for searching text files by their content.
Language: C# - Size: 154 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 11 - Forks: 6

chr1st1ank/narrow-down
Fast fuzzy text search
Language: Python - Size: 21.2 MB - Last synced at: 25 days ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 1

icflorescu/postgresql-tsearch-utils
A collection of files and patterns to improve PostgreSQL text search
Size: 3.91 KB - Last synced at: 2 months ago - Pushed at: almost 9 years ago - Stars: 11 - Forks: 2

CityofEdmonton/text_depot 📦
Search and analyze your text data
Language: R - Size: 17 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 10 - Forks: 1

kyr0/clientside-search
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
Language: TypeScript - Size: 1.58 MB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 0

rishpandey/snippeteer
A text snippet manager you always needed.
Language: HTML - Size: 4.63 MB - Last synced at: 27 days ago - Pushed at: almost 7 years ago - Stars: 10 - Forks: 2

unitycoder/FindInFiles
Easy & Free Tool for Searching Text in Files
Language: C# - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 3

zilliz-bootcamp/text_search
This project uses Milvus and Bert to build a text search system.
Language: TypeScript - Size: 2.56 MB - Last synced at: 23 days ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 3

willbchang/alfred-search-selection
One hotkey: Search selected text/image, open url(s) from selected text.
Language: Ruby - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 0

sadit/TextSearch.jl
Searching methods and models for textual data; it was designed to work with SimilaritySearch.jl
Language: Julia - Size: 2.82 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 5

JuliusBahr/SimpleSimilarity
A framework for semantic text search
Language: Swift - Size: 1.72 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 7 - Forks: 0

starnowski/posjsonhelper
The Posjsonhelper library is an open-source project that adds support for Hibernate queries for Postgresql JSON functions and full-text search.
Language: Java - Size: 1.63 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

pH-7/Free-Text-Search
This full-text search allows users to search real estate properties (present in Daft.ie) by just speaking or typing what they want. The engine will then parse and translate the words and return the correct data.
Language: PHP - Size: 558 KB - Last synced at: 2 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 2

valginer0/WinFindGrep
Multi-directory text search and replace utility for Windows with grep-like functionality
Language: C# - Size: 146 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 1

JohnPapad/Mini-Search-Engine
A Mini Search Engine in C++, using an inverted index and a trie.
Language: C++ - Size: 2.62 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

danielgatis/search_in_files
This is a tool for searching text in files.
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 7 days ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 2

dominikbraun/haystack
A fast & simple text search across files.
Language: Rust - Size: 222 KB - Last synced at: 7 days ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 0

Paulanerus/TextExplorer
A tool designed for the exploration, analysis, and comparison of textual data variants.
Language: Kotlin - Size: 590 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3 - Forks: 0

Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text
A video search engine combining OCR, ASR, CLIP, Image Captioning, Object & Color Detection. It enables accurate retrieval based on text, speech, images, objects, and colors in video content.
Size: 20.9 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

Tgentil/PDF-text-Search
This is a Python script that searches for a specific text within all the PDF files in a folder.
Language: Python - Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

astrowonk/mastodon_archive_reader
Read a mastodon archive and create a sqlite3 database of archived post content
Language: Python - Size: 18.6 KB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

GarbageMountain/seaq
Plug and Play ES6 Text Search
Language: TypeScript - Size: 2.3 MB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

ShihabYasin/simple-django-full-text-searching
Basic and Full-text Search in Django
Language: Python - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

jaytimm/textsearch
Tools for corpus search in context.
Language: R - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

jdvala/kawadi
kawadi is my collection of tools that I need most frequently
Language: Python - Size: 2.85 MB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

ox-vgg/vgg_text_search
(MIRROR) Text search engine that runs on a local service. Includes a pipeline for preprocessing a user-defined image dataset.
Language: Python - Size: 48.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

AlexandrSachkov/LineCatcher
Line Catcher (LC) is an advanced programmable log parsing tool aimed to simplify and save time on viewing, searching, and parsing large text files
Language: C++ - Size: 1.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 1

rsusik/bfsi
Bloom Filter based Semi-Index (BFSI)
Language: C++ - Size: 35.2 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

planaria/cb_tree
A crit-bit tree implementation for C++
Language: C++ - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 2

isappit/ifile
"Documents Search Engine" based on Lucene for indexing and searching in many types of documents
Language: PHP - Size: 4.39 MB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

blackwinter/pms
Poor Man's Search
Language: Ruby - Size: 492 KB - Last synced at: 19 days ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

dishmint/LexicalCases
Extract substrings matching a lexical pattern
Language: Mathematica - Size: 60.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

marqo-ai/marqo-ecommerce-embeddings
State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP.
Language: Python - Size: 1.36 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

tomzox/trowser
Trowser is a browser for large line-oriented text files, implemented in 3 alternate programming languages: Tcl/Tk, Python and C++/Qt
Language: C++ - Size: 2.18 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

divyanshu-rawat/checkout-your-cart
Users of this application will be able to browse through products, add products to their cart, and even check out using the stripe API.
Language: JavaScript - Size: 8.73 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 4

patsie75/awk-fuzsearch
Fuzzy text search engine in awk
Language: Awk - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

ChrisVilches/Partial-Text-Search
A JavaScript library that finds string patterns in a collection of documents. It efficiently finds matches even if the words in each document do not begin with the query pattern.
Language: JavaScript - Size: 967 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

MarcoLucidi01/privately-search-selected
adds menu item to Firefox to privately search selected text
Language: JavaScript - Size: 32.2 KB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

TeamworkGuy2/FileManipulator
CLI tool for searching, removing, and replacing text in files
Language: Java - Size: 83 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

koudura/project-psaximo
final repository for fornax search engine library. [see psaximo.net and true.fornax.net for details.]
Language: HTML - Size: 102 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

junphine/ignite Fork of apache/ignite
Extension of Apache Ignite that adds features: MongoDB protocol, Gremlin protocol, vector search, full-text search, graph compute
Language: Java - Size: 469 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 1 - Forks: 0

bernardosulzbach/bigword
An application to find words in a dictionary from a multiset of letters
Language: C++ - Size: 159 KB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 2

remram44/ngram-search
Ngram-based indexing of strings into a binary file
Language: Rust - Size: 88.9 KB - Last synced at: about 2 hours ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

haritonov2/minigrep_rust
`minigrep_rust` is a simple Rust command-line tool for finding lines that contain a specific substring in text files.
Language: Rust - Size: 1000 Bytes - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

garawaa/ripgrep-gui
A desktop file search tool that uses ripgrep. Implemented using pyside6
Language: Python - Size: 20.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

raiyanyahya/fast-text-search
☄️ A Go package to do extremely fast concurrent text search across files and folders.
Language: Go - Size: 2.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

abhinav-nath/ecommerce-search
Language: Java - Size: 553 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 2

amiralv82/Grep
This Rust-based Text Search Tool empowers users to efficiently search for specific patterns within files and directories. Whether you need to find text within documents, explore directories, or perform advanced search operations, this versatile tool has you covered.
Language: Rust - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

MythicManiac/b64-regex
Build regex patterns to search through base64 encoded text.
Language: Python - Size: 185 KB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

rsusik/cf2
Approximate pattern matching with Counting Filter on q-grams using SSE instructions (CF2)
Language: C++ - Size: 11.7 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

frank-connolly/text-analyser-challenge
Coding challenge
Language: Java - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fabiothiroki/spring-pokemon-textsearch
A spring boot application implementing full-text search using Postgresql
Language: Java - Size: 2.41 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

Dimama/text_search_labs
Laboratory work on text search course
Language: Python - Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

AndySoundy/Search
A multi-threaded command line tool that lets you search for file names, text in files, or file types.
Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

rsusik/magetdc
MAG on ETDC
Language: C++ - Size: 87.9 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

rsusik/maga
MAG for approximate pattern matching (MAGA)
Language: C++ - Size: 697 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

rsusik/mag
Multi AOSO on q-Grams (MAG). One of the fastest multi-pattern matching algorithms.
Language: C++ - Size: 3.77 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

arindamnayak/solr-tutorial
Why and How Solr
Language: XSLT - Size: 478 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

ichtrojan/Text-Search-Using-Algolia
A text search integration using Algolia and Laravel
Language: PHP - Size: 784 KB - Last synced at: 14 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

za-arthur/pg_textparser
Fast parser for text search of PostgreSQL
Language: Lex - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

visalchea/find-in-files
Find text in file(s) of folder(s)
Language: C# - Size: 125 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

valginer0/WinFindGrepWebsite
A website for a multi-directory grep-like tool for Windows with GUI, WinFindGrep
Size: 759 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

lvce-editor/text-search-worker
Text Search Worker
Language: TypeScript - Size: 3.95 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0
