An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: approximate-string-matching

wolfgarbe/SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: C# - Size: 12 MB - Last synced at: 3 days ago - Pushed at: 25 days ago - Stars: 3,230 - Forks: 303

proycon/analiticcl

an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction

Language: Rust - Size: 2.26 MB - Last synced at: about 15 hours ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 4

mammothb/symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: Python - Size: 5.93 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 822 - Forks: 122

jlgregorio/tkinter-search-engine-python

Search engine app' for easy information retrieval in "databases" or spreadsheet collections

Language: Python - Size: 3.48 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

fiatjaf/search-comparison

visualize { fast-fuzzy, fuzzyset, sifter, lunr, fuse, fuzzy } results at the same time

Language: JavaScript - Size: 479 KB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 54 - Forks: 11

searchhub/preDict

Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts

Language: Java - Size: 6.04 MB - Last synced at: 9 days ago - Pushed at: almost 7 years ago - Stars: 81 - Forks: 11

universal-automata/liblevenshtein-java

Various utilities regarding Levenshtein transducers. (Java)

Language: Java - Size: 5.83 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 57 - Forks: 20

hzeng-otterai/ffsearch

Fast fuzzy string search

Language: C++ - Size: 82 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 4

ChrisMuir/refinr

Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms

Language: C++ - Size: 287 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 104 - Forks: 5

wolfgarbe/SymSpellCompound

SymSpellCompound: compound aware automatic spelling correction

Size: 646 KB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 66 - Forks: 16

CMU-SAFARI/GenASM

Source code for the software implementations of the GenASM algorithms proposed in our MICRO 2020 paper: Senol Cali et. al., "GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis" at https://people.inf.ethz.ch/omutlu/pub/GenASM-approximate-string-matching-framework-for-genome-analysis_micro20.pdf

Language: C - Size: 44.9 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 7

8573/fuzzy-regex.rs

A @rust-lang library aiming to provide regexes with fuzzy matching

Language: Rust - Size: 9.77 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

dbahrdt/srtree

R+-Tree with augmentations to support Textual-Spatial-Queries

Language: C++ - Size: 245 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Avinash793/String-matching-algorithms-and-applications

Implementation of various string matching algorithms and their experimental analysis. Contains paper describing different string matching algorithms with their time and space complexities.

Language: C++ - Size: 504 KB - Last synced at: 12 days ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 6

ifazk/mula

ML's radishal Universal Levenshtein Automata library.

Language: OCaml - Size: 121 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

fmmfonseca/completely

Java autocomplete library.

Language: Java - Size: 290 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 107 - Forks: 19

lksark/Aproximate-String-Matching-multithreading

Search Approximate String Matching using multithreading

Language: C# - Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

vikhari/Sim-Find 📦

SimFind is a database approach to similarity search (approximate matching).

Language: C++ - Size: 18.7 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

sgrpanchal31/SymSpell

This repo contains the Python 3 compatible code for SymSpell algorithm

Language: Python - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

julianthome/prex

Approximate Regular Expression Matcher

Language: Java - Size: 65.4 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

joaopaulq/pmt

📄 a command-line interface for pattern matching

Language: C++ - Size: 29.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

wolfgarbe/LinSpell

Fast approximate strings search & spelling correction

Language: C# - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 55 - Forks: 6

suamin/PyNemex 📦

Approximate Entity Matching

Language: Python - Size: 150 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

uxter/fuzzysearch

:eyeglasses: Implementing a Fuzzy Search Algorithm in Java with possible to extend by adding similarity calculation strategy

Language: Java - Size: 76.2 KB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

colemanator/papyrus-api

Fuzzy search for the Bible

Language: Rust - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 2

kaiyoo/NLP-Detect-Lexical-Blends

Detect lexical blending using similarity by approximate string matching and word vectors

Language: Python - Size: 1.33 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

abhabongse/fuzzymatch-go

Approximate string matching library implemented in Go language

Language: Go - Size: 168 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

Bohdan-Khomtchouk/Smalls

miRNA-UTR approximate matching algorithms in R/Python/Bash

Language: Python - Size: 16.8 MB - Last synced at: about 2 years ago - Pushed at: almost 9 years ago - Stars: 4 - Forks: 1

smanurung/blendtor

Lexical blend word detector

Language: Python - Size: 20.5 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ward/bitap

Language: Rust - Size: 447 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Edarke/NearestNeighbors

Nearest Neighbor Implementations for Java

Language: Java - Size: 434 KB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

ggabriel96/mapnames

This is the repository for the code of my Computer Science bachelor's degree final paper.

Language: Python - Size: 11.2 MB - Last synced at: about 6 hours ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

AndreyBronin/spellchecker

Simple spellchecker uses approximate string matching(fuzzy search) algorithm based on Levenshtein distance between two strings

Language: C++ - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Related Keywords
approximate-string-matching 33 fuzzy-search 17 fuzzy-matching 11 edit-distance 9 spelling-correction 8 levenshtein-distance 8 damerau-levenshtein 6 levenshtein 6 spellcheck 6 spell-check 5 string-matching 4 spelling 4 symspell 4 nlp 3 word-segmentation 3 natural-language-processing 2 java 2 cpp 2 string-algorithms 2 computational-biology 2 bioinformatics 2 machine-learning 2 regex 2 trie 2 regular-expression 2 rust 2 string 2 search-engine 2 autocomplete 2 python 2 text-segmentation 2 chinese-word-segmentation 2 chinese-text-segmentation 2 fuzzy-string-matching 1 multithreaded 1 approximate-matching 1 csharp 1 text 1 suffix-tree 1 library 1 indexing 1 ocaml 1 levenshtein-automaton 1 suffix-tries 1 rabin-karp-algorithm 1 kmp-algorithm 1 hybrid-string 1 c-plus-plus 1 brute-force 1 boyer-moore-algorithm 1 automata 1 aho-corasick-algorithm 1 spatial-queries 1 r-tree 1 suffix-array 1 stable-marriage 1 qgram-distance 1 min-cost-flow 1 bipartite-graphs 1 nearest-neighbor-search 1 classification 1 bitap 1 utr-regions 1 mirna 1 golang 1 wordvectors 1 jaro-winkler-similarity 1 jaro-winkler-distance 1 jaro-winkler 1 bible 1 api 1 sorting-algorithm 1 search 1 java-8 1 pattern-matching 1 speedtest 1 similarity-search 1 fuzzy-hashes 1 rust-library 1 string-search 1 prefix-tree 1 entity-search 1 entity-linking 1 universal-automata 1 levenshtein-automata 1 information-retrieval 1 genomics 1 finite-state-transducer 1 finite-state-automata 1 distance-metric 1 dictionary 1 data-science 1 computer-science 1 string-similarity 1 spelling-corrector 1 spelling-checker 1 spellchecker 1 spell-corrector 1 spell-checker 1 tkinter 1