An open API service providing repository metadata for many open source software ecosystems.

Topic: "approximate-string-matching"

wolfgarbe/SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: C# - Size: 12 MB - Last synced at: about 16 hours ago - Pushed at: about 2 months ago - Stars: 3,242 - Forks: 303

mammothb/symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: Python - Size: 5.76 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 824 - Forks: 124

fmmfonseca/completely

Java autocomplete library.

Language: Java - Size: 290 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 107 - Forks: 19

ChrisMuir/refinr

Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms

Language: C++ - Size: 287 KB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 104 - Forks: 5

searchhub/preDict

Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts

Language: Java - Size: 6.04 MB - Last synced at: 4 days ago - Pushed at: almost 7 years ago - Stars: 81 - Forks: 11

wolfgarbe/SymSpellCompound

SymSpellCompound: compound aware automatic spelling correction

Size: 646 KB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 66 - Forks: 16

universal-automata/liblevenshtein-java

Various utilities regarding Levenshtein transducers. (Java)

Language: Java - Size: 5.83 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 57 - Forks: 20

wolfgarbe/LinSpell

Fast approximate strings search & spelling correction

Language: C# - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 55 - Forks: 6

fiatjaf/search-comparison

visualize { fast-fuzzy, fuzzyset, sifter, lunr, fuse, fuzzy } results at the same time

Language: JavaScript - Size: 479 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 54 - Forks: 11

proycon/analiticcl

an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction

Language: Rust - Size: 2.26 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 37 - Forks: 4

CMU-SAFARI/GenASM

Source code for the software implementations of the GenASM algorithms proposed in our MICRO 2020 paper: Senol Cali et. al., "GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis" at https://people.inf.ethz.ch/omutlu/pub/GenASM-approximate-string-matching-framework-for-genome-analysis_micro20.pdf

Language: C - Size: 44.9 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 7

RagnarGrootKoerkamp/sassy

Fast approximate string searching

Language: Rust - Size: 164 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 15 - Forks: 1

hzeng-otterai/ffsearch

Fast fuzzy string search

Language: C++ - Size: 82 KB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 4

ifazk/mula

ML's radishal Universal Levenshtein Automata library.

Language: OCaml - Size: 121 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

abhabongse/fuzzymatch-go

Approximate string matching library implemented in Go language

Language: Go - Size: 168 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

Avinash793/String-matching-algorithms-and-applications

Implementation of various string matching algorithms and their experimental analysis. Contains paper describing different string matching algorithms with their time and space complexities.

Language: C++ - Size: 504 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 6

julianthome/prex

Approximate Regular Expression Matcher

Language: Java - Size: 65.4 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

Bohdan-Khomtchouk/Smalls

miRNA-UTR approximate matching algorithms in R/Python/Bash

Language: Python - Size: 16.8 MB - Last synced at: about 2 years ago - Pushed at: almost 9 years ago - Stars: 4 - Forks: 1

colemanator/papyrus-api

Fuzzy search for the Bible

Language: Rust - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 2

uxter/fuzzysearch

:eyeglasses: Implementing a Fuzzy Search Algorithm in Java with possible to extend by adding similarity calculation strategy

Language: Java - Size: 76.2 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 2

suamin/PyNemex 📦

Approximate Entity Matching

Language: Python - Size: 150 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

8573/fuzzy-regex.rs

A @rust-lang library aiming to provide regexes with fuzzy matching

Language: Rust - Size: 9.77 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

lksark/Aproximate-String-Matching-multithreading

Search Approximate String Matching using multithreading

Language: C# - Size: 22.5 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

sgrpanchal31/SymSpell

This repo contains the Python 3 compatible code for SymSpell algorithm

Language: Python - Size: 17.6 KB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

Alex230207/spell

Spelling correction and string segmentation written in Go spell-check, spellcheck, spelling, spelling-correction, string-segmentation, symspell, text-segmentation, word-segmentation

Language: Go - Size: 13.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

jlgregorio/tkinter-search-engine-python

Search engine app' for easy information retrieval in "databases" or spreadsheet collections

Language: Python - Size: 3.48 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

joaopaulq/pmt

📄 a command-line interface for pattern matching

Language: C++ - Size: 29.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

kaiyoo/NLP-Detect-Lexical-Blends

Detect lexical blending using similarity by approximate string matching and word vectors

Language: Python - Size: 1.33 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vikhari/Sim-Find 📦

SimFind is a database approach to similarity search (approximate matching).

Language: C++ - Size: 18.7 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

dbahrdt/srtree

R+-Tree with augmentations to support Textual-Spatial-Queries

Language: C++ - Size: 245 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

smanurung/blendtor

Lexical blend word detector

Language: Python - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ward/bitap

Language: Rust - Size: 447 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

ggabriel96/mapnames

This is the repository for the code of my Computer Science bachelor's degree final paper.

Language: Python - Size: 11.2 MB - Last synced at: 22 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

AndreyBronin/spellchecker

Simple spellchecker uses approximate string matching(fuzzy search) algorithm based on Levenshtein distance between two strings

Language: C++ - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Edarke/NearestNeighbors

Nearest Neighbor Implementations for Java

Language: Java - Size: 434 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

Related Topics
fuzzy-search 17 fuzzy-matching 11 edit-distance 11 levenshtein-distance 9 spelling-correction 8 spellcheck 6 levenshtein 6 damerau-levenshtein 6 symspell 5 spell-check 5 spelling 4 string-matching 4 word-segmentation 3 nlp 3 trie 2 java 2 search-engine 2 string 2 autocomplete 2 cpp 2 csharp 2 machine-learning 2 natural-language-processing 2 regex 2 regular-expression 2 text-segmentation 2 spellchecker 2 rust 2 bioinformatics 2 computational-biology 2 string-algorithms 2 python 2 chinese-word-segmentation 2 chinese-text-segmentation 2 aho-corasick-algorithm 1 webapp 1 automata 1 boyer-moore-algorithm 1 brute-force 1 rust-library 1 tkinter 1 pairwise-alignment 1 simd 1 clustering 1 cran 1 data-cleaning 1 data-clustering 1 ngram 1 openrefine 1 r 1 rstats 1 hunspell 1 isomorphic-javascript 1 javascript 1 kotlin 1 learning 1 ruby 1 universal-javascript 1 rust-language 1 rust-lang 1 regexp 1 regexes 1 fuzzy 1 approximate 1 approx 1 normalization 1 sorting-algorithm 1 search 1 java-8 1 universal-automata 1 levenshtein-automata 1 information-retrieval 1 genomics 1 finite-state-transducer 1 finite-state-automata 1 distance-metric 1 dictionary 1 data-science 1 computer-science 1 suffix-tries 1 rabin-karp-algorithm 1 kmp-algorithm 1 hybrid-string 1 c-plus-plus 1 text 1 library 1 indexing 1 pattern-matching 1 wordvectors 1 jaro-winkler-similarity 1 jaro-winkler-distance 1 jaro-winkler 1 suffix-tree 1 suffix-array 1 stable-marriage 1 qgram-distance 1 min-cost-flow 1 bipartite-graphs 1 spatial-queries 1 r-tree 1