Topic: "fuzzy-matching"
wolfgarbe/SymSpell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Language: C# - Size: 12 MB - Last synced at: 2 days ago - Pushed at: 23 days ago - Stars: 3,230 - Forks: 303

teamtnt/tntsearch
A fully featured full text search engine written in PHP
Language: PHP - Size: 7.75 MB - Last synced at: about 9 hours ago - Pushed at: 6 days ago - Stars: 3,152 - Forks: 295

alexpasmantier/television
A cross-platform, fast and extensible general purpose fuzzy finder TUI πΊ
Language: Rust - Size: 10.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2,652 - Forks: 51

leeoniya/uFuzzy
A tiny, efficient fuzzy search that doesn't suck
Language: JavaScript - Size: 2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,638 - Forks: 47

Yggdroot/LeaderF
An efficient fuzzy finder that helps to locate files, buffers, mrus, gtags, etc. on the fly for both vim and neovim.
Language: Python - Size: 2.17 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 2,163 - Forks: 184

moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Language: Python - Size: 98.3 MB - Last synced at: 9 days ago - Pushed at: 13 days ago - Stars: 1,547 - Forks: 171

BishopFox/GitGot
Semi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.
Language: Python - Size: 189 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 1,490 - Forks: 213

persian-tools/persian-tools
An anthology of a variety of tools for the Persian language in javascript
Language: TypeScript - Size: 4.92 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,114 - Forks: 122

helix-editor/nucleo
A fast and convenient fuzzy matcher library for rust
Language: Rust - Size: 215 KB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 1,038 - Forks: 35

zinggAI/zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Language: Java - Size: 679 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,015 - Forks: 125

krisk/fuse-swift π¦
A lightweight fuzzy-search library, with zero dependencies
Language: Swift - Size: 117 KB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 945 - Forks: 114

xdrop/fuzzywuzzy
Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Language: Java - Size: 415 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 833 - Forks: 125

mammothb/symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Language: Python - Size: 5.93 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 822 - Forks: 122

Yomguithereal/talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Language: JavaScript - Size: 3.39 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 713 - Forks: 47

asabaylus/react-command-palette
An accessible browser compatible javascript command palette
Language: JavaScript - Size: 25.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 623 - Forks: 33

nol13/fuzzball.js
Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Language: JavaScript - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: 27 days ago - Stars: 593 - Forks: 42

Genivia/RE-flex
A high-performance C++ regex library and lexical analyzer generator with Unicode support. Extends Flex++ with Unicode support, indent/dedent anchors, lazy quantifiers, functions for lex and syntax error reporting and more. Seamlessly integrates with Bison and other parsers.
Language: C++ - Size: 70.2 MB - Last synced at: 2 days ago - Pushed at: 22 days ago - Stars: 554 - Forks: 89

schollz/closestmatch
Golang library for fuzzy matching within a set of strings :page_with_curl:
Language: Go - Size: 641 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 420 - Forks: 50

J535D165/data-matching-software
A list of free data matching and record linkage software.
Size: 93.8 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 378 - Forks: 42

Bergvca/string_grouper
Super Fast String Matching in Python
Language: Python - Size: 2.59 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 366 - Forks: 76

taleinat/fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
Language: Python - Size: 997 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 319 - Forks: 25

rmm5t/liquidmetal
:sweat_drops::metal: A mimetic poly-alloy of the Quicksilver scoring algorithm, essentially LiquidMetal. </Schwarzenegger Voice>
Language: JavaScript - Size: 44.9 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 295 - Forks: 24

dodona-edu/dolos
:detective: Source code plagiarism detection
Language: TypeScript - Size: 43.1 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 288 - Forks: 40

RobinL/fuzzymatcher
Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4
Language: Python - Size: 848 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 60

life4/textdistance.rs
π¦π Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.
Language: Rust - Size: 266 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 276 - Forks: 11

skim-rs/fuzzy-matcher
Fuzzy Matching Library for Rust
Language: Rust - Size: 66.4 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 276 - Forks: 18

DanHarltey/Fastenshtein
The fastest .Net Levenshtein around
Language: C# - Size: 165 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 263 - Forks: 24

Christopher-Thornton/hmni
π Fuzzy Name Matching with Machine Learning
Language: Python - Size: 21.1 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 263 - Forks: 51

gandersen101/spaczz
Fuzzy matching and more functionality for spaCy.
Language: Python - Size: 1.4 MB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 256 - Forks: 28

matchms/matchms
Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.
Language: Python - Size: 39 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 208 - Forks: 74

maxharlow/csvmatch
π Finds fuzzy matches between CSV files
Language: Python - Size: 158 KB - Last synced at: 13 days ago - Pushed at: 25 days ago - Stars: 189 - Forks: 22

chrislit/abydos
Abydos NLP/IR library for Python
Language: Python - Size: 52.4 MB - Last synced at: about 10 hours ago - Pushed at: over 2 years ago - Stars: 185 - Forks: 39

google/unisim π¦
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
Language: Python - Size: 8.07 MB - Last synced at: 1 day ago - Pushed at: 17 days ago - Stars: 135 - Forks: 8

JonathanReeve/text-matcher
A simple text reuse detection CLI tool.
Language: Python - Size: 67.4 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 132 - Forks: 25

FuzzyIdeas/Cling
Instant fuzzy find any file on macOS
Language: Swift - Size: 12.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 124 - Forks: 2

ripxorip/bolt.nvim
β‘ Ultrafast multi-pane file manager for Neovim with fuzzy matching
Language: Python - Size: 487 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 108 - Forks: 3

ChrisMuir/refinr
Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms
Language: C++ - Size: 287 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 104 - Forks: 5

salman-abedin/faint
Extensible TUI fuzzy file file explorer
Language: Shell - Size: 8.6 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 0

znwang25/fuzzychinese
A small package to fuzzy match chinese words
Language: Python - Size: 1.81 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 85 - Forks: 10

wooorm/levenshtein.c
Levenshtein algorithm in C
Language: C - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 13

mixpanel/fuzzbunny π¦
Fast fuzzy string searching/matching/highlighting
Language: JavaScript - Size: 2.14 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 83 - Forks: 5

searchhub/preDict
Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts
Language: Java - Size: 6.04 MB - Last synced at: 7 days ago - Pushed at: almost 7 years ago - Stars: 81 - Forks: 11

fritshermans/deduplipy
Python package for deduplication/entity resolution using active learning
Language: Python - Size: 521 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 78 - Forks: 9

vickumar1981/stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Language: Scala - Size: 1.27 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 14

AI-team-UoA/pyJedAI
An open-source library that leverages Pythonβs data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Language: Python - Size: 139 MB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 76 - Forks: 11

mnowotnik/fzshell
Fuzzy shell completions you didn't know you needed
Language: Go - Size: 93.8 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 76 - Forks: 4

wyndow/fuzzywuzzy
Fuzzy string matching for PHP
Language: PHP - Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 73 - Forks: 29

iomega/spec2vec
Word2Vec based similarity measure of mass spectrometry data.
Language: Python - Size: 21.4 MB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 67 - Forks: 17

vifon/autocomplete-ALL-the-things
Arbitrary text completion for urxvt. MAINTAINER NEEDED
Language: Perl - Size: 67.4 KB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 65 - Forks: 3

tajmone/fuzzy-search
A collection of algorithms for fuzzy search like in Sublime Text.
Language: JavaScript - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 58 - Forks: 3

Senzing/awesome
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Language: Python - Size: 244 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 57 - Forks: 2

ukushu/Ifrit
Swfit library for fuzzy search. No dependencies lib.
Language: Swift - Size: 3.07 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 56 - Forks: 7

Yggdroot/Yoyo-leaf
Yoyo-leaf is an awesome command-line fuzzy finder.
Language: C++ - Size: 251 KB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 55 - Forks: 1

tmplt/fuzzywuzzy
A C++ port of the Python module under the same name
Language: C - Size: 78.1 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 17

SYSTRAN/fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Language: C++ - Size: 4.68 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 50 - Forks: 11

viig99/SymSpellCppPy
Fast SymSpell written in c++ and exposes to python via pybind11
Language: C++ - Size: 8.31 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 42 - Forks: 7

rosette-api/python
Babel Street Analytics Client Library for Python
Language: Python - Size: 1.63 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 38 - Forks: 38

proycon/analiticcl
an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction
Language: Rust - Size: 2.26 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 4

Genivia/FuzzyMatcher
Fast fuzzy regex matcher: specify max edit distance to find approximate matches. FuzzyMatcher is now included in RE/flex.
Language: C++ - Size: 67.4 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 36 - Forks: 6

lewinfox/levitate
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
Language: R - Size: 510 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 35 - Forks: 2

persian-tools/react-persian-tools
React wrapper component around Persian tools
Language: TypeScript - Size: 3.29 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 35 - Forks: 2

delonnewman/mini-levenshtein
Simple, fast Levenshtein distance and similarity ratio for Ruby
Language: C - Size: 57.6 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 0

Vivino/go-autocomplete-trie
go-autocomplete-trie is a data structure for text auto completion that allows for fuzzy matching and configurable levenshtein distance limits
Language: Go - Size: 16.6 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 6

pigmej/fuzzy
Pure nim fuzzy search implementation. Supports substrings etc
Language: Nim - Size: 3.91 KB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 0

laggardkernel/git-ignore
Generate .gitignore files with templates from gitignore.io offline
Language: Shell - Size: 191 KB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 28 - Forks: 0

dbousque/batch_jaro_winkler
Fast batch jaro winkler distance implementation in C99 with Ruby, OCaml and Python bindings.
Language: C - Size: 4.15 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 27 - Forks: 4

OlivierBinette/er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
Language: Python - Size: 62.4 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 9

persian-tools/vue-persian-tools
Persian tools wrapper for vue.js
Language: TypeScript - Size: 3.45 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 2

mmkamani7/Skeletonization
In this repository you can see the code for skeletonization of binary images using our novel fuzzy inference system.
Language: Python - Size: 701 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 8

itspawanbhardwaj/spark-fuzzy-matching
Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)
Language: Scala - Size: 92.8 KB - Last synced at: 1 day ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 11

gyakovlev/fish-fzy
fzy inegration with fish. Search history, navigate directories and more. Blazingly fast.
Language: Shell - Size: 10.7 KB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 5

nyanlynntherazi/ZLOOKUP
Google Sheet Fuzzy String Matching Function
Language: JavaScript - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 8

patrickdet/fuzzy_compare
A fuzzy string comparison library for Elixir
Language: Elixir - Size: 17.6 KB - Last synced at: 29 days ago - Pushed at: 4 months ago - Stars: 22 - Forks: 4

poke1024/simtrie
An efficient data structure for fast string similarity searches
Language: Python - Size: 37.1 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 2

romgrk/lister
An experimental Gtk-based fuzzy-finder (with support for neovim)
Language: JavaScript - Size: 5.46 MB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 0

leo-arch/fnf
A simple fuzzy finder for the terminal
Language: C - Size: 184 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 21 - Forks: 4

LoipesMas/cargo-select
Cargo subcommand to easily run targets/examples
Language: Rust - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 1

aslpavel/sweep.py
sweep.py is yet another command line fuzzy finder (fzf analog)
Language: Python - Size: 301 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 19 - Forks: 3

dangduc/fzf-native
Emacs package for fuzzy match scoring
Language: C - Size: 1.21 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 5

esentis/multiple_search_selection
A highly customizable multiple selection widget with fuzzy search functionality.
Language: Dart - Size: 3.06 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 14

OlivierBinette/StringCompare
Efficient String Comparison Functions and Fuzzy String Matching
Language: Python - Size: 3 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 2

DavidMoraisFerreira/FuzzyWuzzy.pas
Fuzzy String Matching in Free Pascal - Port of FuzzyWuzzy
Language: Pascal - Size: 3.91 KB - Last synced at: 12 months ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 0

databricks-industry-solutions/customer-er
Translating text attributes (like name, address, phone number) into quantifiable numerical representations Training ML models to determine if these numerical labels form a match Scoring the confidence of each match
Language: Python - Size: 137 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 6

telephil9/fm
fuzzy matching selection gui
Language: C - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 2

hzeng-otterai/ffsearch
Fast fuzzy string search
Language: C++ - Size: 82 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 4

rustyconover/duckdb-fuzzycomplete-extension
DuckDB Extension for fuzzy string matching based autocompletion
Language: C++ - Size: 8.01 MB - Last synced at: 5 days ago - Pushed at: 29 days ago - Stars: 13 - Forks: 0

rmind/nxsearch
nxsearch: a full-text search engine
Language: C - Size: 240 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 13 - Forks: 2

hanifabd/lexifuzz-ner
Python Package for Named Entity Recognition (NER) - Based on Dictionary and Fuzzy Matching (Lexical Fuzzy Named Entity Recognition)
Language: Python - Size: 1.81 MB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 2

JohnnyBravo75/TwinFinder
fuzzy data matching
Language: C# - Size: 3.48 MB - Last synced at: 8 months ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 5

solo-studios/kt-fuzzy
A zero-dependency Kotlin Multiplatform library for fuzzy string matching
Language: Kotlin - Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

maxharlow/textmatch
π Finds fuzzy matches between datasets
Language: Python - Size: 120 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 0

madhurima-nath/nlp_fuzzy_match_algorithms π¦
Language: Jupyter Notebook - Size: 503 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 14

rosette-api/java
Babel Street Analytics Client Library for Java
Language: Java - Size: 64.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11 - Forks: 35

snapp-incubator/go-symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Language: Go - Size: 3.04 MB - Last synced at: 12 days ago - Pushed at: 21 days ago - Stars: 11 - Forks: 3

lt3/nfr
Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine translation.
Language: Python - Size: 34 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 11 - Forks: 2

orangain/json-fuzzy-match
Custom assertion to check whether a JSON string fuzzily matches a pattern for JVM languages.
Language: Kotlin - Size: 265 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 0

leo-arch/fzy Fork of jhawthorn/fzy
A fork of fzy with multi-selection, color support, and a few more things. Consult the manpage (fzy.1) for details.
Language: C - Size: 360 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 2

scossin/IAMsystem
A fast dictionary-based approach for semantic annotation with approximate string matching algorithms
Language: Java - Size: 768 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 3

ii14/fzx
A fuzzy finder, based on fzy
Language: C++ - Size: 299 KB - Last synced at: 20 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 4

laserpants/fuzzyset-haskell
:sheep: A fuzzy string set implementation in Haskell.
Language: Haskell - Size: 207 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 3
