An open API service providing repository metadata for many open source software ecosystems.

Topic: "fuzzy-matching"

wolfgarbe/SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: C# - Size: 12 MB - Last synced at: 2 days ago - Pushed at: 23 days ago - Stars: 3,230 - Forks: 303

teamtnt/tntsearch

A fully featured full text search engine written in PHP

Language: PHP - Size: 7.75 MB - Last synced at: about 9 hours ago - Pushed at: 6 days ago - Stars: 3,152 - Forks: 295

alexpasmantier/television

A cross-platform, fast and extensible general purpose fuzzy finder TUI 📺

Language: Rust - Size: 10.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2,652 - Forks: 51

leeoniya/uFuzzy

A tiny, efficient fuzzy search that doesn't suck

Language: JavaScript - Size: 2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,638 - Forks: 47

Yggdroot/LeaderF

An efficient fuzzy finder that helps to locate files, buffers, mrus, gtags, etc. on the fly for both vim and neovim.

Language: Python - Size: 2.17 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 2,163 - Forks: 184

moj-analytical-services/splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Language: Python - Size: 98.3 MB - Last synced at: 9 days ago - Pushed at: 13 days ago - Stars: 1,547 - Forks: 171

BishopFox/GitGot

Semi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.

Language: Python - Size: 189 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 1,490 - Forks: 213

persian-tools/persian-tools

An anthology of a variety of tools for the Persian language in JavaScript

Language: TypeScript - Size: 4.92 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,114 - Forks: 122

helix-editor/nucleo

A fast and convenient fuzzy matcher library for Rust

Language: Rust - Size: 215 KB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 1,038 - Forks: 35

zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Language: Java - Size: 679 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,015 - Forks: 125

krisk/fuse-swift 📦

A lightweight fuzzy-search library, with zero dependencies

Language: Swift - Size: 117 KB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 945 - Forks: 114

xdrop/fuzzywuzzy

Java fuzzy string matching implementation of Python's well-known fuzzywuzzy algorithm. Fuzzy search for Java.

Language: Java - Size: 415 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 833 - Forks: 125

mammothb/symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: Python - Size: 5.93 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 822 - Forks: 122

Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

Language: JavaScript - Size: 3.39 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 713 - Forks: 47

asabaylus/react-command-palette

An accessible, browser-compatible JavaScript command palette

Language: JavaScript - Size: 25.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 623 - Forks: 33

nol13/fuzzball.js

Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.

Language: JavaScript - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: 27 days ago - Stars: 593 - Forks: 42

Genivia/RE-flex

A high-performance C++ regex library and lexical analyzer generator with Unicode support. Extends Flex++ with Unicode support, indent/dedent anchors, lazy quantifiers, functions for lex and syntax error reporting and more. Seamlessly integrates with Bison and other parsers.

Language: C++ - Size: 70.2 MB - Last synced at: 2 days ago - Pushed at: 22 days ago - Stars: 554 - Forks: 89

schollz/closestmatch

Golang library for fuzzy matching within a set of strings 📃

Language: Go - Size: 641 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 420 - Forks: 50

J535D165/data-matching-software

A list of free data matching and record linkage software.

Size: 93.8 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 378 - Forks: 42

Bergvca/string_grouper

Super Fast String Matching in Python

Language: Python - Size: 2.59 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 366 - Forks: 76

taleinat/fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

Language: Python - Size: 997 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 319 - Forks: 25

rmm5t/liquidmetal

💦🤘 A mimetic poly-alloy of the Quicksilver scoring algorithm, essentially LiquidMetal. </Schwarzenegger Voice>

Language: JavaScript - Size: 44.9 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 295 - Forks: 24

dodona-edu/dolos

🕵️ Source code plagiarism detection

Language: TypeScript - Size: 43.1 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 288 - Forks: 40

RobinL/fuzzymatcher

Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4

Language: Python - Size: 848 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 60

life4/textdistance.rs

🦀 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

Language: Rust - Size: 266 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 276 - Forks: 11

skim-rs/fuzzy-matcher

Fuzzy Matching Library for Rust

Language: Rust - Size: 66.4 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 276 - Forks: 18

DanHarltey/Fastenshtein

The fastest .Net Levenshtein around

Language: C# - Size: 165 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 263 - Forks: 24

Christopher-Thornton/hmni

📛 Fuzzy Name Matching with Machine Learning

Language: Python - Size: 21.1 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 263 - Forks: 51

gandersen101/spaczz

Fuzzy matching and more functionality for spaCy.

Language: Python - Size: 1.4 MB - Last synced at: 17 days ago - Pushed at: 10 months ago - Stars: 256 - Forks: 28

matchms/matchms

Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.

Language: Python - Size: 39 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 208 - Forks: 74

maxharlow/csvmatch

🔎 Finds fuzzy matches between CSV files

Language: Python - Size: 158 KB - Last synced at: 13 days ago - Pushed at: 25 days ago - Stars: 189 - Forks: 22

chrislit/abydos

Abydos NLP/IR library for Python

Language: Python - Size: 52.4 MB - Last synced at: about 10 hours ago - Pushed at: over 2 years ago - Stars: 185 - Forks: 39

google/unisim 📦

UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.

Language: Python - Size: 8.07 MB - Last synced at: 1 day ago - Pushed at: 17 days ago - Stars: 135 - Forks: 8

JonathanReeve/text-matcher

A simple text reuse detection CLI tool.

Language: Python - Size: 67.4 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 132 - Forks: 25

FuzzyIdeas/Cling

Instant fuzzy find any file on macOS

Language: Swift - Size: 12.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 124 - Forks: 2

ripxorip/bolt.nvim

⚡ Ultrafast multi-pane file manager for Neovim with fuzzy matching

Language: Python - Size: 487 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 108 - Forks: 3

ChrisMuir/refinr

Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms

Language: C++ - Size: 287 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 104 - Forks: 5

salman-abedin/faint

Extensible TUI fuzzy file explorer

Language: Shell - Size: 8.6 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 0

znwang25/fuzzychinese

A small package to fuzzy match Chinese words

Language: Python - Size: 1.81 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 85 - Forks: 10

wooorm/levenshtein.c

Levenshtein algorithm in C

Language: C - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 13

mixpanel/fuzzbunny 📦

Fast fuzzy string searching/matching/highlighting

Language: JavaScript - Size: 2.14 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 83 - Forks: 5

searchhub/preDict

Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts

Language: Java - Size: 6.04 MB - Last synced at: 7 days ago - Pushed at: almost 7 years ago - Stars: 81 - Forks: 11

fritshermans/deduplipy

Python package for deduplication/entity resolution using active learning

Language: Python - Size: 521 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 78 - Forks: 9

vickumar1981/stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.

Language: Scala - Size: 1.27 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 14

AI-team-UoA/pyJedAI

An open-source library that leverages Python's data science ecosystem to build powerful end-to-end Entity Resolution workflows.

Language: Python - Size: 139 MB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 76 - Forks: 11

mnowotnik/fzshell

Fuzzy shell completions you didn't know you needed

Language: Go - Size: 93.8 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 76 - Forks: 4

wyndow/fuzzywuzzy

Fuzzy string matching for PHP

Language: PHP - Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 73 - Forks: 29

iomega/spec2vec

Word2Vec based similarity measure of mass spectrometry data.

Language: Python - Size: 21.4 MB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 67 - Forks: 17

vifon/autocomplete-ALL-the-things

Arbitrary text completion for urxvt. MAINTAINER NEEDED

Language: Perl - Size: 67.4 KB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 65 - Forks: 3

tajmone/fuzzy-search

A collection of algorithms for fuzzy search like in Sublime Text.

Language: JavaScript - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 58 - Forks: 3

Senzing/awesome

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Language: Python - Size: 244 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 57 - Forks: 2

ukushu/Ifrit

Swift library for fuzzy search with no dependencies.

Language: Swift - Size: 3.07 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 56 - Forks: 7

Yggdroot/Yoyo-leaf

Yoyo-leaf is an awesome command-line fuzzy finder.

Language: C++ - Size: 251 KB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 55 - Forks: 1

tmplt/fuzzywuzzy

A C++ port of the Python module of the same name

Language: C - Size: 78.1 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 17

SYSTRAN/fuzzy-match

Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.

Language: C++ - Size: 4.68 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 50 - Forks: 11

viig99/SymSpellCppPy

Fast SymSpell written in C++ and exposed to Python via pybind11

Language: C++ - Size: 8.31 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 42 - Forks: 7

rosette-api/python

Babel Street Analytics Client Library for Python

Language: Python - Size: 1.63 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 38 - Forks: 38

proycon/analiticcl

An approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction

Language: Rust - Size: 2.26 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 4

Genivia/FuzzyMatcher

Fast fuzzy regex matcher: specify max edit distance to find approximate matches. FuzzyMatcher is now included in RE/flex.

Language: C++ - Size: 67.4 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 36 - Forks: 6

lewinfox/levitate

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

Language: R - Size: 510 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 35 - Forks: 2

persian-tools/react-persian-tools

React wrapper component around Persian tools

Language: TypeScript - Size: 3.29 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 35 - Forks: 2

delonnewman/mini-levenshtein

Simple, fast Levenshtein distance and similarity ratio for Ruby

Language: C - Size: 57.6 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 0

Vivino/go-autocomplete-trie

go-autocomplete-trie is a data structure for text auto completion that allows for fuzzy matching and configurable Levenshtein distance limits

Language: Go - Size: 16.6 KB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 6

pigmej/fuzzy

Pure Nim fuzzy search implementation. Supports substrings etc.

Language: Nim - Size: 3.91 KB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 0

laggardkernel/git-ignore

Generate .gitignore files with templates from gitignore.io offline

Language: Shell - Size: 191 KB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 28 - Forks: 0

dbousque/batch_jaro_winkler

Fast batch Jaro-Winkler distance implementation in C99 with Ruby, OCaml and Python bindings.

Language: C - Size: 4.15 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 27 - Forks: 4

OlivierBinette/er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Language: Python - Size: 62.4 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 9

persian-tools/vue-persian-tools

Persian tools wrapper for Vue.js

Language: TypeScript - Size: 3.45 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 2

mmkamani7/Skeletonization

In this repository you can see the code for skeletonization of binary images using our novel fuzzy inference system.

Language: Python - Size: 701 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 8

itspawanbhardwaj/spark-fuzzy-matching

Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)

Language: Scala - Size: 92.8 KB - Last synced at: 1 day ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 11

gyakovlev/fish-fzy

fzy integration with fish. Search history, navigate directories and more. Blazingly fast.

Language: Shell - Size: 10.7 KB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 5

nyanlynntherazi/ZLOOKUP

Google Sheet Fuzzy String Matching Function

Language: JavaScript - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 8

patrickdet/fuzzy_compare

A fuzzy string comparison library for Elixir

Language: Elixir - Size: 17.6 KB - Last synced at: 29 days ago - Pushed at: 4 months ago - Stars: 22 - Forks: 4

poke1024/simtrie

An efficient data structure for fast string similarity searches

Language: Python - Size: 37.1 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 2

romgrk/lister

An experimental Gtk-based fuzzy-finder (with support for neovim)

Language: JavaScript - Size: 5.46 MB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 0

leo-arch/fnf

A simple fuzzy finder for the terminal

Language: C - Size: 184 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 21 - Forks: 4

LoipesMas/cargo-select

Cargo subcommand to easily run targets/examples

Language: Rust - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 1

aslpavel/sweep.py

sweep.py is yet another command line fuzzy finder (fzf analog)

Language: Python - Size: 301 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 19 - Forks: 3

dangduc/fzf-native

Emacs package for fuzzy match scoring

Language: C - Size: 1.21 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 5

esentis/multiple_search_selection

A highly customizable multiple selection widget with fuzzy search functionality.

Language: Dart - Size: 3.06 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 14

OlivierBinette/StringCompare

Efficient String Comparison Functions and Fuzzy String Matching

Language: Python - Size: 3 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 2

DavidMoraisFerreira/FuzzyWuzzy.pas

Fuzzy String Matching in Free Pascal - Port of FuzzyWuzzy

Language: Pascal - Size: 3.91 KB - Last synced at: 12 months ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 0

databricks-industry-solutions/customer-er

Translating text attributes (like name, address, phone number) into quantifiable numerical representations; training ML models to determine if these numerical labels form a match; scoring the confidence of each match

Language: Python - Size: 137 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 6

telephil9/fm

fuzzy matching selection gui

Language: C - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 2

hzeng-otterai/ffsearch

Fast fuzzy string search

Language: C++ - Size: 82 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 4

rustyconover/duckdb-fuzzycomplete-extension

DuckDB Extension for fuzzy string matching based autocompletion

Language: C++ - Size: 8.01 MB - Last synced at: 5 days ago - Pushed at: 29 days ago - Stars: 13 - Forks: 0

rmind/nxsearch

nxsearch: a full-text search engine

Language: C - Size: 240 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 13 - Forks: 2

hanifabd/lexifuzz-ner

Python Package for Named Entity Recognition (NER) - Based on Dictionary and Fuzzy Matching (Lexical Fuzzy Named Entity Recognition)

Language: Python - Size: 1.81 MB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 2

JohnnyBravo75/TwinFinder

fuzzy data matching

Language: C# - Size: 3.48 MB - Last synced at: 8 months ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 5

solo-studios/kt-fuzzy

A zero-dependency Kotlin Multiplatform library for fuzzy string matching

Language: Kotlin - Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

maxharlow/textmatch

🔎 Finds fuzzy matches between datasets

Language: Python - Size: 120 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 0

madhurima-nath/nlp_fuzzy_match_algorithms 📦

Language: Jupyter Notebook - Size: 503 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 14

rosette-api/java

Babel Street Analytics Client Library for Java

Language: Java - Size: 64.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11 - Forks: 35

snapp-incubator/go-symspell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: Go - Size: 3.04 MB - Last synced at: 12 days ago - Pushed at: 21 days ago - Stars: 11 - Forks: 3

lt3/nfr

Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine translation.

Language: Python - Size: 34 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 11 - Forks: 2

orangain/json-fuzzy-match

Custom assertion to check whether a JSON string fuzzily matches a pattern for JVM languages.

Language: Kotlin - Size: 265 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 0

leo-arch/fzy (fork of jhawthorn/fzy)

A fork of fzy with multi-selection, color support, and a few more things. Consult the manpage (fzy.1) for details.

Language: C - Size: 360 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 2

scossin/IAMsystem

A fast dictionary-based approach for semantic annotation with approximate string matching algorithms

Language: Java - Size: 768 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 3

ii14/fzx

A fuzzy finder, based on fzy

Language: C++ - Size: 299 KB - Last synced at: 20 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 4

laserpants/fuzzyset-haskell

🐑 A fuzzy string set implementation in Haskell.

Language: Haskell - Size: 207 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 3