Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: fuzzy-matching
SamiSieranoja/stridx
Fast fuzzy string similarity search and indexing (for filenames)
Language: C++ - Size: 549 KB - Last synced: about 8 hours ago - Pushed: about 9 hours ago - Stars: 1 - Forks: 0
google/unisim
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
Language: Python - Size: 8.07 MB - Last synced: about 12 hours ago - Pushed: 1 day ago - Stars: 77 - Forks: 3
bent10/boox
Search anything, instantly
Language: TypeScript - Size: 43.4 MB - Last synced: about 20 hours ago - Pushed: about 21 hours ago - Stars: 2 - Forks: 0
Genivia/FuzzyMatcher
Fast fuzzy regex matcher: specify max edit distance to find approximate matches
Language: C++ - Size: 44.9 KB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 35 - Forks: 6
J535D165/data-matching-software
A list of free data matching and record linkage software.
Size: 93.8 KB - Last synced: about 4 hours ago - Pushed: 3 months ago - Stars: 350 - Forks: 41
IDinsight/hindi-fuzzy-merge
Code repository with customisable Fuzzy Matching scripts in STATA and Python, especially useful when working with datasets containing Hindi text transliterated to English.
Language: Python - Size: 120 KB - Last synced: 3 days ago - Pushed: over 3 years ago - Stars: 5 - Forks: 0
SYSTRAN/fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Language: C++ - Size: 4.65 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 44 - Forks: 8
zinggAI/zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Language: Java - Size: 438 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 890 - Forks: 108
Yomguithereal/talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Language: JavaScript - Size: 3.39 MB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 701 - Forks: 50
hanifabd/lexifuzz-ner
Python Package for Named Entity Recognition (NER) - Based on Dictionary and Fuzzy Matching (Lexical Fuzzy Named Entity Recognition)
Language: Python - Size: 1.8 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 7 - Forks: 1
wolfgarbe/SymSpell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Language: C# - Size: 12.1 MB - Last synced: 3 days ago - Pushed: about 2 months ago - Stars: 3,048 - Forks: 281
DanHarltey/Fastenshtein
The fastest .Net Levenshtein around
Language: C# - Size: 135 KB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 224 - Forks: 22
stephengtuggy/hippocratic-demographics
Library for processing human demographic data, licensed under the Hippocratic License
Language: Rust - Size: 60.5 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0
Genivia/RE-flex
A high-performance C++ regex library and lexical analyzer generator with Unicode support. Extends Flex++ with Unicode support, indent/dedent anchors, lazy quantifiers, functions for lex and syntax error reporting and more. Seamlessly integrates with Bison and other parsers.
Language: C++ - Size: 64.5 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 488 - Forks: 85
AI-team-UoA/pyJedAI
An open-source library that leverages Pythonβs data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Language: Python - Size: 127 MB - Last synced: 6 days ago - Pushed: 25 days ago - Stars: 62 - Forks: 10
asabaylus/react-command-palette
An accessible browser compatible javascript command palette
Language: JavaScript - Size: 25.3 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 623 - Forks: 33
helix-editor/nucleo
A fast and convenient fuzzy matcher library for rust
Language: Rust - Size: 208 KB - Last synced: 9 days ago - Pushed: about 1 month ago - Stars: 719 - Forks: 23
corentinpla/Fraud-detection-in-a-complex-or-isolated-banking-system Fork of CorentinPernot/Statapp
Prove that the pooling of data from different banking players provides added value for the detection of fraudulent transactions.
Language: Jupyter Notebook - Size: 54.7 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0
matchms/matchms
Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.
Language: Python - Size: 38 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 165 - Forks: 57
Senzing/awesome
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Language: Python - Size: 230 KB - Last synced: about 9 hours ago - Pushed: about 1 month ago - Stars: 48 - Forks: 2
rlespinasse/wints
What I Need To See - a fuzzy term-based URLs opener
Language: Rust - Size: 191 KB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 0 - Forks: 0
gandersen101/spaczz
Fuzzy matching and more functionality for spaCy.
Language: Python - Size: 1.24 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 245 - Forks: 27
persian-tools/persian-tools
An anthology of a variety of tools for the Persian language in javascript
Language: TypeScript - Size: 4.04 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 1,024 - Forks: 113
indxSearch/indx-restapi-search
Typescript component to use Indx Search with RestAPI
Language: TypeScript - Size: 668 KB - Last synced: 2 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0
leeoniya/uFuzzy
A tiny, efficient fuzzy search that doesn't suck
Language: JavaScript - Size: 3.34 MB - Last synced: 16 days ago - Pushed: 3 months ago - Stars: 2,507 - Forks: 45
Christopher-Thornton/hmni
π Fuzzy Name Matching with Machine Learning
Language: Python - Size: 21.2 MB - Last synced: 13 days ago - Pushed: 3 months ago - Stars: 243 - Forks: 50
maxharlow/csvmatch
π Finds fuzzy matches between CSV files
Language: Python - Size: 118 KB - Last synced: 7 days ago - Pushed: about 1 month ago - Stars: 174 - Forks: 21
scossin/iamsystem_python
Fast dictionary-based approach for semantic annotation / entity linking
Language: Python - Size: 389 KB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 4 - Forks: 1
Mr-G254/FuzzyLogic
In the sample code provided I have used Java to compare two given strings getting the percentage match using the Fuzzy Logic.
Language: Java - Size: 6.84 KB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 0 - Forks: 0
teamtnt/tntsearch
A fully featured full text search engine written in PHP
Language: PHP - Size: 7.71 MB - Last synced: 18 days ago - Pushed: 3 months ago - Stars: 3,039 - Forks: 287
Valires/er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
Language: Python - Size: 62.4 MB - Last synced: 18 days ago - Pushed: 6 months ago - Stars: 22 - Forks: 3
beacoder/org-ivy-search
Full text search for org files.
Language: Emacs Lisp - Size: 2.14 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 3 - Forks: 1
ChrisMuir/refinr
Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms
Language: C++ - Size: 287 KB - Last synced: 5 days ago - Pushed: 2 months ago - Stars: 102 - Forks: 5
lotabout/fuzzy-matcher
Fuzzy Matching Library for Rust
Language: Rust - Size: 66.4 KB - Last synced: about 3 hours ago - Pushed: 3 months ago - Stars: 240 - Forks: 16
krisk/fuse-swift π¦
A lightweight fuzzy-search library, with zero dependencies
Language: Swift - Size: 117 KB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 921 - Forks: 110
lewinfox/levitate
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
Language: R - Size: 467 KB - Last synced: 22 days ago - Pushed: 8 months ago - Stars: 33 - Forks: 2
life4/textdistance.rs
π¦π Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.
Language: Rust - Size: 206 KB - Last synced: 16 days ago - Pushed: 12 months ago - Stars: 252 - Forks: 9
taleinat/fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
Language: Python - Size: 976 KB - Last synced: 9 days ago - Pushed: over 1 year ago - Stars: 282 - Forks: 24
Vivino/go-autocomplete-trie
go-autocomplete-trie is a data structure for text auto completion that allows for fuzzy matching and configurable levenshtein distance limits
Language: Go - Size: 16.6 KB - Last synced: 18 days ago - Pushed: about 1 year ago - Stars: 29 - Forks: 6
indxSearch/indx-restapi-load
Typescript component to load data to Indx Search with RestAPI
Language: TypeScript - Size: 957 KB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 0 - Forks: 0
JohnnyBravo75/TwinFinder
fuzzy data matching
Language: C# - Size: 3.48 MB - Last synced: 21 days ago - Pushed: over 6 years ago - Stars: 13 - Forks: 5
persian-tools/react-persian-tools
React wrapper component around Persian tools
Language: TypeScript - Size: 2.99 MB - Last synced: 2 months ago - Pushed: 5 months ago - Stars: 32 - Forks: 2
nol13/fuzzball.js
Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Language: JavaScript - Size: 7.48 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 501 - Forks: 40
mammothb/symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Language: Python - Size: 5.76 MB - Last synced: 21 days ago - Pushed: about 2 months ago - Stars: 766 - Forks: 116
proycon/analiticcl
an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction
Language: Rust - Size: 2.45 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 26 - Forks: 4
dodona-edu/dolos
:detective: Source code plagiarism detection
Language: TypeScript - Size: 39.5 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 206 - Forks: 25
wyndow/fuzzywuzzy
Fuzzy string matching for PHP
Language: PHP - Size: 3.91 KB - Last synced: 14 days ago - Pushed: about 4 years ago - Stars: 70 - Forks: 23
RobinL/fuzzymatcher
Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4
Language: Python - Size: 848 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 280 - Forks: 60
mixpanel/fuzzbunny
Fast fuzzy string searching/matching/highlighting
Language: JavaScript - Size: 2.14 MB - Last synced: 13 days ago - Pushed: 10 months ago - Stars: 26 - Forks: 6
rosette-api/java
Rosette API Client Library for Java
Language: Java - Size: 64.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 10 - Forks: 35
benjamrio/company-names-matching
Algorithm matching company names.
Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0
rosette-api/R-Binding
R client binding for the Rosette API
Language: R - Size: 1.05 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 8
moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Language: Python - Size: 89.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1,072 - Forks: 126
Yggdroot/LeaderF
An efficient fuzzy finder that helps to locate files, buffers, mrus, gtags, etc. on the fly for both vim and neovim.
Language: Python - Size: 2.23 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2,095 - Forks: 170
leo-arch/fnf
A simple fuzzy finder for the terminal
Language: C - Size: 170 KB - Last synced: 18 days ago - Pushed: about 2 months ago - Stars: 13 - Forks: 2
chrislit/abydos
Abydos NLP/IR library for Python
Language: Python - Size: 52.4 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 172 - Forks: 32
HellstromIT/go-quickswitch
Quickly jump between git repositories on your filesystem
Language: Go - Size: 154 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0
amiegirl/Analysis_of_AMCAT_Data
The focus of this study is on certain groups of graduates in computer science and engineering in order to better understand the unique job issues they confront.
Language: Jupyter Notebook - Size: 7.68 MB - Last synced: 11 days ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
OlivierBinette/simple-typo-tolerant-search
Efficient typo-tolerant search in 76 lines of code, with no dependencies.
Language: Python - Size: 243 KB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0
znwang25/fuzzychinese
A small package to fuzzy match chinese words
Language: Python - Size: 1.81 MB - Last synced: 15 days ago - Pushed: about 1 year ago - Stars: 69 - Forks: 9
aslilac/spirits
Get the spirit of a string, without the whole thing!
Language: TypeScript - Size: 347 KB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0
iomega/spec2vec
Word2Vec based similarity measure of mass spectrometry data.
Language: Python - Size: 21.4 MB - Last synced: 16 days ago - Pushed: 9 months ago - Stars: 57 - Forks: 14
orangain/json-fuzzy-match
Custom assertion to check whether a JSON string fuzzily matches a pattern for JVM languages.
Language: Kotlin - Size: 218 KB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 9 - Forks: 0
ii14/fzx
A fuzzy finder, based on fzy
Language: C++ - Size: 299 KB - Last synced: 18 days ago - Pushed: 24 days ago - Stars: 6 - Forks: 3
jonathanmorris180/apex-fuzzy-finder
A fuzzy finder for Apex.
Language: Apex - Size: 279 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 8 - Forks: 2
dvsh243/Seekr
in-memory fuzzy matching
Language: Python - Size: 28.7 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 1 - Forks: 0
kyr0/clientside-search
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
Language: TypeScript - Size: 1.58 MB - Last synced: 17 days ago - Pushed: 10 months ago - Stars: 6 - Forks: 0
wooorm/levenshtein.c
Levenshtein algorithm in C
Language: C - Size: 22.5 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 80 - Forks: 14
patrickdet/fuzzy_compare
A fuzzy string comparison library for Elixir
Language: Elixir - Size: 13.7 KB - Last synced: about 19 hours ago - Pushed: 4 months ago - Stars: 21 - Forks: 4
luqmanoop/use-command-score
Tiny, fast fuzzy β‘οΈearch for React applications
Language: TypeScript - Size: 34.2 KB - Last synced: 12 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
lpimem/ds_handson
Data Science Hands-on Experiments
Language: Jupyter Notebook - Size: 16.2 MB - Last synced: 2 months ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0
pravigo/fuzzy-matching
Fuzzy matching recipe for Local Authority datasets
Size: 584 KB - Last synced: 2 months ago - Pushed: over 6 years ago - Stars: 5 - Forks: 1
PhilaController/schuylkill
Fixing human errors by matching those hard-to-spell words.
Language: Python - Size: 66.4 KB - Last synced: 25 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
JonathanReeve/text-matcher
A simple text reuse detection CLI tool.
Language: Python - Size: 67.4 KB - Last synced: 16 days ago - Pushed: 11 months ago - Stars: 120 - Forks: 24
maxharlow/textmatch
π Finds fuzzy matches between datasets
Language: Python - Size: 82 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 0
BishopFox/GitGot
Semi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.
Language: Python - Size: 189 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1,361 - Forks: 201
OlivierBinette/StringCompare
Efficient String Comparison Functions and Fuzzy String Matching
Language: Python - Size: 3 MB - Last synced: 18 days ago - Pushed: about 2 years ago - Stars: 16 - Forks: 2
simonschoe/fuzzy-name-match
Fuzzy match entity names (primarily persons and companies) across databases
Language: Python - Size: 211 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 1
persian-tools/vue-persian-tools
Persian tools wrapper for vue.js
Language: TypeScript - Size: 3.45 MB - Last synced: 29 days ago - Pushed: 5 months ago - Stars: 25 - Forks: 2
fritshermans/deduplipy
Python package for deduplication/entity resolution using active learning
Language: Python - Size: 520 KB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 71 - Forks: 9
gustaveWPM/OC-Kasa π¦
React webapp made during an OpenClassrooms bootcamp. "Raw React" project (React libs disallowed).
Language: TypeScript - Size: 10.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
gustaveWPM/Typescript-Damerau-Levenshtein π¦
Just another Damerau Levenshtein distance implementation, made in TypeScript
Language: TypeScript - Size: 2.93 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
adellegia/GetHelp
GetHelp is envisioned to be a platform that connects students of the Hertie School with each other in times of urgent need!
Language: Python - Size: 508 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
rosette-api/python
Rosette API Client Library for Python
Language: Python - Size: 1.63 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 37 - Forks: 91
wowinter13/fast_fuzzy_matcher
A tiny and blazing-fast fuzzy search in pure Ruby with FFI bindings to Go.
Language: Go - Size: 958 KB - Last synced: 8 days ago - Pushed: 4 months ago - Stars: 2 - Forks: 0
delonnewman/mini-levenshtein
Simple, fast Levenshtein distance and similarity ratio for Ruby
Language: C - Size: 57.6 KB - Last synced: 9 days ago - Pushed: 5 months ago - Stars: 24 - Forks: 0
DavidMoraisFerreira/FuzzyWuzzy.pas
Fuzzy String Matching in Free Pascal - Port of FuzzyWuzzy
Language: Pascal - Size: 3.91 KB - Last synced: 26 days ago - Pushed: about 5 years ago - Stars: 16 - Forks: 0
schollz/closestmatch
Golang library for fuzzy matching within a set of strings :page_with_curl:
Language: Go - Size: 641 KB - Last synced: 17 days ago - Pushed: over 1 year ago - Stars: 416 - Forks: 53
deductiv/fuzzylookup
Fuzzlookup search command for Splunk. Use fuzzy logic to enrich search results using near-matches in your lookups.
Language: Python - Size: 195 KB - Last synced: 18 days ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1
theoparis/fzy π¦
A fork of fzy (MIRROR)
Language: C - Size: 358 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
datahappy1/go_fuzzymatch_webapp
Fuzzster - fuzzy matching web application
Language: Go - Size: 1.3 MB - Last synced: 4 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
ripxorip/bolt.nvim
β‘ Ultrafast multi-pane file manager for Neovim with fuzzy matching
Language: Python - Size: 487 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 107 - Forks: 3
xdrop/fuzzywuzzy
Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Language: Java - Size: 415 KB - Last synced: 4 months ago - Pushed: 10 months ago - Stars: 758 - Forks: 112
madhurima-nath/nlp_fuzzy_match_algorithms π¦
Language: Jupyter Notebook - Size: 503 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 12 - Forks: 14
salman-abedin/faint
Extensible TUI fuzzy file file explorer
Language: Shell - Size: 8.6 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 94 - Forks: 0
mnowotnik/fzshell
Fuzzy shell completions you didn't know you needed
Language: Go - Size: 93.8 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 73 - Forks: 4
esentis/multiple_search_selection
A highly customizable multiple selection widget with fuzzy search functionality.
Language: Dart - Size: 2.92 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 11 - Forks: 13
vifon/autocomplete-ALL-the-things
Arbitrary text completion for urxvt. MAINTAINER NEEDED
Language: Perl - Size: 67.4 KB - Last synced: about 1 month ago - Pushed: almost 7 years ago - Stars: 64 - Forks: 3
plain-jane-gray/PFAS-web-and-PDF-scrape
Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.
Language: Jupyter Notebook - Size: 8.9 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
department-of-veterans-affairs/DAPM-PFAS-PACT-ACT
Scrapes hazardous waste data from a website and PDF file for PACT Act. Cleans the data to prepare it for mapping.
Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 0 - Forks: 1