Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: fuzzy-matching

SamiSieranoja/stridx

Fast fuzzy string similarity search and indexing (for filenames)

Language: C++ - Size: 549 KB - Last synced: about 8 hours ago - Pushed: about 9 hours ago - Stars: 1 - Forks: 0

google/unisim

UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.

Language: Python - Size: 8.07 MB - Last synced: about 12 hours ago - Pushed: 1 day ago - Stars: 77 - Forks: 3

bent10/boox

Search anything, instantly

Language: TypeScript - Size: 43.4 MB - Last synced: about 20 hours ago - Pushed: about 21 hours ago - Stars: 2 - Forks: 0

Genivia/FuzzyMatcher

Fast fuzzy regex matcher: specify max edit distance to find approximate matches

Language: C++ - Size: 44.9 KB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 35 - Forks: 6

J535D165/data-matching-software

A list of free data matching and record linkage software.

Size: 93.8 KB - Last synced: about 4 hours ago - Pushed: 3 months ago - Stars: 350 - Forks: 41

IDinsight/hindi-fuzzy-merge

Code repository with customisable Fuzzy Matching scripts in STATA and Python, especially useful when working with datasets containing Hindi text transliterated to English.

Language: Python - Size: 120 KB - Last synced: 3 days ago - Pushed: over 3 years ago - Stars: 5 - Forks: 0

SYSTRAN/fuzzy-match

Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.

Language: C++ - Size: 4.65 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 44 - Forks: 8

zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Language: Java - Size: 438 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 890 - Forks: 108

Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

Language: JavaScript - Size: 3.39 MB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 701 - Forks: 50

hanifabd/lexifuzz-ner

Python Package for Named Entity Recognition (NER) - Based on Dictionary and Fuzzy Matching (Lexical Fuzzy Named Entity Recognition)

Language: Python - Size: 1.8 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 7 - Forks: 1

wolfgarbe/SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: C# - Size: 12.1 MB - Last synced: 3 days ago - Pushed: about 2 months ago - Stars: 3,048 - Forks: 281

DanHarltey/Fastenshtein

The fastest .Net Levenshtein around

Language: C# - Size: 135 KB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 224 - Forks: 22

stephengtuggy/hippocratic-demographics

Library for processing human demographic data, licensed under the Hippocratic License

Language: Rust - Size: 60.5 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0

Genivia/RE-flex

A high-performance C++ regex library and lexical analyzer generator with Unicode support. Extends Flex++ with Unicode support, indent/dedent anchors, lazy quantifiers, functions for lex and syntax error reporting and more. Seamlessly integrates with Bison and other parsers.

Language: C++ - Size: 64.5 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 488 - Forks: 85

AI-team-UoA/pyJedAI

An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.

Language: Python - Size: 127 MB - Last synced: 6 days ago - Pushed: 25 days ago - Stars: 62 - Forks: 10

asabaylus/react-command-palette

An accessible browser compatible javascript command palette

Language: JavaScript - Size: 25.3 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 623 - Forks: 33

helix-editor/nucleo

A fast and convenient fuzzy matcher library for rust

Language: Rust - Size: 208 KB - Last synced: 9 days ago - Pushed: about 1 month ago - Stars: 719 - Forks: 23

corentinpla/Fraud-detection-in-a-complex-or-isolated-banking-system Fork of CorentinPernot/Statapp

Prove that the pooling of data from different banking players provides added value for the detection of fraudulent transactions.

Language: Jupyter Notebook - Size: 54.7 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0

matchms/matchms

Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.

Language: Python - Size: 38 MB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 165 - Forks: 57

Senzing/awesome

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Language: Python - Size: 230 KB - Last synced: about 9 hours ago - Pushed: about 1 month ago - Stars: 48 - Forks: 2

rlespinasse/wints

What I Need To See - a fuzzy term-based URLs opener

Language: Rust - Size: 191 KB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 0 - Forks: 0

gandersen101/spaczz

Fuzzy matching and more functionality for spaCy.

Language: Python - Size: 1.24 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 245 - Forks: 27

persian-tools/persian-tools

An anthology of a variety of tools for the Persian language in javascript

Language: TypeScript - Size: 4.04 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 1,024 - Forks: 113

indxSearch/indx-restapi-search

Typescript component to use Indx Search with RestAPI

Language: TypeScript - Size: 668 KB - Last synced: 2 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0

leeoniya/uFuzzy

A tiny, efficient fuzzy search that doesn't suck

Language: JavaScript - Size: 3.34 MB - Last synced: 16 days ago - Pushed: 3 months ago - Stars: 2,507 - Forks: 45

Christopher-Thornton/hmni

πŸ“› Fuzzy Name Matching with Machine Learning

Language: Python - Size: 21.2 MB - Last synced: 13 days ago - Pushed: 3 months ago - Stars: 243 - Forks: 50

maxharlow/csvmatch

πŸ”Ž Finds fuzzy matches between CSV files

Language: Python - Size: 118 KB - Last synced: 7 days ago - Pushed: about 1 month ago - Stars: 174 - Forks: 21

scossin/iamsystem_python

Fast dictionary-based approach for semantic annotation / entity linking

Language: Python - Size: 389 KB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 4 - Forks: 1

Mr-G254/FuzzyLogic

In the sample code provided I have used Java to compare two given strings getting the percentage match using the Fuzzy Logic.

Language: Java - Size: 6.84 KB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 0 - Forks: 0

teamtnt/tntsearch

A fully featured full text search engine written in PHP

Language: PHP - Size: 7.71 MB - Last synced: 18 days ago - Pushed: 3 months ago - Stars: 3,039 - Forks: 287

Valires/er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Language: Python - Size: 62.4 MB - Last synced: 18 days ago - Pushed: 6 months ago - Stars: 22 - Forks: 3

beacoder/org-ivy-search

Full text search for org files.

Language: Emacs Lisp - Size: 2.14 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 3 - Forks: 1

ChrisMuir/refinr

Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms

Language: C++ - Size: 287 KB - Last synced: 5 days ago - Pushed: 2 months ago - Stars: 102 - Forks: 5

lotabout/fuzzy-matcher

Fuzzy Matching Library for Rust

Language: Rust - Size: 66.4 KB - Last synced: about 3 hours ago - Pushed: 3 months ago - Stars: 240 - Forks: 16

krisk/fuse-swift πŸ“¦

A lightweight fuzzy-search library, with zero dependencies

Language: Swift - Size: 117 KB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 921 - Forks: 110

lewinfox/levitate

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

Language: R - Size: 467 KB - Last synced: 22 days ago - Pushed: 8 months ago - Stars: 33 - Forks: 2

life4/textdistance.rs

πŸ¦€πŸ“ Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

Language: Rust - Size: 206 KB - Last synced: 16 days ago - Pushed: 12 months ago - Stars: 252 - Forks: 9

taleinat/fuzzysearch

Find parts of long text or data, allowing for some changes/typos.

Language: Python - Size: 976 KB - Last synced: 9 days ago - Pushed: over 1 year ago - Stars: 282 - Forks: 24

Vivino/go-autocomplete-trie

go-autocomplete-trie is a data structure for text auto completion that allows for fuzzy matching and configurable levenshtein distance limits

Language: Go - Size: 16.6 KB - Last synced: 18 days ago - Pushed: about 1 year ago - Stars: 29 - Forks: 6

indxSearch/indx-restapi-load

Typescript component to load data to Indx Search with RestAPI

Language: TypeScript - Size: 957 KB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 0 - Forks: 0

JohnnyBravo75/TwinFinder

fuzzy data matching

Language: C# - Size: 3.48 MB - Last synced: 21 days ago - Pushed: over 6 years ago - Stars: 13 - Forks: 5

persian-tools/react-persian-tools

React wrapper component around Persian tools

Language: TypeScript - Size: 2.99 MB - Last synced: 2 months ago - Pushed: 5 months ago - Stars: 32 - Forks: 2

nol13/fuzzball.js

Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.

Language: JavaScript - Size: 7.48 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 501 - Forks: 40

mammothb/symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Language: Python - Size: 5.76 MB - Last synced: 21 days ago - Pushed: about 2 months ago - Stars: 766 - Forks: 116

proycon/analiticcl

an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction

Language: Rust - Size: 2.45 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 26 - Forks: 4

dodona-edu/dolos

:detective: Source code plagiarism detection

Language: TypeScript - Size: 39.5 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 206 - Forks: 25

wyndow/fuzzywuzzy

Fuzzy string matching for PHP

Language: PHP - Size: 3.91 KB - Last synced: 14 days ago - Pushed: about 4 years ago - Stars: 70 - Forks: 23

RobinL/fuzzymatcher

Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4

Language: Python - Size: 848 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 280 - Forks: 60

mixpanel/fuzzbunny

Fast fuzzy string searching/matching/highlighting

Language: JavaScript - Size: 2.14 MB - Last synced: 13 days ago - Pushed: 10 months ago - Stars: 26 - Forks: 6

rosette-api/java

Rosette API Client Library for Java

Language: Java - Size: 64.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 10 - Forks: 35

benjamrio/company-names-matching

Algorithm matching company names.

Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0

rosette-api/R-Binding

R client binding for the Rosette API

Language: R - Size: 1.05 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 8

moj-analytical-services/splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Language: Python - Size: 89.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1,072 - Forks: 126

Yggdroot/LeaderF

An efficient fuzzy finder that helps to locate files, buffers, mrus, gtags, etc. on the fly for both vim and neovim.

Language: Python - Size: 2.23 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2,095 - Forks: 170

leo-arch/fnf

A simple fuzzy finder for the terminal

Language: C - Size: 170 KB - Last synced: 18 days ago - Pushed: about 2 months ago - Stars: 13 - Forks: 2

chrislit/abydos

Abydos NLP/IR library for Python

Language: Python - Size: 52.4 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 172 - Forks: 32

HellstromIT/go-quickswitch

Quickly jump between git repositories on your filesystem

Language: Go - Size: 154 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

amiegirl/Analysis_of_AMCAT_Data

The focus of this study is on certain groups of graduates in computer science and engineering in order to better understand the unique job issues they confront.

Language: Jupyter Notebook - Size: 7.68 MB - Last synced: 11 days ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

OlivierBinette/simple-typo-tolerant-search

Efficient typo-tolerant search in 76 lines of code, with no dependencies.

Language: Python - Size: 243 KB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

znwang25/fuzzychinese

A small package to fuzzy match chinese words

Language: Python - Size: 1.81 MB - Last synced: 15 days ago - Pushed: about 1 year ago - Stars: 69 - Forks: 9

aslilac/spirits

Get the spirit of a string, without the whole thing!

Language: TypeScript - Size: 347 KB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

iomega/spec2vec

Word2Vec based similarity measure of mass spectrometry data.

Language: Python - Size: 21.4 MB - Last synced: 16 days ago - Pushed: 9 months ago - Stars: 57 - Forks: 14

orangain/json-fuzzy-match

Custom assertion to check whether a JSON string fuzzily matches a pattern for JVM languages.

Language: Kotlin - Size: 218 KB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 9 - Forks: 0

ii14/fzx

A fuzzy finder, based on fzy

Language: C++ - Size: 299 KB - Last synced: 18 days ago - Pushed: 24 days ago - Stars: 6 - Forks: 3

jonathanmorris180/apex-fuzzy-finder

A fuzzy finder for Apex.

Language: Apex - Size: 279 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 8 - Forks: 2

dvsh243/Seekr

in-memory fuzzy matching

Language: Python - Size: 28.7 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

kyr0/clientside-search

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

Language: TypeScript - Size: 1.58 MB - Last synced: 17 days ago - Pushed: 10 months ago - Stars: 6 - Forks: 0

wooorm/levenshtein.c

Levenshtein algorithm in C

Language: C - Size: 22.5 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 80 - Forks: 14

patrickdet/fuzzy_compare

A fuzzy string comparison library for Elixir

Language: Elixir - Size: 13.7 KB - Last synced: about 19 hours ago - Pushed: 4 months ago - Stars: 21 - Forks: 4

luqmanoop/use-command-score

Tiny, fast fuzzy ⚑️earch for React applications

Language: TypeScript - Size: 34.2 KB - Last synced: 12 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

lpimem/ds_handson

Data Science Hands-on Experiments

Language: Jupyter Notebook - Size: 16.2 MB - Last synced: 2 months ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0

pravigo/fuzzy-matching

Fuzzy matching recipe for Local Authority datasets

Size: 584 KB - Last synced: 2 months ago - Pushed: over 6 years ago - Stars: 5 - Forks: 1

PhilaController/schuylkill

Fixing human errors by matching those hard-to-spell words.

Language: Python - Size: 66.4 KB - Last synced: 25 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

JonathanReeve/text-matcher

A simple text reuse detection CLI tool.

Language: Python - Size: 67.4 KB - Last synced: 16 days ago - Pushed: 11 months ago - Stars: 120 - Forks: 24

maxharlow/textmatch

πŸ”Ž Finds fuzzy matches between datasets

Language: Python - Size: 82 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 0

BishopFox/GitGot

Semi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.

Language: Python - Size: 189 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1,361 - Forks: 201

OlivierBinette/StringCompare

Efficient String Comparison Functions and Fuzzy String Matching

Language: Python - Size: 3 MB - Last synced: 18 days ago - Pushed: about 2 years ago - Stars: 16 - Forks: 2

simonschoe/fuzzy-name-match

Fuzzy match entity names (primarily persons and companies) across databases

Language: Python - Size: 211 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 1

persian-tools/vue-persian-tools

Persian tools wrapper for vue.js

Language: TypeScript - Size: 3.45 MB - Last synced: 29 days ago - Pushed: 5 months ago - Stars: 25 - Forks: 2

fritshermans/deduplipy

Python package for deduplication/entity resolution using active learning

Language: Python - Size: 520 KB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 71 - Forks: 9

gustaveWPM/OC-Kasa πŸ“¦

React webapp made during an OpenClassrooms bootcamp. "Raw React" project (React libs disallowed).

Language: TypeScript - Size: 10.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

gustaveWPM/Typescript-Damerau-Levenshtein πŸ“¦

Just another Damerau Levenshtein distance implementation, made in TypeScript

Language: TypeScript - Size: 2.93 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

adellegia/GetHelp

GetHelp is envisioned to be a platform that connects students of the Hertie School with each other in times of urgent need!

Language: Python - Size: 508 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

rosette-api/python

Rosette API Client Library for Python

Language: Python - Size: 1.63 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 37 - Forks: 91

wowinter13/fast_fuzzy_matcher

A tiny and blazing-fast fuzzy search in pure Ruby with FFI bindings to Go.

Language: Go - Size: 958 KB - Last synced: 8 days ago - Pushed: 4 months ago - Stars: 2 - Forks: 0

delonnewman/mini-levenshtein

Simple, fast Levenshtein distance and similarity ratio for Ruby

Language: C - Size: 57.6 KB - Last synced: 9 days ago - Pushed: 5 months ago - Stars: 24 - Forks: 0

DavidMoraisFerreira/FuzzyWuzzy.pas

Fuzzy String Matching in Free Pascal - Port of FuzzyWuzzy

Language: Pascal - Size: 3.91 KB - Last synced: 26 days ago - Pushed: about 5 years ago - Stars: 16 - Forks: 0

schollz/closestmatch

Golang library for fuzzy matching within a set of strings :page_with_curl:

Language: Go - Size: 641 KB - Last synced: 17 days ago - Pushed: over 1 year ago - Stars: 416 - Forks: 53

deductiv/fuzzylookup

Fuzzlookup search command for Splunk. Use fuzzy logic to enrich search results using near-matches in your lookups.

Language: Python - Size: 195 KB - Last synced: 18 days ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1

theoparis/fzy πŸ“¦

A fork of fzy (MIRROR)

Language: C - Size: 358 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

datahappy1/go_fuzzymatch_webapp

Fuzzster - fuzzy matching web application

Language: Go - Size: 1.3 MB - Last synced: 4 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

ripxorip/bolt.nvim

⚑ Ultrafast multi-pane file manager for Neovim with fuzzy matching

Language: Python - Size: 487 KB - Last synced: 3 months ago - Pushed: almost 4 years ago - Stars: 107 - Forks: 3

xdrop/fuzzywuzzy

Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java

Language: Java - Size: 415 KB - Last synced: 4 months ago - Pushed: 10 months ago - Stars: 758 - Forks: 112

madhurima-nath/nlp_fuzzy_match_algorithms πŸ“¦

Language: Jupyter Notebook - Size: 503 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 12 - Forks: 14

salman-abedin/faint

Extensible TUI fuzzy file file explorer

Language: Shell - Size: 8.6 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 94 - Forks: 0

mnowotnik/fzshell

Fuzzy shell completions you didn't know you needed

Language: Go - Size: 93.8 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 73 - Forks: 4

esentis/multiple_search_selection

A highly customizable multiple selection widget with fuzzy search functionality.

Language: Dart - Size: 2.92 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 11 - Forks: 13

vifon/autocomplete-ALL-the-things

Arbitrary text completion for urxvt. MAINTAINER NEEDED

Language: Perl - Size: 67.4 KB - Last synced: about 1 month ago - Pushed: almost 7 years ago - Stars: 64 - Forks: 3

plain-jane-gray/PFAS-web-and-PDF-scrape

Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.

Language: Jupyter Notebook - Size: 8.9 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

department-of-veterans-affairs/DAPM-PFAS-PACT-ACT

Scrapes hazardous waste data from a website and PDF file for PACT Act. Cleans the data to prepare it for mapping.

Language: Jupyter Notebook - Size: 15.1 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 0 - Forks: 1