GitHub topics: jaro-winkler
tonytonyjan/jaro_winkler
Ruby & C implementation of Jaro-Winkler distance algorithm which supports UTF-8 string.
Language: Ruby - Size: 202 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 202 - Forks: 33

thaumant/eddie
Language: Rust - Size: 125 KB - Last synced at: about 10 hours ago - Pushed at: over 5 years ago - Stars: 20 - Forks: 1

joel6666666666555555555555553333333333/kawesearch
This repository contains a fuzzy search library that provides a flexible way to search and match strings using various fuzzy matching algorithms. It supports multiple languages and allows for customization of search options such as algorithms, thresholds, and synonym handling.
Size: 1000 Bytes - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

rapidfuzz/strsim-rs
:abc: Rust implementations of string similarity metrics
Language: Rust - Size: 702 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 441 - Forks: 42

feature23/StringSimilarity.NET
A .NET port of java-string-similarity
Language: C# - Size: 519 KB - Last synced at: 17 days ago - Pushed at: 12 months ago - Stars: 478 - Forks: 72

lignum-vitae/goombay
Python implementation of several sequence alignment algorithms such as Waterman-Smith-Beyer, Gotoh, and Needleman-Wunsch intended to calculate distance, show alignment, and display the underlying matrices.
Language: Python - Size: 270 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 2

anirbanmu/str_metrics
Ruby gem (native extension in Rust) providing implementations of various string metrics
Language: Ruby - Size: 94.7 KB - Last synced at: 15 days ago - Pushed at: about 3 years ago - Stars: 77 - Forks: 2

rapidfuzz/JaroWinkler
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
Language: Python - Size: 105 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 72 - Forks: 5

jamesturk/jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
Language: Jupyter Notebook - Size: 3.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 2,139 - Forks: 159

adrg/strutil
Go metrics for calculating string similarity and other string utility functions
Language: Go - Size: 112 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 381 - Forks: 25

aallam/string-similarity-kotlin
Kotlin multiplatform library offering various algorithms to measure string similarity and distance
Language: Kotlin - Size: 345 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 2

dbousque/batch_jaro_winkler
Fast batch jaro winkler distance implementation in C99 with Ruby, OCaml and Python bindings.
Language: C - Size: 4.15 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 28 - Forks: 4

life4/textdistance.rs
🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.
Language: Rust - Size: 271 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 295 - Forks: 11

moshetanzer/text-toolbox
A high-performance TypeScript library for string similarity, distance algorithms, and text normalization utilities
Language: TypeScript - Size: 280 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

eulerto/pg_similarity
set of functions and operators for executing similarity queries
Language: C - Size: 164 KB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 375 - Forks: 41

tdebatty/java-string-similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Language: Java - Size: 729 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 2,715 - Forks: 417

hbollon/go-edlib
📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
Language: Go - Size: 76.2 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 514 - Forks: 26

Hasan-Kilici/kawesearch
This repository contains a fuzzy search library that provides a flexible way to search and match strings using various fuzzy matching algorithms. It supports multiple languages and allows for customization of search options such as algorithms, thresholds, and synonym handling.
Language: JavaScript - Size: 69.3 KB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

vanessaklee/akin
A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.
Language: Elixir - Size: 8.47 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 4

HamedFathi/SimMetricsCore
A text similarity metric library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro, etc) to other metrics, (e.g Soundex, Chapman). This library is compiled based on the .NET standard with a lot of useful extension methods.
Language: C# - Size: 37.1 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 7

lexmag/simetric
String similarity metrics for Elixir
Language: Elixir - Size: 11.7 KB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 62 - Forks: 7

bezzad/StringComparison
String Comparision in C#.NET
Language: C# - Size: 521 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 8

dexyk/stringosim
String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Language: Go - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 61 - Forks: 8

scmmishra/pico-search
A pico sized (> 1kb) typo-tolerant word matching library
Language: TypeScript - Size: 28.3 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 2

skyra-project/jaro-winkler
A string similarity utility that uses the Jaro-Winkler algorithm
Language: TypeScript - Size: 3.43 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 5 - Forks: 3

Dynom/TySug
A project around helping to prevent typing typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts
Language: Go - Size: 440 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 3

MrPowers/ceja
PySpark phonetic and string matching algorithms
Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 6

fevieira27/DeezerAnalysisAI-R
An R script that uses AI for data analysis on Deezer playlists, like looking for fuzzy duplicates, rank of genre and artists.
Language: R - Size: 239 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

MrPowers/spark-stringmetric
Spark functions to run popular phonetic and string matching algorithms
Language: Scala - Size: 457 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 60 - Forks: 6

alextanhongpin/stringdist
String metrics function in golang (levenshtein, damerau-levenshtein, jaro, jaro-winkler and additionally bk-tree) for autocorrect
Language: Go - Size: 37.1 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 0

astrixgame/jaro-winkler
Jaro Winkler Distance is a algorithm that can be used to compare two strings of sentences and get the score in percent how much is similar.
Language: JavaScript - Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

joshuaclayton/jaro_winkler
A fast implementation of Jaro-Winkler distance comparing two &str values.
Language: Rust - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

vickumar1981/stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Language: Scala - Size: 1.27 MB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 78 - Forks: 14

a-poor/jarowinkler
An implementation of the Jaro-Winkler string similarity algorithm in Go.
Language: Go - Size: 10.7 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

rockymadden/stringmetric 📦
:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
Language: Scala - Size: 2.07 MB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 485 - Forks: 81

VanekPetr/text-similarity-ranking
Algorithm to rank text similarity between set of strings and given inputs
Language: Python - Size: 128 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tcrouch/edits
Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Language: Ruby - Size: 68.4 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

tcrouch/edits.cr
Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Language: Crystal - Size: 96.7 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

kwunshing123/jaro-winkler-typescript
Jaro-Winkler typescript implementation
Language: TypeScript - Size: 68.4 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

OlivierBinette/StringCompare
Efficient String Comparison Functions and Fuzzy String Matching
Language: Python - Size: 3 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 2

vasgat/jSimilarity
jSimilarity is a library that implements various similarity measures
Language: Java - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

ogus/jaro-winkler
A measure of distance between words with the Jaro-Winkler algorithm
Language: JavaScript - Size: 7.81 KB - Last synced at: 19 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

3urobeat/spotify-playlist-similarity-checker
⚖️ Fetches all song titles in your playlist and compares them to find duplicates. Supports multithreading!
Language: JavaScript - Size: 50.8 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

addisoncox/ffzf
fast fuzzy string matching for Python
Language: Rust - Size: 249 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

millionseleshi/lara-searchtable
Search database using jaro-winkler string similarity against a query
Language: PHP - Size: 102 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

sahibul-nf/aceh-dictionary-api
Aceh language dictionary API created using Golang
Language: Go - Size: 483 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

schiffma/distlib
Distance related functions (Damerau-Levenshtein, Jaro-Winkler , longest common substring & subsequence) implemented as SQLite run-time loadable extension. Any UTF-8 strings are supported.
Language: C - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 16 - Forks: 1

d8zerg/jaro_winkler
Jaro-Winkler distance
Language: C++ - Size: 3.91 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

medric49/w_distances
Just some implementations of word distance functions.
Language: Python - Size: 31.3 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sharannk93/Matching-algorithm
The main purpose of this project was to develop a matching algorithm in python to fuzzy classify people from a customer list as positive or negative based on a messy positive and negative database with a confidence score.
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

UjjwalAyyangar/go-jellyfish
Go port of the python jellyfish module for approximate and phonetic matching of strings.
Language: Go - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

kaiyoo/NLP-Detect-Lexical-Blends
Detect lexical blending using similarity by approximate string matching and word vectors
Language: Python - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

bmiller1009/fuzzy-row-matcher
Framework for finding similar rows in a JDBC source
Language: Kotlin - Size: 1.26 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

enginyoyen/strmetric
String metrics libary for Go lang. (e.g. Dice/Sorensen, Hamming, Jaro, Jaro-Winkler, Levenshtein)
Language: Go - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0
