An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: jaro-winkler

tonytonyjan/jaro_winkler

Ruby & C implementation of the Jaro-Winkler distance algorithm, with support for UTF-8 strings.

Language: Ruby - Size: 202 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 202 - Forks: 33

thaumant/eddie

Language: Rust - Size: 125 KB - Last synced at: about 10 hours ago - Pushed at: over 5 years ago - Stars: 20 - Forks: 1

joel6666666666555555555555553333333333/kawesearch

This repository contains a fuzzy search library that provides a flexible way to search and match strings using various fuzzy matching algorithms. It supports multiple languages and allows for customization of search options such as algorithms, thresholds, and synonym handling.

Size: 1000 Bytes - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

rapidfuzz/strsim-rs

🔤 Rust implementations of string similarity metrics

Language: Rust - Size: 702 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 441 - Forks: 42

feature23/StringSimilarity.NET

A .NET port of java-string-similarity

Language: C# - Size: 519 KB - Last synced at: 17 days ago - Pushed at: 12 months ago - Stars: 478 - Forks: 72

lignum-vitae/goombay

Python implementation of several sequence alignment algorithms such as Waterman-Smith-Beyer, Gotoh, and Needleman-Wunsch intended to calculate distance, show alignment, and display the underlying matrices.

Language: Python - Size: 270 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 2

anirbanmu/str_metrics

Ruby gem (native extension in Rust) providing implementations of various string metrics

Language: Ruby - Size: 94.7 KB - Last synced at: 15 days ago - Pushed at: about 3 years ago - Stars: 77 - Forks: 2

rapidfuzz/JaroWinkler

Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

Language: Python - Size: 105 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 72 - Forks: 5

jamesturk/jellyfish

🪼 a python library for doing approximate and phonetic matching of strings.

Language: Jupyter Notebook - Size: 3.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 2,139 - Forks: 159

adrg/strutil

Go metrics for calculating string similarity and other string utility functions

Language: Go - Size: 112 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 381 - Forks: 25

aallam/string-similarity-kotlin

Kotlin multiplatform library offering various algorithms to measure string similarity and distance

Language: Kotlin - Size: 345 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 2

dbousque/batch_jaro_winkler

Fast batch Jaro-Winkler distance implementation in C99 with Ruby, OCaml and Python bindings.

Language: C - Size: 4.15 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 28 - Forks: 4

life4/textdistance.rs

🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

Language: Rust - Size: 271 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 295 - Forks: 11

moshetanzer/text-toolbox

A high-performance TypeScript library for string similarity, distance algorithms, and text normalization utilities

Language: TypeScript - Size: 280 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

eulerto/pg_similarity

set of functions and operators for executing similarity queries

Language: C - Size: 164 KB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 375 - Forks: 41

tdebatty/java-string-similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...

Language: Java - Size: 729 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 2,715 - Forks: 417

hbollon/go-edlib

📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.

Language: Go - Size: 76.2 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 514 - Forks: 26

Hasan-Kilici/kawesearch

This repository contains a fuzzy search library that provides a flexible way to search and match strings using various fuzzy matching algorithms. It supports multiple languages and allows for customization of search options such as algorithms, thresholds, and synonym handling.

Language: JavaScript - Size: 69.3 KB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

vanessaklee/akin

A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.

Language: Elixir - Size: 8.47 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 4

HamedFathi/SimMetricsCore

A text similarity metric library, ranging from edit distances (Levenshtein, Gotoh, Jaro, etc.) to other metrics (e.g., Soundex, Chapman). This library is compiled against .NET Standard and includes many useful extension methods.

Language: C# - Size: 37.1 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 7

lexmag/simetric

String similarity metrics for Elixir

Language: Elixir - Size: 11.7 KB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 62 - Forks: 7

bezzad/StringComparison

String Comparison in C#.NET

Language: C# - Size: 521 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 8

dexyk/stringosim

String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...

Language: Go - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 61 - Forks: 8

scmmishra/pico-search

A pico sized (> 1kb) typo-tolerant word matching library

Language: TypeScript - Size: 28.3 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 2

skyra-project/jaro-winkler

A string similarity utility that uses the Jaro-Winkler algorithm

Language: TypeScript - Size: 3.43 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 5 - Forks: 3

Dynom/TySug

A project around helping to prevent typing typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts

Language: Go - Size: 440 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 3

MrPowers/ceja

PySpark phonetic and string matching algorithms

Language: Python - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 6

fevieira27/DeezerAnalysisAI-R

An R script that uses AI for data analysis on Deezer playlists, like looking for fuzzy duplicates, rank of genre and artists.

Language: R - Size: 239 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

MrPowers/spark-stringmetric

Spark functions to run popular phonetic and string matching algorithms

Language: Scala - Size: 457 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 60 - Forks: 6

alextanhongpin/stringdist

String metrics function in golang (levenshtein, damerau-levenshtein, jaro, jaro-winkler and additionally bk-tree) for autocorrect

Language: Go - Size: 37.1 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 0

astrixgame/jaro-winkler

Jaro-Winkler distance is an algorithm that can be used to compare two strings or sentences and get a percentage score of how similar they are.

Language: JavaScript - Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

joshuaclayton/jaro_winkler

A fast implementation of Jaro-Winkler distance comparing two &str values.

Language: Rust - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

vickumar1981/stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.

Language: Scala - Size: 1.27 MB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 78 - Forks: 14

a-poor/jarowinkler

An implementation of the Jaro-Winkler string similarity algorithm in Go.

Language: Go - Size: 10.7 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

rockymadden/stringmetric 📦

🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

Language: Scala - Size: 2.07 MB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 485 - Forks: 81

VanekPetr/text-similarity-ranking

Algorithm to rank text similarity between a set of strings and given inputs

Language: Python - Size: 128 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tcrouch/edits

Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

Language: Ruby - Size: 68.4 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

tcrouch/edits.cr

Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

Language: Crystal - Size: 96.7 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

kwunshing123/jaro-winkler-typescript

Jaro-Winkler typescript implementation

Language: TypeScript - Size: 68.4 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

OlivierBinette/StringCompare

Efficient String Comparison Functions and Fuzzy String Matching

Language: Python - Size: 3 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 2

vasgat/jSimilarity

jSimilarity is a library that implements various similarity measures

Language: Java - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

ogus/jaro-winkler

A measure of distance between words with the Jaro-Winkler algorithm

Language: JavaScript - Size: 7.81 KB - Last synced at: 19 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

3urobeat/spotify-playlist-similarity-checker

⚖️ Fetches all song titles in your playlist and compares them to find duplicates. Supports multithreading!

Language: JavaScript - Size: 50.8 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

addisoncox/ffzf

fast fuzzy string matching for Python

Language: Rust - Size: 249 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

millionseleshi/lara-searchtable

Search database using jaro-winkler string similarity against a query

Language: PHP - Size: 102 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

sahibul-nf/aceh-dictionary-api

Aceh language dictionary API created using Golang

Language: Go - Size: 483 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

schiffma/distlib

Distance related functions (Damerau-Levenshtein, Jaro-Winkler , longest common substring & subsequence) implemented as SQLite run-time loadable extension. Any UTF-8 strings are supported.

Language: C - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 16 - Forks: 1

d8zerg/jaro_winkler

Jaro-Winkler distance

Language: C++ - Size: 3.91 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

medric49/w_distances

Just some implementations of word distance functions.

Language: Python - Size: 31.3 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sharannk93/Matching-algorithm

The main purpose of this project was to develop a matching algorithm in Python that fuzzily classifies people from a customer list as positive or negative, with a confidence score, based on a messy positive and negative database.

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

UjjwalAyyangar/go-jellyfish

Go port of the python jellyfish module for approximate and phonetic matching of strings.

Language: Go - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

kaiyoo/NLP-Detect-Lexical-Blends

Detect lexical blending using similarity by approximate string matching and word vectors

Language: Python - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

bmiller1009/fuzzy-row-matcher

Framework for finding similar rows in a JDBC source

Language: Kotlin - Size: 1.26 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

enginyoyen/strmetric

String metrics library for Go (e.g. Dice/Sorensen, Hamming, Jaro, Jaro-Winkler, Levenshtein)

Language: Go - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Related Keywords
jaro-winkler (54), jaro (20), levenshtein (20), damerau-levenshtein (15), levenshtein-distance (15), edit-distance (10), distance (9), hamming-distance (9), jaro-winkler-distance (8), hamming (8), similarity (7), string-similarity (7), string-distance (7), similarity-measures (6), soundex (6), string-comparison (6), string-matching (6), metaphone (6), golang (6), jaccard (6), fuzzy-matching (6), nlp (5), longest-common-subsequence (5), cosine-similarity (5), algorithm (5), python (4), hacktoberfest (4), text (4), fuzzy-search (4), search (4), string-metrics (4), algorithms (4), jaro-distance (4), string (4), sorensen-dice-distance (3), fuzzy-matching-algorithm (3), cosine (3), jaccard-similarity (3), n-gram (3), longest-common-substring (3), ruby (3), nysiis (3), winkler (3), go (3), fuzzy (2), damerau-levenshtein-distance (2), similarity-score (2), google-like-search (2), comparison (2), unicode (2), dice-coefficient (2), debounce-search (2), customizable (2), jaccard-index (2), cosine-distance (2), edit-distance-algorithm (2), playlist (2), overlap-coefficient (2), dotnet (2), lcs-distance (2), rust (2), shingles (2), suggestion (2), searching (2), strings (2), text-similarity (2), search-algorithm (2), needleman-wunsch (2), smith-waterman (2), double-metaphone (2), elixir (2), optimazation (2), natural-language-processing (2), sorensen-dice (2), language-support (2), cosine-similarity-scores (1), bk-tree (1), spark (1), ngram (1), autocorrect (1), refined-soundex (1), words (1), jaro-similarity (1), match-rating-comparisons (1), porter-stemmer (1), pyspark (1), ai (1), deezer (1), deezer-api (1), deezer-music-api (1), deezer-playlist (1), duplicate-detection (1), music (1), playlist-analysis (1), playlist-management (1), playlists (1), r (1), songs (1), fuzzy-score (1), soundex-algorithm (1)