An open API service providing repository metadata for many open source software ecosystems.

Topic: "jaro-winkler-distance"

tonytonyjan/jaro_winkler

Ruby & C implementation of Jaro-Winkler distance algorithm which supports UTF-8 string.

Language: Ruby - Size: 202 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 202 - Forks: 33

hedzr/cmdr

POSIX-compliant command-line UI (CLI) parser and Hierarchical-configuration operations

Language: Go - Size: 3.08 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 140 - Forks: 10

vickumar1981/stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..

Language: Scala - Size: 1.27 MB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 80 - Forks: 14

mrkkrp/text-metrics

Calculate various string metrics efficiently in Haskell

Language: Haskell - Size: 122 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 44 - Forks: 4

hedzr/cmdr-cxx

cmdr cxx version, a C++17/20 header-only command-line parser with hierarchical config data manager here

Language: C++ - Size: 1.61 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 17 - Forks: 2

schiffma/distlib

Distance related functions (Damerau-Levenshtein, Jaro-Winkler , longest common substring & subsequence) implemented as SQLite run-time loadable extension. Any UTF-8 strings are supported.

Language: C - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 16 - Forks: 1

sumn2u/string-comparisons

A collection of string comparisons algorithms

Language: JavaScript - Size: 700 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 5

IlluminatiFish/ByteCog

A way to analyse how malware and/or goodware samples vary from each other using Shannon Entropy, Hausdorff Distance and Jaro-Winkler Distance

Language: Python - Size: 200 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 5

HamedFathi/SimMetricsCore

A text similarity metric library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro, etc) to other metrics, (e.g Soundex, Chapman). This library is compiled based on the .NET standard with a lot of useful extension methods.

Language: C# - Size: 37.1 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 7

nilaksh/qwerty-jaro-winkler

Qwerty-Jaro–Winkler distance is a tweak on top of jaro winkler edit distance where we try to consider distance between keys in keyboard while calculating number of matches.

Language: Java - Size: 4.88 KB - Last synced at: 16 days ago - Pushed at: almost 8 years ago - Stars: 6 - Forks: 4

hansalemaos/fuzzmatch

Fast Batch String Matching in Python (Levenshtein, Jaro-Winkler, Hamming) with Zero Cache Misses - made for Python, written in C++

Language: C++ - Size: 7.49 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

hedzr/Cmdr.Core

Useful POSIX command line arguments parser for .Net. Hierarchical-configuration Store for app.

Language: C# - Size: 658 KB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

GatoY/Spelling-Correction-Methods

Comparison among four spelling correction methods. n-gram, Levenshtein, Jaro, Jaro_winkler

Language: Python - Size: 1.08 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 2

OlegBash599/ZC8A_001

ABAP SimilaritySearch with HANA and Oracle

Language: ABAP - Size: 188 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

lucasmenendez/gobstract

Simple automatic abstract text generator

Language: Go - Size: 75.2 KB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

fredeil/FuzzyMatcher 📦

Fuzzy-Matching algorithm using Jaro-Winkler distance for measuring similarities in strings

Language: F# - Size: 11.7 KB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

sanket143/Jaro-Winkler

Jaro-Winkler Distance in C++

Language: C++ - Size: 8.79 KB - Last synced at: 13 days ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

arpitj938/string-similarity

Language: Python - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

taKana671/PyTextDistance

Cython extension modules for Levenshtein-distance, Jaro-Winkler-distance, Damerau-Levenshtein-distance, Hamming-distance

Language: Python - Size: 1.05 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

nikhitmago/entity-linking

Matching records by linking entities using string and data matching

Language: Python - Size: 44.9 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

sharannk93/Matching-algorithm

The main purpose of this project was to develop a matching algorithm in python to fuzzy classify people from a customer list as positive or negative based on a messy positive and negative database with a confidence score.

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Khojasteh/assorted-utils

A set of assorted helper functions, extensions and classes in .NET Standard

Language: C# - Size: 468 KB - Last synced at: 3 days ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

fevieira27/DeezerAnalysisAI-R

An R script that uses AI for data analysis on Deezer playlists, like looking for fuzzy duplicates, rank of genre and artists.

Language: R - Size: 239 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ufukhalis/k-string-sim

String similarity library for Kotlin

Language: Kotlin - Size: 68.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jo-so/word-dist

Computes the pairwise similarity between multiple words

Language: Rust - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

astrixgame/jaro-winkler

Jaro Winkler Distance is a algorithm that can be used to compare two strings of sentences and get the score in percent how much is similar.

Language: JavaScript - Size: 6.84 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kaiyoo/NLP-Detect-Lexical-Blends

Detect lexical blending using similarity by approximate string matching and word vectors

Language: Python - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

pedromagician/String-Similarity-Algorithms

String Similarity Algorithms

Language: BASIC - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

mrkkrp/string_metrics

Collection of string metric algorithms in OCaml

Language: OCaml - Size: 9.77 KB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Related Topics
levenshtein-distance 15 jaro-distance 10 jaro-winkler 8 hamming-distance 7 nlp 4 damerau-levenshtein-distance 4 jaccard-similarity 4 python 4 cli 3 cmdr 3 argument-parsing 3 argument-parser 3 fuzzy-matching 3 command-line 3 string-similarity 3 command-line-interface 3 command-line-parser 3 levenshtein 3 damerau-levenshtein 3 cosine-similarity 3 cython 2 posix-compliant 2 hierarchical-data 2 machine-learning 2 csharp 2 commandline 2 commandlineparser 2 dotnet 2 sorensen-dice-distance 2 posix 2 posix-compatible 2 golang 2 jaro 2 longest-common-subsequence 2 edit-distance 2 similarity-measures 2 edit-distance-algorithm 2 string-metrics 2 fuzzywuzzy 1 fast 1 cpp 1 measure 1 fuzzy-matching-algorithm 1 fsharp 1 covid-19 1 corona 1 songs 1 r 1 playlists 1 playlist-management 1 playlist-analysis 1 playlist 1 music 1 fuzzy-search 1 metric 1 metrics 1 duplicate-detection 1 deezer-playlist 1 deezer-music-api 1 similarity 1 deezer-api 1 deezer 1 ai 1 similarity-score 1 hierarchy-configurations 1 getopt 1 fossa-status 1 commandline-interface 1 commandline-arguments 1 cmdr-addons 1 soundex-algorithm 1 soundex 1 ngram 1 jaccard 1 hacktoberfest 1 subcommand 1 dice-coefficient 1 cosine-similarity-scores 1 ocaml 1 posix-command 1 nuget 1 algorithm 1 hierarchical-configuration 1 ruby 1 haskell 1 fuzzy 1 cmdline-parser 1 cmdline 1 arguments-parser 1 string 1 rapidfuzz 1 matching 1 string-distance 1 string-comparison 1 smith-waterman 1 javascript 1 malware-detection 1 distance 1 algorithms 1 summarization 1