An open API service providing repository metadata for many open source software ecosystems.

Topic: "similarity-metric"

rockymadden/stringmetric 📦

:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

Language: Scala - Size: 2.07 MB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 485 - Forks: 81

GT-RIPL/L2C

Learning to Cluster. A deep clustering strategy.

Language: Python - Size: 431 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 315 - Forks: 49

usc-isi-i2/rltk

Record Linkage ToolKit (Find and link entities)

Language: Python - Size: 9.59 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 110 - Forks: 23

agext/levenshtein

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.

Language: Go - Size: 23.4 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 89 - Forks: 8

victor-iyi/py-image-search-engine

Python Image Search Engine with OpenCV

Language: Python - Size: 3.14 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 43 - Forks: 19

itspawanbhardwaj/spark-fuzzy-matching

Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)

Language: Scala - Size: 92.8 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 24 - Forks: 11

oertl/treeminhash

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

Language: C++ - Size: 2.62 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 3

JohnnyBravo75/TwinFinder

fuzzy data matching

Language: C# - Size: 3.48 MB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 5

Olliang/Statistical-Similarity-Measurement

A methodology designed to validate the statistical similarity of synthetic data generated by GAN models. The metrics contain Auto-encoder, PCA, t-SNE, KL-divergence, Clustering, and Cosine Similarity.

Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 0

hechmik/word_mover_distance

Compute Word Mover's Distance using a generic word embedding model

Language: Python - Size: 15.6 KB - Last synced at: 29 days ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 0

safouaneelg/copulasimilarity

Official implementation of the paper: "CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment"

Language: Jupyter Notebook - Size: 54.7 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 8 - Forks: 0

vinalbagaria/virtualVidyalaya

Virtual Vidyalaya is a platform wherein continuous assessments, final exams and even mock exams can be conducted using web proctored environment with automatic assessments for mock exams including descriptive long answers. It provides personalized feedback after each exam and progress analysis for a class and a student.

Language: HTML - Size: 74.2 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 0

Paranioar/GSSF

[TIP2024] The code of "GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning"

Size: 5.86 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 0

MEGA-GO/MegaGO

Calculate semantic distance for sets of Gene Ontology terms

Language: Python - Size: 148 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

srogatch/TextMatching

Find similar text files in a repository and sort by similarity

Language: C++ - Size: 2.59 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 2

resilva87/stringmetric

String metrics and phonetic algorithms for Go

Language: Go - Size: 19.5 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

KMdsy/autowarp

Tensorflow implementation of paper ‘Autowarp: Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoder’ (NIPS18)

Language: Python - Size: 155 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

markomih/document-classification

Finding the most similar textual documents using Case-Based Reasoning

Language: Python - Size: 19.4 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 3

ltfschoen/ML-Predictions

Machine Learning engine generates predictions given any dataset using regression

Language: Python - Size: 2.51 MB - Last synced at: 4 months ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 2

bhavyanarang/Movie_Recommendation

Project was done as a part of Machine Learning (CSE343) at IIIT Delhi.

Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

scimas/QuoteAnalysis

Analysis of Mixed Quote Representation in News Sources

Language: Python - Size: 3.37 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 4

habedi/Similarity-Finder-Service 📦

A tutorial for creating a simple RESTful web-service to calculate how similar two strings are

Language: Python - Size: 1.41 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

mwegrzyn/thoughtExperiment

Code for the analyses of our fMRI mind-reading study

Language: Jupyter Notebook - Size: 183 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

olsenlabmit/Polymer-Ensemble-Similarity

Calculating Pairwise Similarity of Polymer Ensembles via Earth Mover’s Distance

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

Pradnya1208/Book-Recommendation-System--Traditional-approach

This project aims to build a Book recommendation system using methods such as Model, Collaborative, and Content-based filtering.

Language: Jupyter Notebook - Size: 74.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

alabbas-ali/Network-Protocols-Similarity

Language: Java - Size: 201 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

ghosind/go-similarity

Similarity or distance metrics for string implemented on Golang, inspired by Sam Chapman's SimMetrics library.

Language: Go - Size: 40 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

SiddhantChalke/Movie-Recommendation-System

A Movie Recommendation System that suggests movies based on input by Machine Learning algorithms.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

malwaredb/sdhash-rs

similarity digest hashing tool -- in Rust!

Language: C - Size: 55.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KeithTAllen/Katz-Similarity

Java Implementation of Katz Similarity to measure the difference between BFO compliant ontologies.

Language: Java - Size: 1.65 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rorevello/Mirrowl

Repository of original ontologies and mirror ontologies (synthetic ontologies that attempt to replicate the originals)

Size: 6.09 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Brandomon/CSC365-YelpDatasetSimilarBusinesses

Assignment 1 Java Code from my CSC365 Class at SUNY Oswego.

Language: Java - Size: 217 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

NilufaYeasmin/Similarity-Search-Metric-Spaces

In this project, I've developed an application of Similarity Search in Metric Spaces by using Inverted Files. | Nilufa Yeasmin | https://www.linkedin.com/in/nilufayeasmin/

Language: Python - Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

saibot94/sim-dictionary

Similarity based dictionary featuring a map of the world highlighting translations

Language: Python - Size: 98.6 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

elypaolazz/DataMining-Project

Data Mining course project

Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

riya-ingale/InternshipRecommendation

Internship Recommendation System

Language: HTML - Size: 339 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Juancard/parallel-and-distributed-IR

Information Retrieval using parallel algorithms on a distributed environment

Language: Java - Size: 990 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

sunnysai12345/Text-Similarity-Metrics

We will learn about different types of text similarity metrics in use and code them in python.

Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Topics
similarity 6 python 6 nlp 4 levenshtein 4 cosine-similarity 3 machine-learning 3 string-similarity 3 metrics 2 deep-learning 2 python3 2 similarity-measures 2 soundex 2 ontology 2 similarity-search 2 java 2 indexing 2 golang 2 clustering 2 jaccard 2 fuzzy-matching 2 collaborative-filtering 2 recommender-system 2 recommendation-system 2 data-science 2 root-mean-squared-error-metric 1 aws-lambda 1 binary-classification 1 continous-integration 1 correlation 1 unittest 1 cross-validation 1 hyperparameter-optimization 1 multivariate-models 1 k-means-clustering 1 scikit-learn 1 knn-regression 1 prediction-algorithm 1 scipy 1 linear-regression 1 logistic-regression 1 matplotlib 1 multi-classify-with-sklearn 1 pandas-dataframes 1 case-based-reasoning 1 distance 1 hamming 1 jaro 1 jaro-winkler 1 metaphone 1 n-gram 1 nysiis 1 overlap 1 phonetic-algorithms 1 sorensen 1 algorithm 1 apache-spark 1 scala 1 gensim 1 glove 1 word-embeddings 1 extract 1 network-protocol 1 protocol 1 deduplication 1 entity-resolution 1 linkage 1 record-linkage 1 issm 1 math 1 probability-theory 1 quality-assessment 1 ssim 1 bioinformatics 1 gene-ontology-terms 1 methodology 1 synthetic-data 1 hash-algorithm 1 jaccard-coefficient 1 jaccard-distance 1 jaccard-index 1 jaccard-similarity 1 jaccard-similarity-estimation 1 locality-sensitive 1 locality-sensitive-hashing 1 lsh-algorithm 1 minhash 1 minwise-hashing 1 minwise-hashing-algorithm 1 sketching 1 sketching-algorithm 1 weighted-sets 1 bm25 1 distamce-metric 1 go 1 go-library 1 go-package 1 strings 1 artificial-intelligence 1 artificial-neural-networks 1 deep-neural-networks 1