Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: similarity-measures

cissagatto/rogers

This code generates partitions for a multilabel dataset using the Rogers-Tanimoto similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provides an analysis of the partitioning.

Language: R - Size: 314 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

cissagatto/jaccard

This code generates partitions for a multilabel dataset using the Jaccard Index similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provides an analysis of the partitioning.

Language: R - Size: 338 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

bhatt-j/Books_Recommender_SL

Self-learning project based on a book recommendation system

Size: 23.2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

yg211/explainable-metrics

An explainable sentence similarity measurement

Language: Jupyter Notebook - Size: 1.47 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 9 - Forks: 1

dsevero/Proof-of-Novelty

A distributed consensus mechanism for securing content novelty: Proof of Novelty.

Language: TeX - Size: 4.22 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 13 - Forks: 1

Silicon-Orchard/fake_product_review_check

Detect review manipulation by leveraging reviewer historical stylometrics in Amazon, Yelp, Facebook and Google reviews

Language: Python - Size: 5.13 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1

matchms/matchms-backup Fork of iomega/Spec2Vec_prototyping 📦

Python library for fuzzy comparison of mass spectrum data and other Python objects

Language: Python - Size: 23 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1

sahitilucky/wikiaug

Suggesting tangentially related articles ("See also" links) for Wikipedia articles

Language: HTML - Size: 23.5 MB - Last synced: 6 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0

oertl/treeminhash

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

Language: C++ - Size: 2.62 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 12 - Forks: 3

Howuhh/link_pred_spark

similarity between graph nodes based on local information with PySpark

Language: Python - Size: 22.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 9 - Forks: 1

AneeshBose/Semantic-Query-Search-Using-Co-Occurrence-Clustering-in-Word-Graphs

Scikit-learn implementation of co-occurrence word graph based semantic query search using machine learning and vector space similarity measures.

Language: Python - Size: 4.66 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

concept-inversion/Recommendation-System

Creating a simple recommendation system on the Basis of similarity

Language: Jupyter Notebook - Size: 544 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 8 - Forks: 7

jim-spyropoulos/Trajectory-Analysis-and-Classification-in-Python-Pandas-and-Scikit-Learn

Formed trajectories from sets of points. Experimented with finding similarities between trajectories based on the DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms. Modeled trajectories as strings based on a grid representation. Benchmarked KNN, Random Forest, and Logistic Regression classification algorithms to classify trajectories efficiently.

Language: Python - Size: 23.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 41 - Forks: 16

Fatemeh-ameri/Data-science-algorithms

Algorithms for Data Science

Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

janschneida/taxodist

Python library for similarity calculations in taxonomic hierarchies and the underlying concepts.

Language: HTML - Size: 6.51 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

iharsuvorau/similarity-metrics

Similarity metrics for event logs

Language: Rust - Size: 16.6 KB - Last synced: 12 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

alphaWizard/link-prediction

Collection of various unsupervised and supervised link prediction approaches

Language: Jupyter Notebook - Size: 30.7 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 2

Charbel199/rna-sequence-differencing

Intelligent Data Processing and Applications course project whose objective is to implement an RNA sequence differencing (edit distance) and patching tool that can be applied to different RNA sequence formats (such as FASTQ, EMBL, FASTA, GCG, etc.).

Language: Python - Size: 193 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

babylonhealth/fuzzymax

Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.

Language: Python - Size: 32.5 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 43 - Forks: 3

sakusakueva/Similarity_measure

A summary of the similarity calculations used in template matching.

Language: C++ - Size: 5.48 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1

alexnguyen9/recipe-matcher

Find similar recipes based off scraped recipes using similarity functions

Language: Python - Size: 4.91 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Salma-AZIZ/NLP_First_Steps_Python

Natural Language Processing First Steps with Python

Language: Jupyter Notebook - Size: 60.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

raj1603chdry/CSE3018-Content-Based-Image-and-Video-Retrieval-Lab

Repository containing all the code written for the lab sessions of CSE3018 Content Based Image and Video Retrieval at VIT University Chennai Campus

Language: MATLAB - Size: 64.5 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 16 - Forks: 8

zoobereq/semantic_similarities

A tool to assess semantic similarity between English words

Language: Python - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Yuan-fang/ISCtoolbox

inter-subject similarity/correlation

Language: Python - Size: 40 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0

Fatma-Eltelwany/NLP-Finding-Similarity-between-movies

Language: Jupyter Notebook - Size: 223 KB - Last synced: almost 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

kargaranamir/Alarm-Similarity

Analytical Derivation and Comparison of Alarm Similarity Measures Paper Code (IFAC Symposium, ADCHEM Conference 2021)

Language: Jupyter Notebook - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0

FiryanulRizky/ProjectTemuKembaliInformasi

End-of-semester final project for Information Retrieval Systems

Language: Python - Size: 2.76 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

sarahaoui/CloudBroker

CloudBroker is a Web Application for providers of cloud services and also for consumers of those services

Language: Java - Size: 67.5 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

TsukiZombina/DependencyMiner

Modified implementation of two efficient algorithms for discovering functional dependencies, TANE and DFD, adapted to use similarity metrics.

Language: C++ - Size: 19.2 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

cod3licious/nlputils

Library for analysing text documents: tf-idf transformation, computing similarities, visualisation, etc.

Language: Python - Size: 1.75 MB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 11 - Forks: 6

vasgat/jSimilarity

jSimilarity is a library that implements various similarity measures

Language: Java - Size: 32.2 KB - Last synced: 11 months ago - Pushed: about 5 years ago - Stars: 6 - Forks: 2

babylonhealth/corrsim

Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.

Language: Python - Size: 32.5 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 27 - Forks: 5

RavanSA/customer-segmentation-using-mahalonobis-distance

Customer segmentation using Mahalanobis and Minkowski distance

Language: Jupyter Notebook - Size: 346 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

renett-t/data-mining-course

ITIS Data Mining course assignments

Language: Jupyter Notebook - Size: 58.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

zhongyuchen/similarity-join

Similarity join extension for PostgreSQL

Language: C - Size: 85.6 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 0

morchalabi/COMPARE-suite

Novel ultrafast suite for high-throughput and high-content multiparameter screening, as in drug discovery. It has unique modules for QC, bias correction, similarity measurement, clustering, and visualization. It can process hundreds of samples with many markers in a few hours rather than days and circumvents the batch effect. It couples with any plate reader.

Language: R - Size: 32.6 MB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

DrejcPesjak/DPhate-double-paraphrasing-hate-speech

Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate

Language: Python - Size: 37.1 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

nbro/aal

A Python package to convert a sequence of points in the XY space to another sequence of points in the Angle-Arc-Length (AAL) space.

Language: Python - Size: 4.88 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

tcrouch/edits

Edit distance algorithms, including Jaro, Damerau-Levenshtein, and Optimal Alignment

Language: Ruby - Size: 68.4 KB - Last synced: 25 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 1

ralfaouad/DocumentDifferencing

Intelligent Data Processing and Application Project

Language: Python - Size: 218 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 2

zhuye88/anne-dbscan-demo Fork of cswords/anne-dbscan-demo

Demo of using aNNE similarity for DBSCAN.

Language: MATLAB - Size: 33.2 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 3

shsarv/UNPLUG-THE-PLAYER

Repository for a mini project built with Flask and Python libraries, required as part of the course curriculum.

Language: Jupyter Notebook - Size: 22.2 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 5 - Forks: 3

anmolbansal7/Lost-and-Found

Content-Based Image Retrieval

Language: CSS - Size: 28 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

gipplab/MathMLSim

Similarity calculation module for MathML formulae

Language: Java - Size: 133 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 1

giangntgg/tech_review Fork of BillyZhaohengLi/tech_review

Survey on Similarity Measures for Collaborative Filtering

Size: 211 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

rgayler/fa_sim_cal

Entity resolution research project looking at what can be enabled by construing it as a problem of calibration from similarity to log-odds of a true match.

Language: R - Size: 10.7 MB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 1

ikhlo/SpotiBot

A Discord chatbot to play songs, get information, and find similar tracks!

Language: JavaScript - Size: 8.82 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

SychO9/license-detector

📜 A license information detector

Language: PHP - Size: 144 KB - Last synced: about 2 months ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 2

LogicJake/DM_Implement

data preprocessing and similarity methods

Language: Python - Size: 350 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

ananyaroy1011/Fake-News-Classification

Given the title of a fake news article A and the title of an incoming news article B, the program classifies B as agree, disagree, or unrelated.

Language: HTML - Size: 410 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

asier-gutierrez/nn-similarity

A Persistent Homology based Neural Network similarity metric.

Language: Python - Size: 213 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0

Priyansh2/Abstract-Based-Sentiment-Detection

This repository contains code for aspect-based sentiment analysis for the "restaurant" domain. Ref. paper:

Language: Jupyter Notebook - Size: 120 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

drseb/phenopposites

Project to generate phenotype opposite_of relationships and investigate the effect on ontology-based algorithms

Size: 19.5 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1

MaryemSamet/Parkinson-diagnostic

Parkinson diagnostic with supervised and unsupervised machine learning

Language: Jupyter Notebook - Size: 1.36 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

sayarghoshroy/Summarization

A lightweight Extractive Summarization Formulation for the CNN Dataset

Language: Jupyter Notebook - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 2

nikdon/SimilarityMeasure

TF-IDF and similarity measure in C#

Language: C# - Size: 527 KB - Last synced: 9 months ago - Pushed: about 10 years ago - Stars: 3 - Forks: 2

TrinhQuocNguyen/ImageSimilarityMeasures

Image Similarity Measures VS Project c++. Source code is borrowed heavily from https://github.com/Rolinh/VQMT

Language: C++ - Size: 146 MB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 0

shamspias/predict-disease-based-on-symptoms

A simple model to predict disease

Language: Python - Size: 233 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

yashgarg1232/MMDTW

Metadata-enriched Dynamic Time Warping for Multi-Variate Time Series

Language: Python - Size: 327 KB - Last synced: 17 days ago - Pushed: over 5 years ago - Stars: 3 - Forks: 2

Shubhammawa/Recommender-Systems

Recommendation systems

Language: Jupyter Notebook - Size: 81.1 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

mwegrzyn/fmri_python_SS2019

Course on fMRI data analysis with Python (summer semester 2019). Building your own MRI viewers and DIY analysis of fMRI time courses and activation maps with Python.

Language: Jupyter Notebook - Size: 27.6 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0

devonsparks/room-similarity

Reprogramming architectural spaces by size and shape

Language: HTML - Size: 161 KB - Last synced: 4 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

Shashwat4K/Clustering-Documents

Cluster documents based on various similarity measures. The project is based on the 'Bag of Words' data from the UCI Machine Learning Repository

Language: Jupyter Notebook - Size: 3.92 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1

ShashwatNigam99/CIFAR-10_classification

Multi-class classification: representation and similarity measures using CIFAR-10

Language: Jupyter Notebook - Size: 158 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

swelcker/cmd.csp.similarity

A library implementing different string similarity and distance measures for ease of use. A dozen algorithms (including Levenshtein edit distance and siblings, Jaro-Winkler, Longest Common Subsequence, cosine similarity, etc.) are currently implemented.

Language: Java - Size: 29.3 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

INKWWW/NLP

Gensim\Word2vec\NLP\Similarity :zap::zap:

Language: Python - Size: 1.06 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 2 - Forks: 0

gabrielagustin/Comparing-images

Structural similarity index (SSIM) and mean squared error (MSE)

Language: Python - Size: 67.4 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

OscarHChung/Similarities-between-files

This program computes similarities between two uploaded files, with a choice of comparing lines, numbered substrings, or sentences.

Language: HTML - Size: 5.86 KB - Last synced: 11 months ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

Coldsp33d/UFO-Awesome

Demonstrating the power of Pandas, Tika, and D3.js through exploratory analysis on a UFO sightings dataset

Language: Python - Size: 766 MB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 2 - Forks: 1

cpcdoy/WMD

Word Mover’s Distance implementations

Language: Jupyter Notebook - Size: 431 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

abdsamadf/similarities

Measures the edit distance between two strings

Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

paathurnax/simitrieve

Repository for developing my undergraduate thesis

Language: Java - Size: 185 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

spacelis/hrnn4sim

A Hierarchical RNN model for estimating word sequence similarity and category

Language: Python - Size: 26.4 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

shkr/routesimilarity

Language: Python - Size: 6.84 KB - Last synced: 16 days ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

vspinu/simdist

High performance similarity and distance metrics for sparse representations

Language: R - Size: 984 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 3 - Forks: 0

IldikoPilan/swell-norm

Language: Python - Size: 26.6 MB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

orgh0/Word_Embeddings

Basic scripts for understanding word2vec

Language: Python - Size: 24.4 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

sherinann/collaborative-filtering-recommender

A collaborative filtering based recommender for a movie database. User based collaborative filtering is used.

Language: Python - Size: 8.79 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

asrul10/recommendation-answers-wordnet

Similarity measure based on WordNet and Rapid Automatic Keyword Extraction (RAKE)

Language: Python - Size: 11.7 KB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

Related Keywords
similarity-measures (180), python (31), machine-learning (26), nlp (20), similarity (17), distance-measures (12), cosine-similarity (10), natural-language-processing (9), word2vec (9), levenshtein-distance (9), recommender-system (8), fuzzy-matching (8), data-science (7), recommendation-system (6), damerau-levenshtein (6), algorithms (6), similarity-score (6), semantic-similarity (6), jaro-winkler (6), collaborative-filtering (5), data-mining (5), word-embeddings (5), information-retrieval (5), semantic (5), image-processing (5), python3 (5), classification (5), similarity-search (5), r (5), dtw (5), numpy (5), distance (5), tf-idf (4), semantic-similarity-measures (4), levenshtein (4), string-distance (4), pandas (4), scikit-learn (4), wordnet (4), flask (4), opencv (4), deep-learning (4), edit-distance (3), jaro (3), nltk-python (3), cosine (3), clustering (3), jaccard-similarity (3), semantics (3), fuzzy-search (3), sequence-analysis (3), dynamic-time-warping (3), synthetic-data (3), text-processing (3), java (3), sentiment-analysis (3), time-series (3), comparison (3), pytorch (3), graph-algorithms (3), nltk (3), text-analysis (3), chatbot (2), sketching (2), jaro-winkler-distance (2), matplotlib (2), streamlit-webapp (2), distance-metrics (2), scientific-research (2), multilabel-classification (2), drug-target-interactions (2), drug-drug-interaction (2), feature-extraction (2), drug-discovery (2), link-prediction (2), isolation-kernel (2), distance-measure (2), image-similarity (2), string-comparison (2), mathematics (2), ontologies (2), neural-networks (2), ml (2), text (2), fuzzywuzzy (2), semantic-web (2), similarity-detection (2), finance (2), spotify-api (2), tfidf-vectorizer (2), statistics (2), synthetic-dataset-generation (2), similarity-metric (2), embeddings (2), content-based-recommendation (2), content-based-image-retrieval (2), machine-learning-algorithms (2), bag-of-words (2), streamlit (2), string-similarity (2)