Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: similarity-measures
cissagatto/rogers
This code generate partitions for a multilabel dataset using the Rogers-Tanimoto similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provide an analysis about the partitioning.
Language: R - Size: 314 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0
cissagatto/jaccard
This code generate partitions for a multilabel dataset using the Jaccard Index similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provide an analysis about the partitioning.
Language: R - Size: 338 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
bhatt-j/Books_Recommender_SL
Self Learning Project based on Books Recommendation System
Size: 23.2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
yg211/explainable-metrics
An explainable sentence similarity measurement
Language: Jupyter Notebook - Size: 1.47 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 9 - Forks: 1
dsevero/Proof-of-Novelty
A distributed consensus mechanism for securing content novelty: Proof of Novelty.
Language: TeX - Size: 4.22 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 13 - Forks: 1
Silicon-Orchard/fake_product_review_check
Detect review manipulation by leveraging reviewer historical stylometrics in Amazon, Yelp, Facebook and Google reviews
Language: Python - Size: 5.13 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1
matchms/matchms-backup Fork of iomega/Spec2Vec_prototyping 📦
Python library for fuzzy comparison of mass spectrum data and other Python objects
Language: Python - Size: 23 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1
sahitilucky/wikiaug
Suggesting tangentially related articles(See also links) for Wikipedia articles
Language: HTML - Size: 23.5 MB - Last synced: 6 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0
oertl/treeminhash
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
Language: C++ - Size: 2.62 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 12 - Forks: 3
Howuhh/link_pred_spark
similarity between graph nodes based on local information with PySpark
Language: Python - Size: 22.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 9 - Forks: 1
AneeshBose/Semantic-Query-Search-Using-Co-Occurrence-Clustering-in-Word-Graphs
Scikit-learn implementation of co-occurrence word graph based semantic query search using machine learning and vector space similarity measures.
Language: Python - Size: 4.66 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
concept-inversion/Recommendation-System
Creating a simple recommendation system on the Basis of similarity
Language: Jupyter Notebook - Size: 544 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 8 - Forks: 7
jim-spyropoulos/Trajectory-Analysis-and-Classification-in-Python-Pandas-and-Scikit-Learn
Formed trajectories of sets of points.Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms.Modeled trajectories as strings based on a Grid representation.Benchmarked KNN, Random Forest, Logistic Regression classification algorithms to classify efficiently trajectories.
Language: Python - Size: 23.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 41 - Forks: 16
Fatemeh-ameri/Data-science-algorithms
Algorithms for Data Science
Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
janschneida/taxodist
Python library for similarity calculations in taxonomic hierarchies and the underlying concepts.
Language: HTML - Size: 6.51 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
iharsuvorau/similarity-metrics
Similarity metrics for event logs
Language: Rust - Size: 16.6 KB - Last synced: 12 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
alphaWizard/link-prediction
Collection of various unsupervised and supervised link prediction approaches
Language: Jupyter Notebook - Size: 30.7 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 2
Charbel199/rna-sequence-differencing
Intelligent Data Processing and Applications course project where the objective is to implement a RNA sequences differencing (edit distance) and patching tool, which can be applied on different kinds of RNA sequence formats (such as FASTQ, EMBL, FASTA, GCG, etc.).
Language: Python - Size: 193 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
babylonhealth/fuzzymax
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
Language: Python - Size: 32.5 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 43 - Forks: 3
sakusakueva/Similarity_measure
The similarity calculation of the template matching method is summarized.
Language: C++ - Size: 5.48 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1
alexnguyen9/recipe-matcher
Find similar recipes based off scraped recipes using similarity functions
Language: Python - Size: 4.91 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
Salma-AZIZ/NLP_First_Steps_Python
Natural Language Processing First Steps with Python
Language: Jupyter Notebook - Size: 60.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
raj1603chdry/CSE3018-Content-Based-Image-and-Video-Retrieval-Lab
Repository containing all the codes created for the lab sessions of CSE3018 Content Based Image and Video Retrieval at VIT University Chennai Campus
Language: MATLAB - Size: 64.5 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 16 - Forks: 8
zoobereq/semantic_similarities
A tool to assess semantic similarity between English words
Language: Python - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
Yuan-fang/ISCtoolbox
inter-subject similarity/correlation
Language: Python - Size: 40 KB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0
Fatma-Eltelwany/NLP-Finding-Similarity-between-movies
Language: Jupyter Notebook - Size: 223 KB - Last synced: almost 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
kargaranamir/Alarm-Similarity
Analytical Derivation and Comparison of Alarm Similarity Measures Paper Code (IFAC Symposium, ADCHEM Conference 2021)
Language: Jupyter Notebook - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0
FiryanulRizky/ProjectTemuKembaliInformasi
Projek Akhir Semester Sistem Temu Kembali Informasi
Language: Python - Size: 2.76 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
sarahaoui/CloudBroker
CloudBroker is a Web Application for providers of cloud services and also for consumers of those services
Language: Java - Size: 67.5 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
TsukiZombina/DependencyMiner
Modified Implementation of two efficient algorithms for discovering functional dependency: TANE and DFD for using similarity metrics.
Language: C++ - Size: 19.2 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
cod3licious/nlputils
Library for analysing text documents: tf-idf transformation, computing similarities, visualisation, etc.
Language: Python - Size: 1.75 MB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 11 - Forks: 6
vasgat/jSimilarity
jSimilarity is a library that implements various similarity measures
Language: Java - Size: 32.2 KB - Last synced: 11 months ago - Pushed: about 5 years ago - Stars: 6 - Forks: 2
babylonhealth/corrsim
Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.
Language: Python - Size: 32.5 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 27 - Forks: 5
RavanSA/customer-segmentation-using-mahalonobis-distance
Customer Segmentation using mahalonobis and minkowsi distance
Language: Jupyter Notebook - Size: 346 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
renett-t/data-mining-course
ITIS Data Mining course assignments
Language: Jupyter Notebook - Size: 58.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
zhongyuchen/similarity-join
Similarity join extension for PostgreSQL
Language: C - Size: 85.6 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 0
morchalabi/COMPARE-suite
Novel ultrafast suite for high-throughput & high-content multiparameter screening as in drug discovery. It has unique modules for QC, bias correction, similarity measurement, clustering and visualization. It can process hundreds of samples with many markers in a few hours not days & circumvents bath effect. It couples with any plate reader.
Language: R - Size: 32.6 MB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0
DrejcPesjak/DPhate-double-paraphrasing-hate-speech
Bachelor's thesis on removing hate from online comments using paraphrasing: algorithm DPhate
Language: Python - Size: 37.1 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
nbro/aal
A Python package to convert a sequence of points in the XY space to another sequence of points in the Angle-Arc-Length (AAL) space.
Language: Python - Size: 4.88 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
tcrouch/edits
Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Language: Ruby - Size: 68.4 KB - Last synced: 25 days ago - Pushed: about 2 months ago - Stars: 2 - Forks: 1
ralfaouad/DocumentDifferencing
Intelligent Data Processing and Application Project
Language: Python - Size: 218 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 2
zhuye88/anne-dbscan-demo Fork of cswords/anne-dbscan-demo
Demo of using aNNE similarity for DBSCAN.
Language: MATLAB - Size: 33.2 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 3
shsarv/UNPLUG-THE-PLAYER
This is the Repository for the Mini Project done using the flask and python libraries, required as a part of the course curriculum.
Language: Jupyter Notebook - Size: 22.2 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 5 - Forks: 3
anmolbansal7/Lost-and-Found
Content-Based Image Retrieval
Language: CSS - Size: 28 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
gipplab/MathMLSim
Similarity calculation module for MathML formulae
Language: Java - Size: 133 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 1
giangntgg/tech_review Fork of BillyZhaohengLi/tech_review
Survey on Similarity Measures for Collaborative Filtering
Size: 211 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
rgayler/fa_sim_cal
Entity resolution research project looking at what can be enabled by construing it as a problem of calibration from similarity to log-odds of a true match.
Language: R - Size: 10.7 MB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 1
ikhlo/SpotiBot
A Discord chatbot for play songs, get informations and find similar tracks !
Language: JavaScript - Size: 8.82 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
SychO9/license-detector
đź“ś A license information detector
Language: PHP - Size: 144 KB - Last synced: about 2 months ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 2
LogicJake/DM_Implement
data preprocessing and similarity methods
Language: Python - Size: 350 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0
ananyaroy1011/Fake-News-Classification
Given the title of a fake news article A and the title of a coming news article B, program classifies B into agree, disagree, and unrelated.
Language: HTML - Size: 410 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0
asier-gutierrez/nn-similarity
A Persistent Homology based Neural Network similarity metric.
Language: Python - Size: 213 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
Priyansh2/Abstract-Based-Sentiment-Detection
This repository contains code for aspect-based sentiment analysis for the "restaurant" domain. Ref. paper:
Language: Jupyter Notebook - Size: 120 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
drseb/phenopposites
Project to generate phenotype opposite_of relationships and investigate the effect on ontology-based algorithms
Size: 19.5 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1
MaryemSamet/Parkinson-diagnostic
Parkinson diagnostic with supervised and unsupervised machine learning
Language: Jupyter Notebook - Size: 1.36 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
sayarghoshroy/Summarization
A lightweight Extractive Summarization Formulation for the CNN Dataset
Language: Jupyter Notebook - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 2
nikdon/SimilarityMeasure
TF-IDF and similarity measure in C#
Language: C# - Size: 527 KB - Last synced: 9 months ago - Pushed: about 10 years ago - Stars: 3 - Forks: 2
TrinhQuocNguyen/ImageSimilarityMeasures
Image Similarity Measures VS Project c++. Source code is borrowed heavily from https://github.com/Rolinh/VQMT
Language: C++ - Size: 146 MB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 0
shamspias/predict-disease-based-on-symptoms
A simple model to predict disease
Language: Python - Size: 233 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
yashgarg1232/MMDTW
Metadata-enriched Dynamic Time Warping for Multi-Variate Time Series
Language: Python - Size: 327 KB - Last synced: 17 days ago - Pushed: over 5 years ago - Stars: 3 - Forks: 2
Shubhammawa/Recommender-Systems
Recommendation systems
Language: Jupyter Notebook - Size: 81.1 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
mwegrzyn/fmri_python_SS2019
Kurs zu fMRT-Datenanalyse mit Python (Sommersemester 2019). Eigenständige Erstellung von MRT-Viewern und DIY-Analyse von fMRT-Zeitverläufen und Aktivierungskarten mit Python.
Language: Jupyter Notebook - Size: 27.6 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0
devonsparks/room-similarity
Reprogramming architectural spaces by size and shape
Language: HTML - Size: 161 KB - Last synced: 4 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
Shashwat4K/Clustering-Documents
Cluster documents based on various similarity measures. The project is based on 'Bag of Words' data from UCI Machine Learning reporitory
Language: Jupyter Notebook - Size: 3.92 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1
ShashwatNigam99/CIFAR-10_classification
Multi-class classification : Representation and similarity measures using CIFAR-10
Language: Jupyter Notebook - Size: 158 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
swelcker/cmd.csp.similarity
A library implementing different string similarity and distance measures for ease of use. A dozen of algorithms (including Levenshtein edit distance and sibblings, Jaro-Winkler, Longest Common Subsequence, cosine similarity etc.) are currently implemented.
Language: Java - Size: 29.3 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
INKWWW/NLP
Gensim\Word2vec\NLP\Similarity :zap::zap:
Language: Python - Size: 1.06 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 2 - Forks: 0
gabrielagustin/Comparing-images
Structural similarity index (SSIM) and mean squared error (MSE)
Language: Python - Size: 67.4 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
OscarHChung/Similarities-between-files
This program will run similarities between two uploaded files. There is a choice of similarities in lines, numbered substrings, and sentences.
Language: HTML - Size: 5.86 KB - Last synced: 11 months ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0
Coldsp33d/UFO-Awesome
Demonstrating the power of Pandas, Tika, and D3.js through exploratory analysis on a UFO sightings dataset
Language: Python - Size: 766 MB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 2 - Forks: 1
cpcdoy/WMD
Word Mover’s Distance implementations
Language: Jupyter Notebook - Size: 431 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
abdsamadf/similarities
Measures the edit distance between two strings
Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
paathurnax/simitrieve
This is a repository to develop my undergraduated thesis
Language: Java - Size: 185 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
spacelis/hrnn4sim
A Hierarchical RNN model for estimating word sequence similarity and category
Language: Python - Size: 26.4 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
shkr/routesimilarity
Language: Python - Size: 6.84 KB - Last synced: 16 days ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
vspinu/simdist
High performance similarity and distance metrics for sparse representations
Language: R - Size: 984 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 3 - Forks: 0
IldikoPilan/swell-norm
Language: Python - Size: 26.6 MB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0
orgh0/Word_Embeddings
Basic scripts for understanding word2vec
Language: Python - Size: 24.4 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0
sherinann/collaborative-filtering-recommender
A collaborative filtering based recommender for a movie database. User based collaborative filtering is used.
Language: Python - Size: 8.79 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0
asrul10/recommendation-answers-wordnet
Similarity measure based on WordNet and Rapid Automatic Keyword Extraction (RAKE)
Language: Python - Size: 11.7 KB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0