An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: similarity-measures

ftessari23/DIEM

Dimension Insensitive Euclidean Metric (DIEM)

Language: MATLAB - Size: 654 KB - Last synced at: about 8 hours ago - Pushed at: about 10 hours ago - Stars: 7 - Forks: 1

ashvardanian/SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

Language: C - Size: 2 MB - Last synced at: 1 day ago - Pushed at: 18 days ago - Stars: 1,468 - Forks: 84

feature23/StringSimilarity.NET

A .NET port of java-string-similarity

Language: C# - Size: 527 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 500 - Forks: 73

matchms/matchms

Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.

Language: Python - Size: 39.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 225 - Forks: 75

atharvaaalok/geosimilarity

Differentiable curve and surface similarity measures.

Language: Python - Size: 82 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 10 - Forks: 2

Aries921wu/Highly-Robust-Movie-Recommendation-engine

A highly sophisticated, tested, robust and procedural recommender.

Language: Python - Size: 35.7 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

hbollon/go-edlib

📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Language: Go - Size: 83 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 523 - Forks: 27

proxectonos/simil-eval

Multilingual toolkit for evaluating LLMs using embeddings

Language: Python - Size: 111 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 1

firmai/mtss-gan 📦

MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai)

Size: 3.62 MB - Last synced at: 5 days ago - Pushed at: almost 5 years ago - Stars: 93 - Forks: 30

aungpyaeap/distfun-matlab

MATLAB functions designed to construct dissimilarity matrices using a variety of distance metric functions. It provides a comprehensive toolkit for analyzing and comparing data sets through different distance measures.

Language: MATLAB - Size: 28.3 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

tcrouch/edits.cr

Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

Language: Crystal - Size: 104 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 16 - Forks: 0

jorge-martinez-gil/ensemble-codesim

Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures

Language: Java - Size: 38.1 MB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

malwaredb/docker

Dockerfiles for MalwareDB, and Postgres with our similarity extensions

Language: Shell - Size: 85.9 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 2

firmai/datagene

DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)

Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 205 - Forks: 24

drostlab/philentropy

Information Theory and Distance Quantification with R

Language: R - Size: 4.46 MB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 142 - Forks: 19

tcrouch/edits

Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

Language: Ruby - Size: 72.3 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

Ranwiesiel/bhattacharyya-collaborative-filtering

Tugas mata kuliah Sistem Rekomendasi & Personalisasi (SRP), Implementasi code pada paper Enhancing recommendation accuracy of item-based collaborative filtering using Bhattacharyya coefficient and most similar item

Language: Jupyter Notebook - Size: 36.1 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

lewinfox/levitate

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

Language: R - Size: 542 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 35 - Forks: 2

oertl/treeminhash

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

Language: C++ - Size: 2.62 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 14 - Forks: 3

Nakilon/dhash-vips

vips-powered ruby gem to measure images similarity, implementing dHash and IDHash algorithms

Language: Ruby - Size: 610 KB - Last synced at: 25 days ago - Pushed at: 5 months ago - Stars: 93 - Forks: 15

nicofilippucci/PyEyeSim Fork of jozsarato/PyEyeSim

Integration and testing of new functionality for the library

Language: Jupyter Notebook - Size: 243 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

tdebatty/java-string-similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Language: Java - Size: 729 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2,726 - Forks: 417

SamiSieranoja/stridx

Fast fuzzy string similarity search and indexing (for filenames)

Language: C++ - Size: 837 KB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

jorge-martinez-gil/uwsd

Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation

Language: Python - Size: 2.03 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

cjekel/similarity_measures

Quantify the difference between two arbitrary curves in space

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 259 - Forks: 42

Ssssssstanley/Retrofitting-Concept-Vector-Representations-of-Medical-Concepts

Retrofitting Concept Vectors

Language: SystemVerilog - Size: 26.1 MB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 6 - Forks: 1

frjnn/bhtsne

Parallel Barnes-Hut t-SNE implementation written in Rust.

Language: Rust - Size: 6.94 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 88 - Forks: 8

sultandaris/SimilarityAbstraction

Tugas Akhir Temu Kembali Informasi

Language: Jupyter Notebook - Size: 480 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

bigpon/SpeechSubjectiveTest

Speech (audio) subjective evaluation system

Language: Python - Size: 1.43 MB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 39 - Forks: 8

fullscreen-triangle/heihachi

Python framework for high performance and distributed electronic music analysis

Language: HTML - Size: 14 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

ryandewolfe33/FuzzyClusteringSimilarity.jl

Code for Dirichlet Random Models for Fuzzy Rand Adjustment

Language: Julia - Size: 2.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BrenoFariasdaSilva/Scientific-Research

My Scientific Research Code Repository.

Language: Python - Size: 7.56 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

habedi/hsdlib

Hardware-accelerated distance metrics and similarity measures for high-dimensional data

Language: C - Size: 146 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 44 - Forks: 1

jm199504/Financial-Time-Series

金融时间序列(预测分析 / 相似度 / 数据处理)

Language: Jupyter Notebook - Size: 4.82 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 235 - Forks: 64

Bhasha03/Highly-Robust-Movie-Recommendation-engine

A highly sophisticated, tested, robust and procedural recommender.

Language: Python - Size: 35.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

al4744/rec-system

🎵 A Python-based content recommendation system using ML algorithms and matrix factorization techniques to analyze 600k-song dataset. Combines SVD, NMF, Factorization Machines, and Direct Similarity for personalized music suggestions. Handles cold start, optimizes with weighted similarity, and includes tools for visualization & evaluation.

Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jorge-martinez-gil/graphcodebert-interpretability

Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks

Language: Python - Size: 9.71 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

soubankhandwani/ai-paper-evaluation-model

This project is an intelligent web application that compares student answers from scanned or typed PDFs against teacher-provided answer PDFs using NLP techniques and machine learning. It performs OCR, text extraction, preprocessing, and semantic similarity scoring to generate marks for each question.

Language: HTML - Size: 195 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

patrickzib/SFA

Scalable Time Series Data Analytics

Language: Java - Size: 112 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 315 - Forks: 68

koheiw/proxyC

R package for large-scale similarity/distance computation

Language: R - Size: 4.58 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 28 - Forks: 6

BerensRWU/Point-Cloud-Comparison

Different Point Cloud Similarity Measures.

Language: Python - Size: 7.81 KB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

Markkreel/Binary-Static-Analysis-Through-Instruction-and-Operand-Extraction-and-AHC-Algorithm

A static binary analysis tool visualizes code blocks in the assembly of a disassembled binary file using the AHC algorithm, aided by entropy calculation and similarity measurement.

Language: Assembly - Size: 3.93 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jorge-martinez-gil/graphcodebert-feature-integration

Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration

Language: Python - Size: 85 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 1

jessicabonnie/dandd

Tool to estimate deltas for sequence sets and answer questions about relative contribution

Language: Python - Size: 1.75 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 21 - Forks: 1

cissagatto/SimilaritiesMultiLabel

This code is part of my Ph.D. research. The aim is generate similarity matrices from similarity measures.

Language: R - Size: 342 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Jordan-18/reckomik.be

Language: Jupyter Notebook - Size: 50.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

xhan97/IsoKernel

A scikit-learn-compatible module for Isolation Kernel.

Language: Python - Size: 89.8 KB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 2

Olliang/Statistical-Similarity-Measurement

A methodology designed to validate the statistical similarity of synthetic data generated by GAN models. The metrics contain Auto-encoder, PCA, t-SNE, KL-divergence, Clustering, and Cosine Similarity.

Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 0

oist-ncbc/spykesim 📦

Extended edit similarity measurement for high dimensional discrete-time series signal (e.g., multi-unit spike-train).

Language: Python - Size: 4.34 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 3

jorge-martinez-gil/similarity-ensemble

A comprehensive review of stacking methods for semantic similarity measurement

Size: 10.7 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jorge-martinez-gil/sesige

Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution

Language: Python - Size: 75.2 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

cjekel/DTW_cpp

Dynamic Time Warping single header library for C++

Language: C++ - Size: 51.8 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 49 - Forks: 14

babylonhealth/corrsim

Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.

Language: Python - Size: 32.5 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 38 - Forks: 7

Wittline/distance-metrics

Distance metrics are one of the most important parts of some machine learning algorithms, supervised and unsupervised learning, it will help us to calculate and measure similarities between numerical values expressed as data points

Language: Jupyter Notebook - Size: 40 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

solo-studios/kt-fuzzy

A zero-dependency Kotlin Multiplatform library for fuzzy string matching

Language: Kotlin - Size: 1.21 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 1

berhane/arbalign

aligns arbitrarily ordered isomers

Language: Python - Size: 5.73 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

ngmarchant/comparator

Similarity and distance measures for clustering and record linkage applications in R

Language: R - Size: 275 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 0

mrunmaim16/CSE-5334-Programming-Assignments

Programming assignments completed for course CSE - 5334 Data Mining under Professor Dr. Marnim Galib.

Language: Jupyter Notebook - Size: 460 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sumn2u/string-comparisons

A collection of string comparisons algorithms

Language: JavaScript - Size: 700 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 5

s-emanuilov/LangVec

Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).

Language: Python - Size: 1 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

codeasarjun/vector_store

An easy way to understand vector store working and creation.

Language: Jupyter Notebook - Size: 119 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dumitrescustefan/RoWordNet

Romanian WordNet (Data + API for Python)

Language: Python - Size: 84.9 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 48 - Forks: 18

Coldsp33d/UFO-Awesome

Demonstrating the power of Pandas, Tika, and D3.js through exploratory analysis on a UFO sightings dataset

Language: Python - Size: 766 MB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

babylonhealth/fuzzymax

Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.

Language: Python - Size: 32.5 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 43 - Forks: 5

farvath/Candidate-Matching

This project implements a candidate matching system that recommends suitable candidates for job openings. It leverages machine learning techniques to analyze historical hiring data and improve matching accuracy.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kaledhoshme123/Multimodal-face-generation-facial-biometrics-

Similarity between faces: One person resembles another person to a large degree. This can lead to many problems facing security surveillance systems. Facial recognition systems have difficulty distinguishing between the main person and other people who are highly similar in terms of features.

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

shinhanbyeol/study-vector-data

벡터데이터에 대한 개념을 공부하기 위해 작성한 코드

Language: Python - Size: 1.64 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

paumartinez1/Spotify-info-retrieval

Using information retrieval to help an emerging local artist boost its streams

Language: HTML - Size: 17.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

iamabhaytiwari343/movie-recommender

A content based movie recommender that recommend movies based on tags

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

fredrikSveen/MSc_time_series_GAN

This is the code related to my MSc thesis at the Norwegian University of Science and Technology (NTNU). The MSc program is Electronic system design and innovation with a specialization in signal processing. The goal of this thesis is to explore and compare the state of the art solution to more traditional models for time series generation.

Language: Jupyter Notebook - Size: 43.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

gipplab/MathMLSim

Similarity calculation module for MathML formulae

Language: Java - Size: 163 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

tony0021074/Clustering-webLogData

Language: Jupyter Notebook - Size: 627 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

mfathul21/food-recommendations

Food Recommendations System With Content Based Filtering and Collaborative Filtering

Language: Jupyter Notebook - Size: 794 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jyl1/distance_python

Northwestern Research Computing Services - Distance and similarity - (Python) Workshop

Language: Jupyter Notebook - Size: 654 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

parklize/resim

resim implementation

Size: 9.37 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

GyanPrakashkushwaha/twoFace-match-Deepface

Utilizing the DeepFace Library, informed by a dataset of 4M images across 4K identities curated by Facebook researchers, My 'Two Faces✌🏻' project gauges facial similarity with precision.

Language: Jupyter Notebook - Size: 37.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 2

vishant-mehta/Music-Recommendation-System-using-Cosine-Similarity

The recommender framework goes about as a friend in need and channels the melodies that are reasonable for that client at that point. It likewise expands the client's fulfilment by playing fitting tune at the correct time, and, in the interim, limit the client's work.

Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

cswords/anne-dbscan-demo

Demo of using aNNE similarity for DBSCAN.

Language: MATLAB - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 7

swelcker/cmd.csp.similarity

A library implementing different string similarity and distance measures for ease of use. A dozen of algorithms (including Levenshtein edit distance and sibblings, Jaro-Winkler, Longest Common Subsequence, cosine similarity etc.) are currently implemented.

Language: Java - Size: 29.3 KB - Last synced at: 4 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

sajeel12/image-similarity-checker

it is the only eisiest image similarity checker in Python

Language: Python - Size: 948 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

beviah/ezglot

Selected data processing scripts including language agnostic multilingual wiktionary parser

Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

tr0j4n034/Count-Min-Sketch

Language: C++ - Size: 12.1 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

lqzhao/SAFNet

[IROS 2021] Implementation of "Similarity-Aware Fusion Network for 3D Semantic Segmentation"

Language: Python - Size: 35.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 4

0cherry/SimilarityAnalyzer

binary function similarity analyzer

Language: Python - Size: 40 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 1

0cherry/PackerIdentificator

packer identification tool using SVM

Language: Python - Size: 31.8 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 1

gitgit-hooray/finalCapstone

NLP Sentiment Analysis & Similarity Comparison of an Amazon Product Reviews Dataset

Language: Python - Size: 112 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

xgfs/verse

Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures

Language: C++ - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 128 - Forks: 22

vasgat/jSimilarity

jSimilarity is a library that implements various similarity measures

Language: Java - Size: 32.2 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

khrynczenko/compimg

compimg - python package for computing similarity between the images

Language: Python - Size: 83 KB - Last synced at: 11 days ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 0

hailiang-wang/word2vec-get-started

word embedding model with word2vec and insuranceqa-corpus

Language: C - Size: 151 MB - Last synced at: 6 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 2

jorge-martinez-gil/crosslingual-clone-detection

Transcending Language Barriers in Software Engineering with Crosslingual Code Clone Detection

Language: Java - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

liliansteven/Word-Embeddings-and-Similarity-Measures

This repository contains preprocessing and analyzing document similarity using Word2Vec embeddings, various similarity measures, and different visualization techniques.

Language: Jupyter Notebook - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

raj1603chdry/CSE3018-Content-Based-Image-and-Video-Retrieval-Lab

Repository containing all the codes created for the lab sessions of CSE3018 Content Based Image and Video Retrieval at VIT University Chennai Campus

Language: MATLAB - Size: 64.5 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 16 - Forks: 9

hspsuhas/Document-Similarity

A Web App which uses Cosine Similarity to measure the similarity between 2 documents.

Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nlpub/rdt 📦

RDT: Russian Distributional Thesaurus (Русский Дистрибутивный Тезаурус)

Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 26 - Forks: 2

luukka76/data-analytics

Matlab files for data analytics methods

Language: MATLAB - Size: 49.8 KB - Last synced at: 8 months ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 0

WenRichard/Customer-Chatbot

中文智能客服机器人demo,包含闲聊和专业问答2个部分,支持自定义组件(Chinese intelligent customer chatbot Demo, including the gossip and the professional Q&A(FAQ) , support for custom components!)

Language: Python - Size: 4.58 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 295 - Forks: 110

Anvi98/word_similarities

Code from scratch Word Similarities and have a sense of representation of word vectors with Pure Python.

Language: Python - Size: 2.54 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DavideNardone/MTSS-Multivariate-Time-Series-Software

A GP-GPU/CPU Dynamic Time Warping (DTW) implementation for the analysis of Multivariate Time Series (MTS).

Language: Cuda - Size: 27.8 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 39 - Forks: 9

ansegura7/Algorithms

Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.

Language: HTML - Size: 23.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 128 - Forks: 48

Related Keywords
similarity-measures 206 python 33 machine-learning 28 nlp 22 similarity 21 cosine-similarity 15 distance-measures 13 natural-language-processing 10 levenshtein-distance 9 recommendation-system 9 recommender-system 9 word2vec 9 semantic-similarity 8 collaborative-filtering 8 fuzzy-matching 8 similarity-search 8 similarity-score 8 data-science 8 scikit-learn 7 tf-idf 6 data-mining 6 python3 6 classification 6 semantic 6 jaro-winkler 6 damerau-levenshtein 6 algorithms 6 numpy 6 dtw 5 word-embeddings 5 semantic-similarity-measures 5 pandas 5 r 5 image-processing 5 information-retrieval 5 clustering 5 distance 5 flask 4 content-based-recommendation 4 pytorch 4 opencv 4 wordnet 4 levenshtein 4 clone-detection 4 code-similarity 4 string-distance 4 feature-extraction 3 dimensionality-reduction 3 bag-of-words 3 cosine 3 edit-distance 3 graph-algorithms 3 synthetic-data 3 nltk-python 3 sentiment-analysis 3 codebert 3 graphcodebert 3 pattern-recognition 3 data-visualization 3 dynamic-time-warping 3 jaro 3 tokenization 3 text-processing 3 deep-learning 3 comparison 3 fuzzy-search 3 text-analysis 3 sequence-analysis 3 word2vec-embeddinngs 3 semantics 3 nltk 3 hierarchical-clustering 3 jaccard-similarity 3 streamlit 3 java 3 neural-networks 3 distance-metrics 3 time-series 3 distance-calculation 2 mlp-classifier 2 vectorization 2 lemmetization 2 fake-news-classification 2 bert 2 string-similarity 2 clustering-algorithm 2 semantic-web 2 tfidf-vectorizer 2 bert-embeddings 2 jaccard-index 2 distance-measure 2 huggingface 2 similarity-detection 2 similarity-measurement 2 image-similarity 2 similarity-metric 2 sketching 2 pca-analysis 2 artificial-intelligence 2 vector 2