An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: similarity-measurement

kornelski/dssim

Image similarity comparison simulating human perception (multiscale SSIM in Rust)

Language: Rust - Size: 997 KB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 1,124 - Forks: 71

brightmart/nlu_sim

all kinds of baseline models for sentence similarity 句子对语义相似度模型

Language: Python - Size: 16.4 MB - Last synced at: 16 days ago - Pushed at: almost 7 years ago - Stars: 297 - Forks: 89

llrs/BioCor

Package to calculate functional similarity between genes https://biocor.llrs.dev

Language: R - Size: 9.47 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 14 - Forks: 1

jason-chao/appcestry

“Appcestry” (a portmanteau of “app” and “ancestry”) is a tool for the study of similarities of Android applications (apps).

Language: Python - Size: 813 KB - Last synced at: 6 months ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 3

anmol52490/RAG

RAG-Powered Chatbot: An intelligent chatbot that uses RAG (Retrieval-Augmented Generation) to provide responses based on information retrieved from a document database. Integrates Groq for response generation, Chroma for document management, and HuggingFace for embeddings.

Language: Python - Size: 5.46 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

fperdigon/ECG-BaseLineWander-Removal-Methods

This repository contains 9 methods for Base Line Wander removal. It also contains 3 similarity metrics that are applied to signals.

Language: MATLAB - Size: 419 KB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 39 - Forks: 7

liquidsunset/similarity_search

Language: C++ - Size: 452 KB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 2

swelcker/cmd.csp.similarity

A library implementing different string similarity and distance measures for ease of use. A dozen of algorithms (including Levenshtein edit distance and sibblings, Jaro-Winkler, Longest Common Subsequence, cosine similarity etc.) are currently implemented.

Language: Java - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Vermouth1995/StringSimilarityDetection

algorithm of similarity detection about two string

Language: Go - Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1

adamliesko/tlsh

TLSH (Trend Micro Locality Sensitive Hash) library for Ruby

Language: Ruby - Size: 549 KB - Last synced at: 10 months ago - Pushed at: almost 8 years ago - Stars: 25 - Forks: 3

castorini/VDPWI-NN-Torch 📦

Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)

Language: Lua - Size: 884 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 19 - Forks: 8

castorini/MP-CNN-Torch 📦

Multi-Perspective Convolutional Neural Networks for modeling textual similarity (He et al., EMNLP 2015)

Language: Lua - Size: 1.28 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 107 - Forks: 59

RandolphVI/Text-Pairs-Relation-Classification

About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.

Language: Python - Size: 376 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 190 - Forks: 55

Rohith-2/Chaos-Game-Representation_BioSeq

Representation of Bio Sequences via Chaos Game and using the same to find similarities

Language: Python - Size: 89.5 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 2

kornosk/GDPR-similarity-comparison

This repo aims to extract pieces of GDPR-like content and form well-structured data for easy processing. We measure the similarity between GDPR-like from different countries.

Language: Python - Size: 21.4 MB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

kanata2/ruigi 📦

Ruigi is the library for computing the similarity between documents.

Language: Ruby - Size: 16.6 KB - Last synced at: 29 days ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

Orange-OpenSource/documentare-simdoc 📦

New Developments are now done on Gitlab.com: https://gitlab.com/Orange-OpenSource/documentare/documentare-simdoc . Library and tools for similarity measurement, classification and clustering of digital content and segmentation images from digitized document

Language: Java - Size: 34.1 MB - Last synced at: 29 days ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 1

syreal17/Cardinal

Similarity Analysis to Defeat Malware Compiler Variations

Language: LLVM - Size: 279 MB - Last synced at: 2 days ago - Pushed at: over 7 years ago - Stars: 25 - Forks: 4

VPanjeta/Data-Analytics-Lab

Project for finding semantic similarity between 2 sentences using nltk wordnets

Language: Jupyter Notebook - Size: 122 KB - Last synced at: 1 day ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

JaFro96/Geosoftware2 Fork of carobro/Geosoftware2

Geosoftware II - WiSe 2018/19 Enhancing discovery of geospatial datasets in data repositories

Language: Python - Size: 84.2 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

DwarakaSKulkarni/OntEncode

This repository provides an implementation of a prime number based ontology encoding technique to enable efficient match making among the concepts of the ontology.

Language: HTML - Size: 145 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

MayukhSobo/golp

A fast and ambitious implementation of NLP libraries in GoLang

Language: Go - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

mohammedjasam/CNN-Scrapper

Collects articles from CNN.com and performs various algorithms on it to find out similarities between articles.

Language: Python - Size: 608 KB - Last synced at: almost 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 1

cueo/spoken-tutorial

Annotating forum links to spoken-tutorial.org videos at different time intervals using the concepts of NLP

Language: Python - Size: 13.7 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

Related Keywords
similarity-measurement 24 similarity 5 nlp 3 similarity-detection 2 python3 2 similarity-measures 2 deep-learning 2 similarity-score 2 ruby 2 bioinformatics 2 word2vec 2 golang 2 semantic-similarity 2 benchmark 1 gdpr 1 legal-documents 1 nlp-resources 1 privacy-protection 1 classification 1 clustering 1 image-segmentation 1 bloom-filter 1 streamlit 1 sequence-to-sequence 1 gui 1 chaos-game-representation 1 chaos-game 1 chaos 1 bio-sequences 1 text-pairs-classification 1 text-classification 1 tensorflow 1 spoken-tutorial 1 annotating-forum-links 1 scraper 1 ranking 1 parallel 1 sparql 1 ontology 1 jena 1 java 1 encoding 1 zenodo 1 metadata-extraction 1 geospatial-data 1 wordnet 1 synsets 1 semantic-similarity-measures 1 semantic 1 nltk 1 test-harness 1 optimization 1 malware-research 1 malware-analysis 1 isocompiler-modulation 1 ida 1 cpc 1 cardinalities 1 reverse-engineering 1 plagiarism-detection 1 data-mining 1 android-app 1 pathways 1 pathway-analysis 1 gene-sets 1 gene 1 functional-similarity 1 bioconductor-packages 1 sentence-similarity 1 questions-and-answers 1 question-answering 1 qa 1 nlu 1 atec 1 ssim 1 libpng 1 image-benchmark 1 dssim 1 compress-images 1 comparison 1 c 1 sentence-classification 1 convolutional-neural-networks 1 tlsh 1 locality-sensitive-hashing 1 hashing 1 gem 1 fuzzy 1 algorithm 1 text-processing 1 text-analysis 1 nlp-library 1 databases 1 database-development 1 c-plus-plus 1 matlab 1 ecg-signal 1 baseline-wander-removal 1 text-embeddings 1 interactive-chatbot 1