An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: jaccard-similarity-estimation

NHViet03/Web_Social_Network_with_Link_Prediction

A full-stack social network web application built with React JS, Redux, NodeJS, Socket.IO, and MongoDB. It uses Python's NetworkX library to represent graphs and the FastAPI framework to serve follower suggestions to the frontend. The link prediction model includes algorithms such as CN (Common Neighbors), Jaccard, AA (Adamic-Adar), and Katz.

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

amatov/DifferentialDiagnosisCBIR

Image retrieval can facilitate medical diagnosis by identifying categories of phenotypes that are similar to those of a new patient presented for diagnosis and that have already been assigned a diagnosis.

Language: Python - Size: 604 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

oertl/treeminhash

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

Language: C++ - Size: 2.62 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 3

oertl/probminhash

ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity

Language: C++ - Size: 6.26 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 42 - Forks: 6

dynatrace-research/set-sketch-paper

SetSketch: Filling the Gap between MinHash and HyperLogLog

Language: C++ - Size: 23.7 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 46 - Forks: 5

oertl/bagminhash

BagMinHash - Minwise Hashing Algorithm for Weighted Sets

Language: C++ - Size: 1.02 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 26 - Forks: 6

jgurakuqi/ranker-comparator

The goal of this project is to provide a tool for building new rankers easily and comparing them with existing ones in terms of result overlap.

Language: C++ - Size: 7.53 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nnnet/superminhash

SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 7

micts/jss

Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality Sensitive Hashing

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0
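
For readers unfamiliar with the technique this topic is named after, the following is a minimal sketch of MinHash-based Jaccard similarity estimation. It is not taken from any repository listed here; the function names (`jaccard`, `minhash_signature`, `estimate_jaccard`), the seeded BLAKE2b hash, and the default of 256 hash functions are all assumptions chosen for illustration.

```python
import hashlib


def jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B| of two collections."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention assumed here: two empty sets are identical
    return len(a & b) / len(a | b)


def _hash(item, seed):
    """Hypothetical helper: a seeded 64-bit hash built from BLAKE2b."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=256):
    """For each seeded hash function, keep the minimum hash over the set."""
    items = set(items)
    if not items:
        raise ValueError("cannot sketch an empty set")
    return [min(_hash(x, seed) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions where the two minima agree."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    a = set(range(0, 100))
    b = set(range(50, 150))
    print("exact:   ", jaccard(a, b))
    print("estimate:", estimate_jaccard(minhash_signature(a), minhash_signature(b)))
```

The estimate works because, for a well-mixing hash function, the probability that two sets share the same minimum hash equals their Jaccard similarity. Using more hash functions reduces the variance of the estimate at the cost of a longer signature.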

lstasiak/Big-Data-Algorithms-exercises

Set of tasks solved in Big Data Algorithms course

Language: Scala - Size: 3.06 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

sumitsingh34/school-projects

This contains all the projects I completed during my master's degree.

Language: Python - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

esalini22/gene-hll

HyperLogLog in C++ and OpenMP for computing genome similarity using the Jaccard index

Language: C++ - Size: 185 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Related Keywords
jaccard-similarity-estimation (12), minhash (6), locality-sensitive-hashing (6), jaccard-similarity (6), jaccard-distance (4), minwise-hashing (3), minwise-hashing-algorithm (3), jaccard-index (3), data-mining (2), lsh-algorithm (2), weighted-sets (2), minhash-sketches (2), nodejs (2), sketch (2), cardinality-estimation (2), jaccard (2), hyperloglog (2), minhash-lsh-algorithm (2), pagerank-algorithm (1), ranking-algorithm (1), top-k-retrieval (1), streaming-algorithms (1), pagerank (1), multithreading (1), mmap (1), in-degree (1), hits-algorithm (1), hits (1), sketch-data-structures (1), sketch-algorithm (1), minhash-similarity (1), min-wise-independent-permutations (1), stream-processing (1), probabilistic-data-structures (1), parallel-programming (1), parallel-computing (1), openmp (1), kmer (1), genomics (1), cpp (1), count-distinct (1), rummy-card-game (1), python3 (1), javascript (1), datamining-algorithms (1), datamining (1), data-science (1), word-cloud (1), tf-idf (1), scala (1), big-data-analytics (1), big-data (1), academic-project (1), python (1), numpy (1), superminhash (1), simhashindex (1), simhash (1), web-search (1), k-means-clustering (1), image-retrieval-query (1), elastic-search (1), content-based-image-retrieval (1), computer-vision-python (1), canny-edge-detector (1), bag-of-visual-words (1), bag-of-bags-of-words (1), socket-io (1), redux (1), reactjs (1), networkx-graph (1), mongodb (1), linkprediction (1), katz-similarity (1), fastapi (1), expressjs (1), cn (1), bootstrap (1), adamic-adar-index (1), intersection (1), inclusion-exclusion (1), hyperloglog-sketches (1), estimation (1), cosine-similarity (1), similarity (1), sketching-algorithm (1), sketching (1), similarity-search (1), similarity-metric (1), similarity-measures (1), locality-sensitive (1), jaccard-coefficient (1), hash-algorithm (1), surf-feature-extraction (1), surf-detection (1), surf-descriptor (1), speeded-up-robust-features (1), sift-keypoints (1), sift-descriptors (1), reverse-image-search (1)