GitHub topics: minhash
hscspring/sto
MinHash and LSH Based Store and Query.
Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

zxmeng/SimilarityDetection
Similarity Detection on Wikipedia Articles using MinHash and Random Projection implemented in Hadoop/Spark
Language: Java - Size: 69.5 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

tkukurin/Lab.Bioinformatics
University work. Approximate aligner for long DNA sequences. Estimates Jaccard similarity from k-mers via minimizers and MinHash, then uses it as a sequence identity proxy.
Language: Java - Size: 90.3 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

joaocps/mpei-bloomfilter
Probabilistic methods for computer engineering - Final Project
Language: Java - Size: 596 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

DearMadMan/minhash
An implementation of the minhash algorithm in golang
Language: Go - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

user-cube/NewsAnalyzer
Tool to analyze news from a dataset.
Language: Java - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

LuoZijun/rust-jieba
Rust jieba
Language: Rust - Size: 1.97 MB - Last synced at: 5 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

npredey/GeneNetworks
Language: Python - Size: 60.5 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

coderthetyler/mhash-c
An implementation of the MinHashing algorithm in C using POSIX threads.
Language: C - Size: 3.86 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

anastasia/minhash
Language: Python - Size: 16.6 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

CharuMehndiratta/CSE549
Min Hash and Containment Hash implementation for long reads in C++
Language: C++ - Size: 1.3 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

worldofnick/Machine-Learning
Collection of code covering various topics in Machine Learning
Language: Jupyter Notebook - Size: 3.48 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

zeitunik/Big-Data
Big data homework solutions
Language: Python - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0
