Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: data-sketches
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Language: Python - Size: 5.68 MB - Last synced: 5 days ago - Pushed: 7 days ago - Stars: 2,381 - Forks: 289
dynatrace-oss/dynahist
DynaHist: A Dynamic Histogram Library for Java
Language: Java - Size: 1.82 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 41 - Forks: 7
dynatrace-oss/hash4j
Dynatrace hash library for Java
Language: Java - Size: 40.4 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 73 - Forks: 9
dynatrace-research/exaloglog-paper
ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale
Language: Java - Size: 2.27 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 8 - Forks: 1
dynatrace-research/ultraloglog-paper
UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting
Language: Python - Size: 4.23 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
turu/yalal
Yet Another Lame Algorithm Library
Language: Python - Size: 50.8 KB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 0
ikegami-yukino/madoka-python
Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)
Language: C++ - Size: 231 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 25 - Forks: 2
justinfargnoli/simhash
A barebones implementation of the simhash data sketching algorithm.
Language: Go - Size: 7.81 KB - Last synced: 11 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0
isarn/isarn-sketches-spark
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Language: Scala - Size: 1.33 MB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 30 - Forks: 12
andrewmcloud/consimilo
A Clojure library for querying large data-sets on similarity
Language: Clojure - Size: 536 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 62 - Forks: 4
oertl/hyperloglog-sketch-estimation-paper
Paper about the estimation of cardinalities from HyperLogLog sketches
Language: TeX - Size: 51.6 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 51 - Forks: 4
galprz/dns-random-subdomains-ddos-attack
Implementation of "Mitigating DNS random subdomain DDoS attacks by distinct heavy hitters sketches"
Language: Jupyter Notebook - Size: 1.11 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 8 - Forks: 3
erikerlandson/cdf-splining-prototype
A Prototype For Fitting Monotonic Cubic Splines to a t-digest Sketch
Language: Jupyter Notebook - Size: 1.2 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0