GitHub topics: hyperloglog
dynatrace-oss/hash4j
Dynatrace hash library for Java
Language: Java - Size: 37.2 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 112 - Forks: 11

axiomhq/hyperloglog
HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom
Language: Go - Size: 265 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 982 - Forks: 74

dnbaker/sketch
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Language: C++ - Size: 4.43 MB - Last synced at: about 16 hours ago - Pushed at: 11 months ago - Stars: 155 - Forks: 14

alecmocatta/streaming_algorithms
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Language: Rust - Size: 122 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 87 - Forks: 11

thepirat000/CachingFramework.Redis
Distributed caching based on StackExchange.Redis and Redis. Includes support for tagging and is cluster-compatible.
Language: C# - Size: 1.4 MB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 293 - Forks: 52

MalekD5/uniusage
UniUsage is a command-line tool for analyzing user behavior from log files.
Language: Java - Size: 4.99 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ankane/datasketches-ruby
Sketch data structures for Ruby
Language: C++ - Size: 85 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 12 - Forks: 0

Callidon/bloom-filters
JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Language: TypeScript - Size: 8.12 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 406 - Forks: 47

Query-farm/datasketches
Integrates DuckDB with the high-performance Apache DataSketches library. This extension enables users to perform approximate analytics on large-scale datasets using state-of-the-art streaming algorithms, all from within DuckDB.
Language: C++ - Size: 316 KB - Last synced at: 2 days ago - Pushed at: 19 days ago - Stars: 21 - Forks: 0

jean-pierreBoth/probminhash
Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms
Language: Rust - Size: 198 KB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 31 - Forks: 3

axiomhq/hyperminhash
HyperMinHash: Bringing intersections to HyperLogLog
Language: Go - Size: 17.6 KB - Last synced at: 3 days ago - Pushed at: over 7 years ago - Stars: 303 - Forks: 18

mrtkp9993/probdsgo
Probabilistic data structures implemented in Go.
Language: Go - Size: 29.3 KB - Last synced at: 3 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

20101301-Alina-Hasan/Parallel-HyperLogLog-Cardinality-Estimation
A Python project demonstrating efficient estimation of unique elements in any dataset using the HyperLogLog algorithm with parallel processing. In this example, we apply the method to a transactional dataset, showcasing data cleaning, visualization, and performance comparisons for scalable cardinality estimation.
Language: Python - Size: 113 KB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Language: Python - Size: 5.68 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2,699 - Forks: 299

ascv/HyperLogLog
Fast HyperLogLog for Python.
Language: C - Size: 306 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 106 - Forks: 19

xzeck/HyperLogLog
A rust implementation of google's HyperLogLog for cardinality estimation
Language: Rust - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

VladimirDegt/goit-algo2-hw-05
Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

cloudflare/cardinality-estimator
A crate for estimating the cardinality of distinct elements in a stream or dataset.
Language: Rust - Size: 571 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 22 - Forks: 5

fcambus/logswan
Fast Web log analyzer using probabilistic data structures
Language: C - Size: 484 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 388 - Forks: 24

retailnext/hllpp
HyperLogLog cardinality estimation algorithm in go/golang!
Language: Go - Size: 64.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 60 - Forks: 12

davidesantangelo/hyll
A Ruby implementation of the HyperLogLog algorithm for efficient cardinality estimation with minimal memory footprint. Count millions of distinct elements using only kilobytes of memory.
Language: Ruby - Size: 133 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 23 - Forks: 0

bcgsc/ntCard
Estimating k-mer coverage histogram of genomics data
Language: C++ - Size: 1.24 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 9

jedisct1/rust-hyperloglog
A HyperLogLog implementation in Rust.
Language: Rust - Size: 123 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 50 - Forks: 19

absolute8511/ZanRedisDB
Yet another distributed fault-tolerant key-value database Compatible with Redis written in Golang.
Language: Go - Size: 21.8 MB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 77 - Forks: 78

oertl/hyperloglog-sketch-estimation-paper
Paper about the estimation of cardinalities from HyperLogLog sketches
Language: TeX - Size: 51.6 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 62 - Forks: 6

LesiaUKR/goit-algo2-hw-05
Master's | Design & Analysis of Algorithms | Algorithms for Big Data Processing
Language: Python - Size: 35.2 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bigmlcom/sketchy
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Language: Clojure - Size: 147 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 148 - Forks: 18

simonprickett/python-probabilistic-data-structures
Exploring Probabilistic Data Structures in Python - my 2021 Pycon USA and Australia and Pycon MEA 2022 talk.
Language: Python - Size: 1.8 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

nidorx/sqlog
SQLog - Connecting the dots
Language: Go - Size: 880 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 29 - Forks: 2

ivoras/goestimators
Fast Cardinality Estimation algorithms for Go: HyperLogLog/SuperLogLog/LogLog, Bloom
Language: Go - Size: 40 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 1

LiveRamp/HyperMinHash-java
Union, intersection, and set cardinality in loglog space
Language: Java - Size: 572 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 10

danchen6/pfutil
Fast and Redis-compatible HyperLogLog extension for Python 3
Language: C - Size: 48.8 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

KnightChaser/HyperLogLog
A simple Python implementation of the HyperLogLog algorithm, a probabilistic data structure used for estimating the cardinality of a set.
Language: Python - Size: 6.84 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

JsIqbal/redis
Complete redis course by stephen grider
Language: Svelte - Size: 289 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

kwertop/gostatix
Thread-safe and persistent Golang implementations of probabilistic data structures: Bloom Filter, Cuckoo Filter, HyperLogLog, Count-Min Sketch and Top-K
Language: Go - Size: 221 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

arashbm/hyperloglog
HyperLogLog++ Implementation with C++14
Language: C++ - Size: 410 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

alexhholmes/inbloom
Rust implementation of a probabilistic multiset cardinality algorithm
Language: Rust - Size: 15.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

vislee/lua-resty-hyperloglog
Hyperloglog for openresty.
Language: Lua - Size: 65.4 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

shabbyrobe/phphll 📦
HyperLogLog for PHP implemented as a C extension
Language: C - Size: 429 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 20 - Forks: 0

sundy-li/simple_hll
A simple HyperLogLog implementation in rust
Language: Rust - Size: 30.3 KB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 2

jlumbroso/java-random-hash
A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
Language: Java - Size: 726 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

wesdoyle/hyperloglog
Exploring HyperLogLog
Language: Rust - Size: 25.2 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

tabac/hyperloglog.rs
HyperLogLog implementations.
Language: Rust - Size: 970 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 18 - Forks: 8

dnbaker/dashing
Fast and accurate genomic distances using HyperLogLog
Language: C++ - Size: 877 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 160 - Forks: 12

rohra-mehak/Scala-Projects
Language: Scala - Size: 26.4 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

youzan/ZanRedisDB Fork of absolute8511/ZanRedisDB
Yet another distributed fault-tolerant key-value database Compatible with Redis written in Golang.
Language: Go - Size: 7.83 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 387 - Forks: 60

liangyaopei/hyper
Thread safe, concurrent used hyperloglog implemented in Golang, using murmur v3 hash function
Language: Go - Size: 5.86 KB - Last synced at: 9 months ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

dynatrace-research/exaloglog-paper
ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale
Language: Java - Size: 2.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

dynatrace-research/ultraloglog-paper
UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting
Language: Python - Size: 4.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

algarecu/ppanf
Approximate Privacy-Preserving Neighbourhood Estimations
Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

MatejaMaric/kafka-go-cardinality
Estimating cardinality for a data stream using Go and Apache Kafka
Language: Go - Size: 43.9 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

turu/yalal
Yet Another Lame Algorithm Library
Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

dynatrace-research/set-sketch-paper
SetSketch: Filling the Gap between MinHash and HyperLogLog
Language: C++ - Size: 23.7 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 46 - Forks: 5

gyson/hll
Redis compatible HyperLogLog implementation in Elixir
Language: Elixir - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

ksmin23/streaming-count-sketches-with-hyperloglog-in-redis
Streaming Count Sketches with HyperLogLog in Amazon MemoryDB for Redis
Language: Python - Size: 48.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sasha-s/go-hll
HyperLogLog in golang
Language: Go - Size: 54.7 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 25 - Forks: 3

pNre/Sketching
Collection of sketching algorithms in Swift
Language: Swift - Size: 52.7 KB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 2

matoous/visigo
Unique site visits counter in Go
Language: Go - Size: 21.5 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 0

mbrg/py-hyperminhash
HyperLogLog with intersection
Language: Python - Size: 70.3 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

asjadsyed/AnalyticsMesh
Distributed Cardinality Tracking
Language: Python - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

l0vest0rm/hll
go/golang version of hyperloglog, ported from popular java version java-hll. hyperloglog is an Cardinality estimate algorithm with low memory and low bias
Language: Go - Size: 782 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 0

redis-developer/redisbloom-moby-dick
Experiments with RedisBloom and the text from Moby Dick
Language: Python - Size: 867 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

jlumbroso/python-random-hash
A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Language: Python - Size: 32.2 KB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 0

bcmcmill/hyperlog-simd
A SIMD optimized implementation of the HLL and HLL++ algorithms in Rust
Language: Rust - Size: 44.9 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

mattlorimor/ProbabilisticDataStructures
A probabilistic data structures library for C#
Language: C# - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 74 - Forks: 15

darkLord19/hyperloglog
HyperLogLog implementation in Go.
Language: Go - Size: 1.95 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

sangupta/dry-redis
In-memory Pure Java implementation to Redis useful for unit testing
Language: Java - Size: 181 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

concealedtea/hll_data
Hyper Log Log analytical data processor for LieYing
Language: Java - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

ethantrott/hyperloglog-estimation
python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, specifically used to estimate the cardinality of unique traffic violations in NYC in the 2019 fiscal year
Language: Python - Size: 94.7 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2

andy2046/gopie
go patterns
Language: Go - Size: 193 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 32 - Forks: 5

mble/hll 📦
HyperLogLog++ implementation in Crystal
Language: Crystal - Size: 34.2 KB - Last synced at: 7 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

wangyi-fudan/wyHLL
The dream accurate approximate set cardinality estimator based on 3-bit HyperLogLog. More accurate than Redis HyperLogLog.
Language: C - Size: 535 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 55 - Forks: 0

Narasimha1997/bit_vector
A header-only bit vector library for C . This can be used for implementing dynamic bit-vectors for building Bloom-Filters and Hyper-Logs .
Language: C - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 0

esalini22/gene-hll
HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard
Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

VolatileDream/count-distinct
Small command line utility taking advantage of HyperLogLog to count distinct lines.
Language: C - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

igdmitrov/dart_hyperloglog
HyperLogLog [HLL] algoritm on Dart
Language: Dart - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

col3name/icounter
Count unique line on big file
Language: Go - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mtchavez/go-hll
Go implementation of Hyper Log Log
Language: Go - Size: 39.1 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 0

shuvava/mssql-hll
HyperLogLog implementation for mssql
Language: TSQL - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

oma219/pacsketch
Network Anomaly Detection Using Probabilistic Data Structures
Language: C++ - Size: 12.8 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

travisbrady/flajolet
Probabilistic data structures for OCaml
Language: OCaml - Size: 220 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 38 - Forks: 3

phungvandat/calculate-ccu
Example calculate current ccu
Language: Go - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

preytaren/datasketch
datasketch in go
Language: Go - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 1

stephengineer/Analytics-Service
:leaves: A RESTful Web Service
Language: Java - Size: 32.2 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

kalaidin/sketches
HyperLogLog and other probabilistic data structures for mining in data streams
Language: Python - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: over 10 years ago - Stars: 15 - Forks: 4

soumyadip007/Redis-Endpoint-Storage-Application-using-Spring-Boot-Rest-and-Microservices
Redis is an in-memory data structure project implementing a distributed, in-memory key-value database with optional durability. Redis supports different kinds of abstract data structures, such as strings, lists, maps, sets, sorted sets, HyperLogLogs, bitmaps, streams, and spatial indexes.
Language: Java - Size: 62.5 KB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

calvinlfer/streaming-analytics
A streaming data pipeline to perform basic analytics with scalability in mind
Language: Scala - Size: 262 KB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

artem0/spark-algebird
Spark with probabilistic algortighmts - Bloom filter, HLL, QTree and Count-min sketch
Language: Scala - Size: 36.1 KB - Last synced at: 14 days ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 3

travisbrady/ccard
Fast Approximate Unique Word Counting (via LogLog-Beta) for the command line
Language: C - Size: 3.38 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 1

maz2198/Stream_Processing
This repository represents several projects completed in IE HST's MS in Business Analytics and Big Data's Stream Processing Analytics course.
Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

venkateshmantha/ITM_Benchmarking
Implementations for FM Sketch, Hyperloglog and Virtual Hyperloglog algorithms tested on real Intenet traffic from CAIDA
Language: Java - Size: 265 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

alexandre-lecoq/HyperLogLog
Implements the Hyper Log Log approximate count-distinct algorithm.
Language: C# - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

guysmoilov/streaming-hll
Extension to Clearspring impl. of HLL++, which allows merging directly from a stream
Language: Java - Size: 67.4 KB - Last synced at: 25 days ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0

B-O-O-P/VK-internship-DB
Language: C++ - Size: 19.5 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

pavponn/UniqCounter
Implementation of HyperLogLog algorithm to count number of unique elements in data stream.
Language: C++ - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

cbuffevant/FSharpHyperLogLog
F# HyperLogLog implementation based on C# Microsoft HyperLogLog
Language: F# - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

influxdata/hll-check
A small tool for comparing HLL/HLL++ implementations
Language: Go - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 1
