An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hyperloglog

dynatrace-oss/hash4j

Dynatrace hash library for Java

Language: Java - Size: 37.2 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 112 - Forks: 11

axiomhq/hyperloglog

HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom

Language: Go - Size: 265 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 982 - Forks: 74

dnbaker/sketch

C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings

Language: C++ - Size: 4.43 MB - Last synced at: about 16 hours ago - Pushed at: 11 months ago - Stars: 155 - Forks: 14

alecmocatta/streaming_algorithms

Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.

Language: Rust - Size: 122 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 87 - Forks: 11

thepirat000/CachingFramework.Redis

Distributed caching based on StackExchange.Redis and Redis. Includes support for tagging and is cluster-compatible.

Language: C# - Size: 1.4 MB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 293 - Forks: 52

MalekD5/uniusage

UniUsage is a command-line tool for analyzing user behavior from log files.

Language: Java - Size: 4.99 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ankane/datasketches-ruby

Sketch data structures for Ruby

Language: C++ - Size: 85 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 12 - Forks: 0

Callidon/bloom-filters

JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash

Language: TypeScript - Size: 8.12 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 406 - Forks: 47

Query-farm/datasketches

Integrates DuckDB with the high-performance Apache DataSketches library. This extension enables users to perform approximate analytics on large-scale datasets using state-of-the-art streaming algorithms, all from within DuckDB.

Language: C++ - Size: 316 KB - Last synced at: 2 days ago - Pushed at: 19 days ago - Stars: 21 - Forks: 0

jean-pierreBoth/probminhash

Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms

Language: Rust - Size: 198 KB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 31 - Forks: 3

axiomhq/hyperminhash

HyperMinHash: Bringing intersections to HyperLogLog

Language: Go - Size: 17.6 KB - Last synced at: 3 days ago - Pushed at: over 7 years ago - Stars: 303 - Forks: 18

mrtkp9993/probdsgo

Probabilistic data structures implemented in Go.

Language: Go - Size: 29.3 KB - Last synced at: 3 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

20101301-Alina-Hasan/Parallel-HyperLogLog-Cardinality-Estimation

A Python project demonstrating efficient estimation of unique elements in any dataset using the HyperLogLog algorithm with parallel processing. In this example, we apply the method to a transactional dataset, showcasing data cleaning, visualization, and performance comparisons for scalable cardinality estimation.

Language: Python - Size: 113 KB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ekzhu/datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language: Python - Size: 5.68 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2,699 - Forks: 299

ascv/HyperLogLog

Fast HyperLogLog for Python.

Language: C - Size: 306 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 106 - Forks: 19

xzeck/HyperLogLog

A rust implementation of google's HyperLogLog for cardinality estimation

Language: Rust - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

VladimirDegt/goit-algo2-hw-05

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

cloudflare/cardinality-estimator

A crate for estimating the cardinality of distinct elements in a stream or dataset.

Language: Rust - Size: 571 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 22 - Forks: 5

fcambus/logswan

Fast Web log analyzer using probabilistic data structures

Language: C - Size: 484 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 388 - Forks: 24

retailnext/hllpp

HyperLogLog cardinality estimation algorithm in go/golang!

Language: Go - Size: 64.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 60 - Forks: 12

davidesantangelo/hyll

A Ruby implementation of the HyperLogLog algorithm for efficient cardinality estimation with minimal memory footprint. Count millions of distinct elements using only kilobytes of memory.

Language: Ruby - Size: 133 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 23 - Forks: 0

bcgsc/ntCard

Estimating k-mer coverage histogram of genomics data

Language: C++ - Size: 1.24 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 9

jedisct1/rust-hyperloglog

A HyperLogLog implementation in Rust.

Language: Rust - Size: 123 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 50 - Forks: 19

absolute8511/ZanRedisDB

Yet another distributed fault-tolerant key-value database Compatible with Redis written in Golang.

Language: Go - Size: 21.8 MB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 77 - Forks: 78

oertl/hyperloglog-sketch-estimation-paper

Paper about the estimation of cardinalities from HyperLogLog sketches

Language: TeX - Size: 51.6 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 62 - Forks: 6

LesiaUKR/goit-algo2-hw-05

Master's | Design & Analysis of Algorithms | Algorithms for Big Data Processing

Language: Python - Size: 35.2 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bigmlcom/sketchy

Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)

Language: Clojure - Size: 147 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 148 - Forks: 18

simonprickett/python-probabilistic-data-structures

Exploring Probabilistic Data Structures in Python - my 2021 Pycon USA and Australia and Pycon MEA 2022 talk.

Language: Python - Size: 1.8 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

nidorx/sqlog

SQLog - Connecting the dots

Language: Go - Size: 880 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 29 - Forks: 2

ivoras/goestimators

Fast Cardinality Estimation algorithms for Go: HyperLogLog/SuperLogLog/LogLog, Bloom

Language: Go - Size: 40 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 1

LiveRamp/HyperMinHash-java

Union, intersection, and set cardinality in loglog space

Language: Java - Size: 572 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 10

danchen6/pfutil

Fast and Redis-compatible HyperLogLog extension for Python 3

Language: C - Size: 48.8 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

KnightChaser/HyperLogLog

A simple Python implementation of the HyperLogLog algorithm, a probabilistic data structure used for estimating the cardinality of a set.

Language: Python - Size: 6.84 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

JsIqbal/redis

Complete redis course by stephen grider

Language: Svelte - Size: 289 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

kwertop/gostatix

Thread-safe and persistent Golang implementations of probabilistic data structures: Bloom Filter, Cuckoo Filter, HyperLogLog, Count-Min Sketch and Top-K

Language: Go - Size: 221 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

arashbm/hyperloglog

HyperLogLog++ Implementation with C++14

Language: C++ - Size: 410 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

alexhholmes/inbloom

Rust implementation of a probabilistic multiset cardinality algorithm

Language: Rust - Size: 15.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

vislee/lua-resty-hyperloglog

Hyperloglog for openresty.

Language: Lua - Size: 65.4 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

shabbyrobe/phphll 📦

HyperLogLog for PHP implemented as a C extension

Language: C - Size: 429 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 20 - Forks: 0

sundy-li/simple_hll

A simple HyperLogLog implementation in rust

Language: Rust - Size: 30.3 KB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 2

jlumbroso/java-random-hash

A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲

Language: Java - Size: 726 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

wesdoyle/hyperloglog

Exploring HyperLogLog

Language: Rust - Size: 25.2 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

tabac/hyperloglog.rs

HyperLogLog implementations.

Language: Rust - Size: 970 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 18 - Forks: 8

dnbaker/dashing

Fast and accurate genomic distances using HyperLogLog

Language: C++ - Size: 877 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 160 - Forks: 12

rohra-mehak/Scala-Projects

Language: Scala - Size: 26.4 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

youzan/ZanRedisDB Fork of absolute8511/ZanRedisDB

Yet another distributed fault-tolerant key-value database Compatible with Redis written in Golang.

Language: Go - Size: 7.83 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 387 - Forks: 60

liangyaopei/hyper

Thread safe, concurrent used hyperloglog implemented in Golang, using murmur v3 hash function

Language: Go - Size: 5.86 KB - Last synced at: 9 months ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

dynatrace-research/exaloglog-paper

ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale

Language: Java - Size: 2.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

dynatrace-research/ultraloglog-paper

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

Language: Python - Size: 4.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

algarecu/ppanf

Approximate Privacy-Preserving Neighbourhood Estimations

Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

MatejaMaric/kafka-go-cardinality

Estimating cardinality for a data stream using Go and Apache Kafka

Language: Go - Size: 43.9 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

turu/yalal

Yet Another Lame Algorithm Library

Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

dynatrace-research/set-sketch-paper

SetSketch: Filling the Gap between MinHash and HyperLogLog

Language: C++ - Size: 23.7 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 46 - Forks: 5

gyson/hll

Redis compatible HyperLogLog implementation in Elixir

Language: Elixir - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

ksmin23/streaming-count-sketches-with-hyperloglog-in-redis

Streaming Count Sketches with HyperLogLog in Amazon MemoryDB for Redis

Language: Python - Size: 48.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sasha-s/go-hll

HyperLogLog in golang

Language: Go - Size: 54.7 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 25 - Forks: 3

pNre/Sketching

Collection of sketching algorithms in Swift

Language: Swift - Size: 52.7 KB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 2

matoous/visigo

Unique site visits counter in Go

Language: Go - Size: 21.5 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 0

mbrg/py-hyperminhash

HyperLogLog with intersection

Language: Python - Size: 70.3 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

asjadsyed/AnalyticsMesh

Distributed Cardinality Tracking

Language: Python - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

l0vest0rm/hll

go/golang version of hyperloglog, ported from popular java version java-hll. hyperloglog is an Cardinality estimate algorithm with low memory and low bias

Language: Go - Size: 782 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 0

redis-developer/redisbloom-moby-dick

Experiments with RedisBloom and the text from Moby Dick

Language: Python - Size: 867 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

jlumbroso/python-random-hash

A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲

Language: Python - Size: 32.2 KB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 0

bcmcmill/hyperlog-simd

A SIMD optimized implementation of the HLL and HLL++ algorithms in Rust

Language: Rust - Size: 44.9 KB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

mattlorimor/ProbabilisticDataStructures

A probabilistic data structures library for C#

Language: C# - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 74 - Forks: 15

darkLord19/hyperloglog

HyperLogLog implementation in Go.

Language: Go - Size: 1.95 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

sangupta/dry-redis

In-memory Pure Java implementation to Redis useful for unit testing

Language: Java - Size: 181 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

concealedtea/hll_data

Hyper Log Log analytical data processor for LieYing

Language: Java - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

ethantrott/hyperloglog-estimation

python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, specifically used to estimate the cardinality of unique traffic violations in NYC in the 2019 fiscal year

Language: Python - Size: 94.7 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2

andy2046/gopie

go patterns

Language: Go - Size: 193 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 32 - Forks: 5

mble/hll 📦

HyperLogLog++ implementation in Crystal

Language: Crystal - Size: 34.2 KB - Last synced at: 7 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

wangyi-fudan/wyHLL

The dream accurate approximate set cardinality estimator based on 3-bit HyperLogLog. More accurate than Redis HyperLogLog.

Language: C - Size: 535 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 55 - Forks: 0

Narasimha1997/bit_vector

A header-only bit vector library for C . This can be used for implementing dynamic bit-vectors for building Bloom-Filters and Hyper-Logs .

Language: C - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 0

esalini22/gene-hll

HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard

Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

VolatileDream/count-distinct

Small command line utility taking advantage of HyperLogLog to count distinct lines.

Language: C - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

igdmitrov/dart_hyperloglog

HyperLogLog [HLL] algoritm on Dart

Language: Dart - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

col3name/icounter

Count unique line on big file

Language: Go - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mtchavez/go-hll

Go implementation of Hyper Log Log

Language: Go - Size: 39.1 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 0

shuvava/mssql-hll

HyperLogLog implementation for mssql

Language: TSQL - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

oma219/pacsketch

Network Anomaly Detection Using Probabilistic Data Structures

Language: C++ - Size: 12.8 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

travisbrady/flajolet

Probabilistic data structures for OCaml

Language: OCaml - Size: 220 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 38 - Forks: 3

phungvandat/calculate-ccu

Example calculate current ccu

Language: Go - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

preytaren/datasketch

datasketch in go

Language: Go - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 1

stephengineer/Analytics-Service

:leaves: A RESTful Web Service

Language: Java - Size: 32.2 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

kalaidin/sketches

HyperLogLog and other probabilistic data structures for mining in data streams

Language: Python - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: over 10 years ago - Stars: 15 - Forks: 4

soumyadip007/Redis-Endpoint-Storage-Application-using-Spring-Boot-Rest-and-Microservices

Redis is an in-memory data structure project implementing a distributed, in-memory key-value database with optional durability. Redis supports different kinds of abstract data structures, such as strings, lists, maps, sets, sorted sets, HyperLogLogs, bitmaps, streams, and spatial indexes.

Language: Java - Size: 62.5 KB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

calvinlfer/streaming-analytics

A streaming data pipeline to perform basic analytics with scalability in mind

Language: Scala - Size: 262 KB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

artem0/spark-algebird

Spark with probabilistic algortighmts - Bloom filter, HLL, QTree and Count-min sketch

Language: Scala - Size: 36.1 KB - Last synced at: 14 days ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 3

travisbrady/ccard

Fast Approximate Unique Word Counting (via LogLog-Beta) for the command line

Language: C - Size: 3.38 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 1

maz2198/Stream_Processing

This repository represents several projects completed in IE HST's MS in Business Analytics and Big Data's Stream Processing Analytics course.

Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

venkateshmantha/ITM_Benchmarking

Implementations for FM Sketch, Hyperloglog and Virtual Hyperloglog algorithms tested on real Intenet traffic from CAIDA

Language: Java - Size: 265 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

alexandre-lecoq/HyperLogLog

Implements the Hyper Log Log approximate count-distinct algorithm.

Language: C# - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

guysmoilov/streaming-hll

Extension to Clearspring impl. of HLL++, which allows merging directly from a stream

Language: Java - Size: 67.4 KB - Last synced at: 25 days ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0

B-O-O-P/VK-internship-DB

Language: C++ - Size: 19.5 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

pavponn/UniqCounter

Implementation of HyperLogLog algorithm to count number of unique elements in data stream.

Language: C++ - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

cbuffevant/FSharpHyperLogLog

F# HyperLogLog implementation based on C# Microsoft HyperLogLog

Language: F# - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

influxdata/hll-check

A small tool for comparing HLL/HLL++ implementations

Language: Go - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 1