Topic: "probabilistic-data-structures"
kristoff-it/redis-cuckoofilter
Hashing-function agnostic Cuckoo filters for Redis
Language: Zig - Size: 6.34 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 231 - Forks: 22

crepererum-oss/pdatastructs.rs
Simple probabilistic data structures
Language: Rust - Size: 446 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 145 - Forks: 16

gakhov/pdsa
Probabilistic Data Structures and Algorithms in Python
Language: Python - Size: 354 KB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 129 - Forks: 19

gmcabrita/bloomex 📦
🌺 A pure Elixir implementation of Scalable Bloom Filters
Language: Elixir - Size: 87.9 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 112 - Forks: 8

alecmocatta/streaming_algorithms
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Language: Rust - Size: 122 KB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 86 - Forks: 11

AMDComputeLibraries/morton_filter
A compressed, sparse cuckoo filter (see https://www.vldb.org/pvldb/vol11/p1041-breslow.pdf)
Language: C++ - Size: 83 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 85 - Forks: 17

kristoff-it/zig-cuckoofilter
Production-ready Cuckoo Filters for any C ABI compatible target.
Language: Zig - Size: 128 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 83 - Forks: 3

krakend/bloomfilter
Bloomfilter written in Golang, includes rotation and RPC
Language: Go - Size: 104 KB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 81 - Forks: 15

vedhavyas/cuckoo-filter
Cuckoo Filter: Practically better than bloom filter
Language: Go - Size: 65.4 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 81 - Forks: 4

mattlorimor/ProbabilisticDataStructures
A probabilistic data structures library for C#
Language: C# - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 74 - Forks: 15

gmcabrita/cuckoo 📦
🐦 Cuckoo Filters in Elixir
Language: Elixir - Size: 83 KB - Last synced at: 9 days ago - Pushed at: almost 4 years ago - Stars: 42 - Forks: 0

gyson/blex
Fast Bloom filter with concurrent accessibility, powered by :atomics module.
Language: Elixir - Size: 23.4 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 39 - Forks: 2

Snawoot/bloom
An in-memory bloom filter with persistence and HTTP interface
Language: C - Size: 90.8 KB - Last synced at: 5 days ago - Pushed at: almost 5 years ago - Stars: 34 - Forks: 7

mpope9/exor_filter
Erlang nif for xor_filter. 'Faster and Smaller Than Bloom and Cuckoo Filters'.
Language: C - Size: 2.61 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 32 - Forks: 3

zaghaghi/pdstl
Probabilistic Data Structure Template Library
Language: C++ - Size: 168 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 30 - Forks: 4

dsa0x/sprout
A Go implementation of an in-memory bloom filter, with support for boltdb and badgerdb as optional data persistent storage.
Language: Go - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 28 - Forks: 6

ikegami-yukino/madoka-python
Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)
Language: C++ - Size: 231 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 26 - Forks: 2

srinivasreddy/rust-bloomfilter
🦀 Bloom filter implementation in Rust 🦀
Language: Rust - Size: 17.6 KB - Last synced at: 15 days ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 1

komiya-atsushi/xor-filter
Production-ready Java implementation of the Xor Filter.
Language: Java - Size: 81.1 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 2

simonprickett/python-probabilistic-data-structures
Exploring Probabilistic Data Structures in Python - my PyCon USA 2021, PyCon Australia 2021, and PyCon MEA 2022 talk.
Language: Python - Size: 1.8 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

keilerkonzept/topk
Sliding-window and regular top-K sketches, based on HeavyKeeper
Language: Go - Size: 78.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

g-trees/g_trees
A family of randomized, history-independent search tree data structures
Language: TypeScript - Size: 37.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 10 - Forks: 1

theodesp/go-blooms
Simple and efficient bloom filter implementations in Go
Language: Go - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 5

mpope9/efuse_filter
Erlang NIF for Binary Fuse Filter. Fast and Smaller Than Xor Filters.
Language: C - Size: 975 KB - Last synced at: 6 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 2

pnxenopoulos/countminsketch
A Python implementation for the Count-Min Sketch data structure
Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 1

larskuhtz/cuckoo
Haskell Implementation of Cuckoo Filters
Language: Haskell - Size: 87.9 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

metabolist/codable-bloom-filter
An implementation of the Bloom filter data structure conforming to Swift's Codable serialization protocol
Language: Swift - Size: 17.6 KB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 5

FabianaCampanari/PracticalStats-PUCSP-2024
Statistical Measures in Python - Age and Salary Analysis
Language: Jupyter Notebook - Size: 61.6 MB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

arashbm/hyperloglog
HyperLogLog++ Implementation with C++14
Language: C++ - Size: 410 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

arunma/buildx_pdsa
BuildX PDSA - A build_x project on Probabilistic Data Structures in Rust
Language: Rust - Size: 153 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

SarthakMakhija/probabilistic-data-structures
Repository for an article series on probabilistic data structures including Skiplist, bloom filter, counting bloom filter, count sketch, count min sketch etc
Language: Go - Size: 38.1 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

bertof/sbf-rs
Spatial Bloom Filter for Rust
Language: Rust - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

Khalilw1/bloomf
Bloom Filter implementation in JavaScript
Language: JavaScript - Size: 19.5 KB - Last synced at: 8 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

wenkesj/khh
Vanilla Python implementation of the K-Heavy Hitters algorithm (data structure)
Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

pmcgleenon/heavykeeper-rs
Heavykeeper algorithm for Top-K elephant flows
Language: Rust - Size: 1.24 MB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

ozgrakkurt/filterz
Probabilistic filter implementations. Ribbon, bloom, xor filters.
Language: Zig - Size: 79.1 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

esalini22/gene-hll
HyperLogLog in C++ and OpenMP for computing genome similarity via the Jaccard index
Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jleni/pysketch
pySketch (count-min sketch)
Language: C++ - Size: 4.88 KB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

pmcgleenon/heavykeeper-py
Heavykeeper algorithm for Top-K elephant flows - python
Language: Rust - Size: 22.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

pmcgleenon/jigsaw-sketch
jigsaw sketch implementation
Language: C++ - Size: 1.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

sigpwned/delta4j
Elements for building concurrent and distributed data processing applications
Language: Java - Size: 81.1 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

simonprickett/hd-visual-bloom-filter-for-pi
Visual Bloom Filter Project, High Definition! Unicorn Hat HD, Raspberry Pi, Python 3, Flask, Bulma.
Language: Python - Size: 40 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

VladPodilnyk/probably
A Bloom filter implementation in Go
Language: Go - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Diegomangasco/PyBloom
Bloom filter library written in Python
Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

andimiller/probabilistic-playground-web
Teaching tool for explaining TopK Theta Sketches and TopK Tuple Sketches
Language: Scala - Size: 61.5 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

simonprickett/redis-compatible-bloom-filter
Attempt to build a Bloom Filter implementation using a Pimoroni Unicorn Hat on a Raspberry Pi that talks the RESP protocol.
Language: Python - Size: 334 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

cjrh/aggonydb
Aggony DB is a one-trick-pony database that can perform rapid aggregation of many-fields low-cardinality big data
Language: Rust - Size: 241 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

algarecu/ppanf
Approximate Privacy-Preserving Neighbourhood Estimations
Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

axamon/bloomfilter
golang implementation of a thread-safe bloomfilter
Language: Go - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

zaz/bloom-filter
A fast, 70-line bloom filter for JavaScript 6.
Language: JavaScript - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

droberson/archbloom
various probabilistic data structures implemented in C
Language: C - Size: 244 KB - Last synced at: 4 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 1

kiplingw/SkipList
Kip's implementation of William Pugh's (1989) probabilistic linked list with logarithmic access time.
Language: C++ - Size: 23.4 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

irthomasthomas/Instagram-Machine-Learning-Random-Forest-Classifier
Random Forest Classifier to find items for sale on instagram
Language: Python - Size: 206 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fukua95/pds
A set of probabilistic data structures
Language: Go - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rarissimaavis/bfcounter-bloom-filter
Analysis of a tool for k-mer counting using Bloom filters
Size: 3.29 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

niravkapoor/Count-Min-Sketch
Language: C# - Size: 2.93 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

dynatrace-research/ultraloglog-paper
UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting
Language: Python - Size: 4.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

veminovici/aabel-multihash-rs
A rust crate which extends the Hasher trait. The added functionality allows the users to get multiple hash values for any given hashable item.
Language: Rust - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Jeffrey-Joan/Locality-Sensitive-Hashing-based-Sketches
This project aims to provide an unsupervised, lightweight solution for estimating the counts of different categories of vehicles by implementing a novel Locality Sensitive Hashing based sketch.
Language: Python - Size: 408 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

azizkayumov/bloom-filters
BloomFilter is a space efficient storage of sets at the cost of a small overall error probability while maintaining scalability.
Language: Python - Size: 1.04 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KOLANICH-libs/ProbSetAbs.py
An abstraction layer around implementations of probabilistic membership testing
Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Manav-404/Bloom-Filter
A simple Bloom filter written in TypeScript, inspired by Arpit Bhayani's YouTube video
Language: TypeScript - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

0xchl0e/oomfi
A bloom filter implemented in pure Rust
Language: Rust - Size: 37.1 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Kenwanjohi/bloom-filter
A Javascript Implementation of Bloom filter
Language: JavaScript - Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Aditya-1500/Spam-Ham-Detection
This is an ensemble-based spam detection web application that uses probabilistic data structures, based on the paper: A. Singh, S. Batra, Ensemble based spam detection in social IoT using probabilistic data structures, Future Generation Computer Systems (2017), https://doi.org/10.1016/j.future.2017.09.072.
Language: Python - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

pralad-p/pdi
PDI Project
Language: Python - Size: 2.33 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

jaymindesai/sketchy-data-pipelines
Data Pipelines and Sketchy Data Structures for Sub-Linear Retrievals
Language: Python - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

devinmcgloin/probabilistic
Bloom Filters, Count-Min Sketch, and MinHash.
Language: Go - Size: 33.2 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

charlesgregory/bloomd
Bloom Filter and vectorized murmurhash3
Language: D - Size: 3.74 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

derrandz/Bloom-Filter (fork of tarunsharma1/Bloom-Filter)
Bloom Filter in Python using Murmurhash3
Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0
