Topic: "probabilistic-data-structures"
kristoff-it/redis-cuckoofilter
Hashing-function agnostic Cuckoo filters for Redis
Language: Zig - Size: 6.34 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 233 - Forks: 22
crepererum-oss/pdatastructs.rs
Simple probabilistic data structures
Language: Rust - Size: 386 KB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 147 - Forks: 20
gakhov/pdsa
Probabilistic Data Structures and Algorithms in Python
Language: Python - Size: 354 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 130 - Forks: 19
gmcabrita/bloomex 📦
:hibiscus: A pure Elixir implementation of Scalable Bloom Filters
Language: Elixir - Size: 87.9 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 112 - Forks: 8
krakend/bloomfilter
Bloomfilter written in Golang, includes rotation and RPC
Language: Go - Size: 110 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 88 - Forks: 17
alecmocatta/streaming_algorithms
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Language: Rust - Size: 122 KB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 88 - Forks: 12
AMDComputeLibraries/morton_filter
A compressed, sparse cuckoo filter (see https://www.vldb.org/pvldb/vol11/p1041-breslow.pdf)
Language: C++ - Size: 83 KB - Last synced at: 7 months ago - Pushed at: almost 5 years ago - Stars: 85 - Forks: 17
kristoff-it/zig-cuckoofilter
Production-ready Cuckoo Filters for any C ABI compatible target.
Language: Zig - Size: 128 KB - Last synced at: 13 days ago - Pushed at: almost 3 years ago - Stars: 82 - Forks: 3
vedhavyas/cuckoo-filter
Cuckoo Filter: Practically better than bloom filter
Language: Go - Size: 65.4 KB - Last synced at: 18 days ago - Pushed at: over 6 years ago - Stars: 81 - Forks: 4
mattlorimor/ProbabilisticDataStructures
A probabilistic data structures library for C#
Language: C# - Size: 1.91 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 74 - Forks: 15
gmcabrita/cuckoo 📦
:bird: Cuckoo Filters in Elixir
Language: Elixir - Size: 83 KB - Last synced at: 20 days ago - Pushed at: over 4 years ago - Stars: 42 - Forks: 0
gyson/blex
Fast Bloom filter with concurrent accessibility, powered by :atomics module.
Language: Elixir - Size: 23.4 KB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 39 - Forks: 2
mpope9/exor_filter
Erlang nif for xor_filter. 'Faster and Smaller Than Bloom and Cuckoo Filters'.
Language: C - Size: 2.61 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 34 - Forks: 3
Snawoot/bloom
An in-memory bloom filter with persistence and HTTP interface
Language: C - Size: 90.8 KB - Last synced at: 5 months ago - Pushed at: over 5 years ago - Stars: 34 - Forks: 7
zaghaghi/pdstl
Probabilistic Data Structure Template Library
Language: C++ - Size: 168 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 30 - Forks: 4
dsa0x/sprout
A Go implementation of an in-memory bloom filter, with support for boltdb and badgerdb as optional data persistent storage.
Language: Go - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 6
ikegami-yukino/madoka-python
Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)
Language: C++ - Size: 231 KB - Last synced at: about 16 hours ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 3
srinivasreddy/rust-bloomfilter
🦀 Bloom filter implementation in Rust 🦀
Language: Rust - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 18 - Forks: 1
komiya-atsushi/xor-filter
Production-ready Java implementation of the Xor Filter.
Language: Java - Size: 81.1 KB - Last synced at: 7 months ago - Pushed at: almost 6 years ago - Stars: 17 - Forks: 2
simonprickett/python-probabilistic-data-structures
Exploring Probabilistic Data Structures in Python - my 2021 Pycon USA and Australia and Pycon MEA 2022 talk.
Language: Python - Size: 1.8 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 2
keilerkonzept/topk
Sliding-window and regular top-K sketches, based on HeavyKeeper
Language: Go - Size: 78.1 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 10 - Forks: 1
g-trees/g_trees
A family of randomized, history-independent search tree data structures
Language: TypeScript - Size: 37.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 1
steelcake/filterz
Probabilistic filter implementations. Ribbon, bloom, xor filters.
Language: Zig - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 1
theodesp/go-blooms
Simple and efficient bloom filter implementations in Go
Language: Go - Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 5
mpope9/efuse_filter
Erlang NIF for Binary Fuse Filter. Fast and Smaller Than Xor Filters.
Language: C - Size: 975 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 2
pnxenopoulos/countminsketch
A Python implementation for the Count-Min Sketch data structure
Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 1
larskuhtz/cuckoo
Haskell Implementation of Cuckoo Filters
Language: Haskell - Size: 87.9 KB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2
metabolist/codable-bloom-filter
An implementation of the Bloom filter data structure conforming to Swift's Codable serialization protocol
Language: Swift - Size: 17.6 KB - Last synced at: 20 days ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 6
pmcgleenon/heavykeeper-rs
Heavykeeper algorithm for Top-K elephant flows
Language: Rust - Size: 1.28 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 4 - Forks: 2
arashbm/hyperloglog
HyperLogLog++ Implementation with C++14
Language: C++ - Size: 431 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0
caiocdcs/probz
A Zig library for probabilistic data structures, like Bloom Filter, HyperLogLog, etc.
Language: Zig - Size: 81.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0
FabianaCampanari/PracticalStats-PUCSP-2024
Statistical Measures in Python - Age and Salary Analysis
Language: Jupyter Notebook - Size: 61.6 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0
arunma/buildx_pdsa
BuildX PDSA - A build_x project on Probabilistic Data Structures in Rust
Language: Rust - Size: 153 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
SarthakMakhija/probabilistic-data-structures
Repository for an article series on probabilistic data structures including Skiplist, bloom filter, counting bloom filter, count sketch, count min sketch etc
Language: Go - Size: 38.1 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2
bertof/sbf-rs
Spatial Bloom Filter for Rust
Language: Rust - Size: 46.9 KB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1
Khalilw1/bloomf
Bloom Filter implementation in JavaScript
Language: JavaScript - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0
wenkesj/khh
Vanilla python implementation of the K- Heavy Hitters algorithm (data-structure)
Language: Python - Size: 5.86 KB - Last synced at: 9 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 1
esalini22/gene-hll
HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante Ãndice de Jaccard
Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
jleni/pysketch
pySketch (count-min sketch)
Language: C++ - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0
pmcgleenon/heavykeeper-py
Heavykeeper algorithm for Top-K elephant flows - python
Language: Python - Size: 1.2 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0
qoofa/bloom-filter-redis
🌸 Bloom filter for username existence checker using Redis Stack, ioredis, and Node.js HTTP server. Efficiently prevents duplicate usernames with fast, low-memory probabilistic checks. Easy to set up with Docker — ideal for scalable user validation! 🚀
Language: JavaScript - Size: 12.7 KB - Last synced at: about 20 hours ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
pmcgleenon/jigsaw-sketch
jigsaw sketch implementation
Language: C++ - Size: 1.15 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0
sigpwned/delta4j
Elements for building concurrent and distributed data processing applications
Language: Java - Size: 81.1 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
simonprickett/hd-visual-bloom-filter-for-pi
Visual Bloom Filter Project, High Definition! Unicorn Hat HD, Raspberry Pi, Python 3, Flask, Bulma.
Language: Python - Size: 40 KB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
VladPodilnyk/probably
A Bloom filter implementation in Go
Language: Go - Size: 9.77 KB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
Diegomangasco/PyBloom
Bloom filter library written in Python
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0
andimiller/probabilistic-playground-web
Teaching tool for explaining TopK Theta Sketches and TopK Tuple Sketches
Language: Scala - Size: 61.5 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
simonprickett/redis-compatible-bloom-filter
Attempt to build a Bloom Filter implementation using a Pimoroni Unicorn Hat on a Raspberry Pi that talks the RESP protocol.
Language: Python - Size: 334 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0
cjrh/aggonydb
Aggony DB is a one-trick-pony database that can perform rapid aggregation of many-fields low-cardinality big data
Language: Rust - Size: 241 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0
algarecu/ppanf
Approximate Privacy-Preserving Neighbourhood Estimations
Language: Python - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0
axamon/bloomfilter
golang implementation of a thread-safe bloomfilter
Language: Go - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0
zaz/bloom-filter
A fast, 70-line bloom filter for JavaScript 6.
Language: JavaScript - Size: 30.3 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 1 - Forks: 0
Miguel2604/EnhancedBloomFilter
Enhanced Learned Bloom Filter with cache optimization, incremental learning, and adaptive threshold control
Language: Python - Size: 5.31 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
queelius/maph
Space-efficient approximate mappings using perfect hash functions. Supports arbitrary function approximation (X→Y) with configurable storage (8/16/32/64-bit) and accuracy trade-offs.
Language: C++ - Size: 12.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
fullscreen-triangle/autobahn
A Biological Metabolism Computer for Probabilistic Information Processing
Language: Rust - Size: 8.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
mrtkp9993/probdsgo
Probabilistic data structures implemented in Go.
Language: Go - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
droberson/archbloom
various probabilistic data structures implemented in C
Language: C - Size: 244 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1
kiplingw/SkipList
Kip's implementation of William Pugh's (1989) probabilistic linked list with logarithmic access time.
Language: C++ - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0
irthomasthomas/Instagram-Machine-Learning-Random-Forest-Classifier
Random Forest Classifier to find items for sale on instagram
Language: Python - Size: 206 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0
cxljs/pds
A set of probabilistic data structures
Language: Go - Size: 9.77 KB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0
rarissimaavis/bfcounter-bloom-filter
Analisi di un tool per il conteggio dei k-mer tramite Bloom Filter
Size: 3.29 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0
niravkapoor/Count-Min-Sketch
Language: C# - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
dynatrace-research/ultraloglog-paper
UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting
Language: Python - Size: 4.23 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
veminovici/aabel-multihash-rs
A rust crate which extends the Hasher trait. The added functionality allows the users to get multiple hash values for any given hashable item.
Language: Rust - Size: 21.5 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
Jeffrey-Joan/Locality-Sensitive-Hashing-based-Sketches
This project aims to provide an unsupervised lightweight solution to estimate the count of various different category of Vehicles. By implementing a novel Locality Sensitive Hashing based sketch.
Language: Python - Size: 408 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
azizkayumov/bloom-filters
BloomFilter is a space efficient storage of sets at the cost of a small overall error probability while maintaining scalability.
Language: Python - Size: 1.04 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
KOLANICH-libs/ProbSetAbs.py
An abstraction layer around implementations of probabilistic membership testing
Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
Manav-404/Bloom-Filter
A simple bloom filter written in typescript inspired from Arpit Bhayani's youtube video
Language: TypeScript - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
0xchl0e/oomfi
A bloom filter implemented in pure Rust
Language: Rust - Size: 37.1 KB - Last synced at: about 6 hours ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
Kenwanjohi/bloom-filter
A Javascript Implementation of Bloom filter
Language: JavaScript - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
Aditya-1500/Spam-Ham-Detection
This is an ensemble based spam detection web-application that uses probablistic data-structures based on the paper: A. Singh, S. Batra, Ensemble based spam detection in social IoT using probabilistic data structures, Future Generation Computer Systems (2017), https://doi.org/10.1016/j.future.2017.09.072.
Language: Python - Size: 2.36 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0
pralad-p/pdi
PDI Project
Language: Python - Size: 2.33 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
jaymindesai/sketchy-data-pipelines
Data Pipelines and Sketchy Data Structures for Sub-Linear Retrievals
Language: Python - Size: 1.38 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1
devinmcgloin/probabilistic
Bloom Filters, Min Sketch Count and Min Hashing.
Language: Go - Size: 33.2 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0
charlesgregory/bloomd
Bloom Filter and vectorized murmurhash3
Language: D - Size: 3.74 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
derrandz/Bloom-Filter Fork of tarunsharma1/Bloom-Filter
Bloom Filter in Python using Murmurhash3
Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0