Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: probabilistic-data-structures

irthomasthomas/Instagram-Machine-Learning-Random-Forest-Classifier

Random Forest Classifier to find items for sale on instagram

Language: Python - Size: 206 MB - Last synced: 18 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

kristoff-it/redis-cuckoofilter

Hashing-function agnostic Cuckoo filters for Redis

Language: Zig - Size: 6.34 MB - Last synced: 17 days ago - Pushed: about 4 years ago - Stars: 227 - Forks: 22

dynatrace-research/ultraloglog-paper

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

Language: Python - Size: 4.23 MB - Last synced: 26 days ago - Pushed: 27 days ago - Stars: 0 - Forks: 0

algarecu/ppanf

Approximate Privacy-Preserving Neighbourhood Estimations

Language: Python - Size: 28.3 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

Kenwanjohi/bloom-filter

A Javascript Implementation of Bloom filter

Language: JavaScript - Size: 3.91 KB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

sigpwned/delta4j

Elements for building concurrent and distributed data processing applications

Language: Java - Size: 81.1 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

alecmocatta/streaming_algorithms

Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.

Language: Rust - Size: 122 KB - Last synced: 27 days ago - Pushed: over 1 year ago - Stars: 81 - Forks: 10

crepererum-oss/pdatastructs.rs

Simple probabilistic data structures

Language: Rust - Size: 379 KB - Last synced: 1 day ago - Pushed: 3 months ago - Stars: 140 - Forks: 16

kristoff-it/zig-cuckoofilter

Production-ready Cuckoo Filters for any C ABI compatible target.

Language: Zig - Size: 128 KB - Last synced: 17 days ago - Pushed: over 1 year ago - Stars: 78 - Forks: 3

Snawoot/bloom

An in-memory bloom filter with persistence and HTTP interface

Language: C - Size: 90.8 KB - Last synced: 29 days ago - Pushed: almost 4 years ago - Stars: 33 - Forks: 7

veminovici/aabel-multihash-rs

A rust crate which extends the Hasher trait. The added functionality allows the users to get multiple hash values for any given hashable item.

Language: Rust - Size: 21.5 KB - Last synced: 18 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

gakhov/pdsa

Probabilistic Data Structures and Algorithms in Python

Language: Python - Size: 354 KB - Last synced: about 1 month ago - Pushed: about 4 years ago - Stars: 113 - Forks: 18

AMDComputeLibraries/morton_filter

A compressed, sparse cuckoo filter (see https://www.vldb.org/pvldb/vol11/p1041-breslow.pdf)

Language: C++ - Size: 83 KB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 78 - Forks: 17

VladPodilnyk/probably

A Bloom filter implementation in Go

Language: Go - Size: 9.77 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

zaghaghi/pdstl

Probabilistic Data Structure Template Library

Language: C++ - Size: 168 KB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 28 - Forks: 4

vedhavyas/cuckoo-filter

Cuckoo Filter: Practically better than bloom filter

Language: Go - Size: 65.4 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 81 - Forks: 4

metabolist/codable-bloom-filter

An implementation of the Bloom filter data structure conforming to Swift's Codable serialization protocol

Language: Swift - Size: 17.6 KB - Last synced: 10 days ago - Pushed: about 3 years ago - Stars: 5 - Forks: 2

Jeffrey-Joan/Locality-Sensitive-Hashing-based-Sketches

This project aims to provide an unsupervised lightweight solution to estimate the count of various different category of Vehicles. By implementing a novel Locality Sensitive Hashing based sketch.

Language: Python - Size: 408 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

mpope9/exor_filter

Erlang nif for xor_filter. 'Faster and Smaller Than Bloom and Cuckoo Filters'.

Language: C - Size: 2.61 MB - Last synced: 16 days ago - Pushed: over 2 years ago - Stars: 31 - Forks: 3

simonprickett/python-probabilistic-data-structures

Exploring Probabilistic Data Structures in Python - my 2021 Pycon USA and Australia and Pycon MEA 2022 talk.

Language: Python - Size: 1.8 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 12 - Forks: 2

krakend/bloomfilter

Bloomfilter written in Golang, includes rotation and RPC

Language: Go - Size: 151 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 70 - Forks: 13

azizkayumov/bloom-filters

BloomFilter is a space efficient storage of sets at the cost of a small overall error probability while maintaining scalability.

Language: Python - Size: 1.04 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

gyson/blex

Fast Bloom filter with concurrent accessibility, powered by :atomics module.

Language: Elixir - Size: 23.4 KB - Last synced: 30 days ago - Pushed: over 3 years ago - Stars: 38 - Forks: 2

gmcabrita/bloomex 📦

:hibiscus: A pure Elixir implementation of Scalable Bloom Filters

Language: Elixir - Size: 87.9 KB - Last synced: 19 days ago - Pushed: almost 3 years ago - Stars: 113 - Forks: 8

Diegomangasco/PyBloom

Bloom filter library written in Python

Language: Python - Size: 15.6 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

Aditya-1500/Spam-Ham-Detection

This is an ensemble based spam detection web-application that uses probablistic data-structures based on the paper: A. Singh, S. Batra, Ensemble based spam detection in social IoT using probabilistic data structures, Future Generation Computer Systems (2017), https://doi.org/10.1016/j.future.2017.09.072.

Language: Python - Size: 2.36 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ikegami-yukino/madoka-python

Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)

Language: C++ - Size: 231 KB - Last synced: 19 days ago - Pushed: over 5 years ago - Stars: 25 - Forks: 2

Manav-404/Bloom-Filter

A simple bloom filter written in typescript inspired from Arpit Bhayani's youtube video

Language: TypeScript - Size: 12.7 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

dsa0x/sprout

A Go implementation of an in-memory bloom filter, with support for boltdb and badgerdb as optional data persistent storage.

Language: Go - Size: 146 KB - Last synced: 10 months ago - Pushed: about 2 years ago - Stars: 26 - Forks: 6

theodesp/go-blooms

Simple and efficient bloom filter implementations in Go

Language: Go - Size: 7.81 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 8 - Forks: 5

gmcabrita/cuckoo 📦

:bird: Cuckoo Filters in Elixir

Language: Elixir - Size: 83 KB - Last synced: 17 days ago - Pushed: almost 3 years ago - Stars: 42 - Forks: 0

larskuhtz/cuckoo

Haskell Implementation of Cuckoo Filters

Language: Haskell - Size: 87.9 KB - Last synced: 5 days ago - Pushed: over 2 years ago - Stars: 6 - Forks: 1

mattlorimor/ProbabilisticDataStructures

A probabilistic data structures library for C#

Language: C# - Size: 1.91 MB - Last synced: 11 months ago - Pushed: about 3 years ago - Stars: 74 - Forks: 15

chloe0x0/oomfi

A bloom filter implemented in pure Rust

Language: Rust - Size: 37.1 KB - Last synced: 15 days ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

arashbm/hyperloglog

HyperLogLog++ Implementation with C++14

Language: C++ - Size: 411 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 3 - Forks: 0

bertof/sbf-rs

Spatial Bloom Filter for Rust

Language: Rust - Size: 46.9 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1

arunma/buildx_pdsa

BuildX PDSA - A build_x project on Probabilistic Data Structures in Rust

Language: Rust - Size: 153 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 1

andimiller/probabilistic-playground-web

Teaching tool for explaining TopK Theta Sketches and TopK Tuple Sketches

Language: Scala - Size: 61.5 KB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

pnxenopoulos/countminsketch

A Python implementation for the Count-Min Sketch data structure

Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 7 - Forks: 1

simonprickett/redis-compatible-bloom-filter

Attempt to build a Bloom Filter implementation using a Pimoroni Unicorn Hat on a Raspberry Pi that talks the RESP protocol.

Language: Python - Size: 334 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

zaz/bloom-filter

A fast, 70-line bloom filter for JavaScript 6.

Language: JavaScript - Size: 30.3 KB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 1 - Forks: 0

esalini22/gene-hll

HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard

Language: C++ - Size: 185 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

komiya-atsushi/xor-filter

Production-ready Java implementation of the Xor Filter.

Language: Java - Size: 81.1 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 18 - Forks: 2

SarthakMakhija/probabilistic-data-structures

Repository for an article series on probabilistic data structures including Skiplist, bloom filter, counting bloom filter, count sketch, count min sketch etc

Language: Go - Size: 38.1 KB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

srinivasreddy/rust-bloomfilter

🦀 Bloom filter implementation in Rust 🦀

Language: Rust - Size: 17.6 KB - Last synced: 9 days ago - Pushed: over 3 years ago - Stars: 18 - Forks: 1

mpope9/efuse_filter

Erlang NIF for Binary Fuse Filter. Fast and Smaller Than Xor Filters.

Language: C - Size: 975 KB - Last synced: 1 day ago - Pushed: almost 3 years ago - Stars: 6 - Forks: 2

cjrh/aggonydb

Aggony DB is a one-trick-pony database that can perform rapid aggregation of many-fields low-cardinality big data

Language: Rust - Size: 241 KB - Last synced: 19 days ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

wenkesj/khh

Vanilla python implementation of the K- Heavy Hitters algorithm (data-structure)

Language: Python - Size: 5.86 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1

KOLANICH-libs/ProbSetAbs.py

An abstraction layer around implementations of probabilistic membership testing

Language: Python - Size: 7.81 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

simonprickett/hd-visual-bloom-filter-for-pi

Visual Bloom Filter Project, High Definition! Unicorn Hat HD, Raspberry Pi, Python 3, Flask, Bulma.

Language: Python - Size: 40 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

charlesgregory/bloomd

Bloom Filter and vectorized murmurhash3

Language: D - Size: 3.74 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

jaymindesai/sketchy-data-pipelines

Data Pipelines and Sketchy Data Structures for Sub-Linear Retrievals

Language: Python - Size: 1.38 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1

pralad-p/pdi

PDI Project

Language: Python - Size: 2.33 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

axamon/bloomfilter

golang implementation of a thread-safe bloomfilter

Language: Go - Size: 10.7 KB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 0

devinmcgloin/probabilistic

Bloom Filters, Min Sketch Count and Min Hashing.

Language: Go - Size: 33.2 KB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

Khalilw1/bloomf

Bloom Filter implementation in JavaScript

Language: JavaScript - Size: 19.5 KB - Last synced: 20 days ago - Pushed: almost 6 years ago - Stars: 3 - Forks: 0

derrandz/Bloom-Filter Fork of tarunsharma1/Bloom-Filter

Bloom Filter in Python using Murmurhash3

Language: Python - Size: 20.5 KB - Last synced: about 1 year ago - Pushed: almost 8 years ago - Stars: 0 - Forks: 0

jleni/pysketch

pySketch (count-min sketch)

Language: C++ - Size: 4.88 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0

Related Keywords
probabilistic-data-structures 58 bloom-filter 27 data-structures 13 python 9 cuckoo-filter 8 hyperloglog 7 count-min-sketch 6 rust 5 elixir 4 xor-filter 4 bloomfilter 4 filter 3 bloom 3 golang 3 redis 3 erlang 3 probabilistic-programming 3 streaming-algorithms 2 probabilistic 2 hashing 2 big-data 2 nif 2 hex 2 scalable-bloom-filters 2 go 2 set 2 cpp 2 data-sketches 2 java 2 distributed-computing 2 cardinality 2 zig 2 cardinality-estimation 2 bloom-filters 2 machine-learning 2 count-distinct 2 spatial-bloom-filter 1 theta-sketch 1 genomics 1 jaccard-distance 1 jaccard-index 1 jaccard-similarity 1 jaccard-similarity-estimation 1 kmer 1 openmp 1 hacktoberfest 1 bloomfilter-rust 1 stable-bloom-filters 1 csharp 1 counting-bloom-filters 1 machine-learning-algorithms 1 c-sharp 1 set-membership 1 haskell 1 probabilistic-models 1 random-forest 1 javascript 1 hash-functions 1 string-matching 1 pdi 1 pdb 1 data-integration 1 kafka 1 flask 1 elasticsearch 1 data-pipelines 1 raspberry-pi 1 pimoroni-unicorn-hat-hd 1 pimoroni 1 iot 1 flask-application 1 bulma 1 membership-testing 1 k-heavy-hitters 1 algorithms 1 datasketches 1 aggregation 1 fuse-filter 1 skiplist 1 stream-processing 1 parallel-programming 1 parallel-computing 1 probabilistic-filters 1 mortonfilter 1 morton-filter 1 membership-query 1 membership-queries 1 data-sketching 1 cuckoofilter 1 rank 1 quantile-estimation 1 ultraloglog 1 membership 1 cython 1 hash 1 hyperanf 1 privacy 1 bloom-server 1 bloom-persistent 1 storage-engines 1