An open API service providing repository metadata for many open source software ecosystems.

Topic: "probabilistic-data-structures"

kristoff-it/redis-cuckoofilter

Hashing-function agnostic Cuckoo filters for Redis

Language: Zig - Size: 6.34 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 231 - Forks: 22

crepererum-oss/pdatastructs.rs

Simple probabilistic data structures

Language: Rust - Size: 446 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 145 - Forks: 16

gakhov/pdsa

Probabilistic Data Structures and Algorithms in Python

Language: Python - Size: 354 KB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 129 - Forks: 19

gmcabrita/bloomex 📦

:hibiscus: A pure Elixir implementation of Scalable Bloom Filters

Language: Elixir - Size: 87.9 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 112 - Forks: 8

alecmocatta/streaming_algorithms

Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.

Language: Rust - Size: 122 KB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 86 - Forks: 11

AMDComputeLibraries/morton_filter

A compressed, sparse cuckoo filter (see https://www.vldb.org/pvldb/vol11/p1041-breslow.pdf)

Language: C++ - Size: 83 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 85 - Forks: 17

kristoff-it/zig-cuckoofilter

Production-ready Cuckoo Filters for any C ABI compatible target.

Language: Zig - Size: 128 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 83 - Forks: 3

krakend/bloomfilter

Bloomfilter written in Golang, includes rotation and RPC

Language: Go - Size: 104 KB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 81 - Forks: 15

vedhavyas/cuckoo-filter

Cuckoo Filter: Practically better than bloom filter

Language: Go - Size: 65.4 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 81 - Forks: 4

mattlorimor/ProbabilisticDataStructures

A probabilistic data structures library for C#

Language: C# - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 74 - Forks: 15

gmcabrita/cuckoo 📦

:bird: Cuckoo Filters in Elixir

Language: Elixir - Size: 83 KB - Last synced at: 9 days ago - Pushed at: almost 4 years ago - Stars: 42 - Forks: 0

gyson/blex

Fast Bloom filter with concurrent accessibility, powered by :atomics module.

Language: Elixir - Size: 23.4 KB - Last synced at: 27 days ago - Pushed at: over 4 years ago - Stars: 39 - Forks: 2

Snawoot/bloom

An in-memory bloom filter with persistence and HTTP interface

Language: C - Size: 90.8 KB - Last synced at: 5 days ago - Pushed at: almost 5 years ago - Stars: 34 - Forks: 7

mpope9/exor_filter

Erlang nif for xor_filter. 'Faster and Smaller Than Bloom and Cuckoo Filters'.

Language: C - Size: 2.61 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 32 - Forks: 3

zaghaghi/pdstl

Probabilistic Data Structure Template Library

Language: C++ - Size: 168 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 30 - Forks: 4

dsa0x/sprout

A Go implementation of an in-memory bloom filter, with support for boltdb and badgerdb as optional data persistent storage.

Language: Go - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 28 - Forks: 6

ikegami-yukino/madoka-python

Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)

Language: C++ - Size: 231 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 26 - Forks: 2

srinivasreddy/rust-bloomfilter

🦀 Bloom filter implementation in Rust 🦀

Language: Rust - Size: 17.6 KB - Last synced at: 15 days ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 1

komiya-atsushi/xor-filter

Production-ready Java implementation of the Xor Filter.

Language: Java - Size: 81.1 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 2

simonprickett/python-probabilistic-data-structures

Exploring Probabilistic Data Structures in Python - my 2021 Pycon USA and Australia and Pycon MEA 2022 talk.

Language: Python - Size: 1.8 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

keilerkonzept/topk

Sliding-window and regular top-K sketches, based on HeavyKeeper

Language: Go - Size: 78.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

g-trees/g_trees

A family of randomized, history-independent search tree data structures

Language: TypeScript - Size: 37.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 10 - Forks: 1

theodesp/go-blooms

Simple and efficient bloom filter implementations in Go

Language: Go - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 5

mpope9/efuse_filter

Erlang NIF for Binary Fuse Filter. Fast and Smaller Than Xor Filters.

Language: C - Size: 975 KB - Last synced at: 6 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 2

pnxenopoulos/countminsketch

A Python implementation for the Count-Min Sketch data structure

Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 1

larskuhtz/cuckoo

Haskell Implementation of Cuckoo Filters

Language: Haskell - Size: 87.9 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

metabolist/codable-bloom-filter

An implementation of the Bloom filter data structure conforming to Swift's Codable serialization protocol

Language: Swift - Size: 17.6 KB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 5

FabianaCampanari/PracticalStats-PUCSP-2024

Statistical Measures in Python - Age and Salary Analysis

Language: Jupyter Notebook - Size: 61.6 MB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

arashbm/hyperloglog

HyperLogLog++ Implementation with C++14

Language: C++ - Size: 410 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

arunma/buildx_pdsa

BuildX PDSA - A build_x project on Probabilistic Data Structures in Rust

Language: Rust - Size: 153 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

SarthakMakhija/probabilistic-data-structures

Repository for an article series on probabilistic data structures including Skiplist, bloom filter, counting bloom filter, count sketch, count min sketch etc

Language: Go - Size: 38.1 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

bertof/sbf-rs

Spatial Bloom Filter for Rust

Language: Rust - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

Khalilw1/bloomf

Bloom Filter implementation in JavaScript

Language: JavaScript - Size: 19.5 KB - Last synced at: 8 months ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

wenkesj/khh

Vanilla python implementation of the K- Heavy Hitters algorithm (data-structure)

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

pmcgleenon/heavykeeper-rs

Heavykeeper algorithm for Top-K elephant flows

Language: Rust - Size: 1.24 MB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

ozgrakkurt/filterz

Probabilistic filter implementations. Ribbon, bloom, xor filters.

Language: Zig - Size: 79.1 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

esalini22/gene-hll

HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard

Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jleni/pysketch

pySketch (count-min sketch)

Language: C++ - Size: 4.88 KB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

pmcgleenon/heavykeeper-py

Heavykeeper algorithm for Top-K elephant flows - python

Language: Rust - Size: 22.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

pmcgleenon/jigsaw-sketch

jigsaw sketch implementation

Language: C++ - Size: 1.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

sigpwned/delta4j

Elements for building concurrent and distributed data processing applications

Language: Java - Size: 81.1 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

simonprickett/hd-visual-bloom-filter-for-pi

Visual Bloom Filter Project, High Definition! Unicorn Hat HD, Raspberry Pi, Python 3, Flask, Bulma.

Language: Python - Size: 40 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

VladPodilnyk/probably

A Bloom filter implementation in Go

Language: Go - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Diegomangasco/PyBloom

Bloom filter library written in Python

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

andimiller/probabilistic-playground-web

Teaching tool for explaining TopK Theta Sketches and TopK Tuple Sketches

Language: Scala - Size: 61.5 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

simonprickett/redis-compatible-bloom-filter

Attempt to build a Bloom Filter implementation using a Pimoroni Unicorn Hat on a Raspberry Pi that talks the RESP protocol.

Language: Python - Size: 334 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

cjrh/aggonydb

Aggony DB is a one-trick-pony database that can perform rapid aggregation of many-fields low-cardinality big data

Language: Rust - Size: 241 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

algarecu/ppanf

Approximate Privacy-Preserving Neighbourhood Estimations

Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

axamon/bloomfilter

golang implementation of a thread-safe bloomfilter

Language: Go - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

zaz/bloom-filter

A fast, 70-line bloom filter for JavaScript 6.

Language: JavaScript - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

droberson/archbloom

various probabilistic data structures implemented in C

Language: C - Size: 244 KB - Last synced at: 4 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 1

kiplingw/SkipList

Kip's implementation of William Pugh's (1989) probabilistic linked list with logarithmic access time.

Language: C++ - Size: 23.4 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

irthomasthomas/Instagram-Machine-Learning-Random-Forest-Classifier

Random Forest Classifier to find items for sale on instagram

Language: Python - Size: 206 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fukua95/pds

A set of probabilistic data structures

Language: Go - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rarissimaavis/bfcounter-bloom-filter

Analisi di un tool per il conteggio dei k-mer tramite Bloom Filter

Size: 3.29 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

niravkapoor/Count-Min-Sketch

Language: C# - Size: 2.93 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

dynatrace-research/ultraloglog-paper

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

Language: Python - Size: 4.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

veminovici/aabel-multihash-rs

A rust crate which extends the Hasher trait. The added functionality allows the users to get multiple hash values for any given hashable item.

Language: Rust - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Jeffrey-Joan/Locality-Sensitive-Hashing-based-Sketches

This project aims to provide an unsupervised lightweight solution to estimate the count of various different category of Vehicles. By implementing a novel Locality Sensitive Hashing based sketch.

Language: Python - Size: 408 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

azizkayumov/bloom-filters

BloomFilter is a space efficient storage of sets at the cost of a small overall error probability while maintaining scalability.

Language: Python - Size: 1.04 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KOLANICH-libs/ProbSetAbs.py

An abstraction layer around implementations of probabilistic membership testing

Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Manav-404/Bloom-Filter

A simple bloom filter written in typescript inspired from Arpit Bhayani's youtube video

Language: TypeScript - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

0xchl0e/oomfi

A bloom filter implemented in pure Rust

Language: Rust - Size: 37.1 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Kenwanjohi/bloom-filter

A Javascript Implementation of Bloom filter

Language: JavaScript - Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Aditya-1500/Spam-Ham-Detection

This is an ensemble based spam detection web-application that uses probablistic data-structures based on the paper: A. Singh, S. Batra, Ensemble based spam detection in social IoT using probabilistic data structures, Future Generation Computer Systems (2017), https://doi.org/10.1016/j.future.2017.09.072.

Language: Python - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

pralad-p/pdi

PDI Project

Language: Python - Size: 2.33 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

jaymindesai/sketchy-data-pipelines

Data Pipelines and Sketchy Data Structures for Sub-Linear Retrievals

Language: Python - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

devinmcgloin/probabilistic

Bloom Filters, Min Sketch Count and Min Hashing.

Language: Go - Size: 33.2 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

charlesgregory/bloomd

Bloom Filter and vectorized murmurhash3

Language: D - Size: 3.74 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

derrandz/Bloom-Filter Fork of tarunsharma1/Bloom-Filter

Bloom Filter in Python using Murmurhash3

Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0