An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: streaming-algorithms

OliverHennhoefer/onad

Online Anomaly Detection

Language: Python - Size: 254 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 0

dynatrace-oss/hash4j

Dynatrace hash library for Java

Language: Java - Size: 37.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 118 - Forks: 13

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9,447 - Forks: 228

jianshu93/DartUniFrac

Approximate UniFrac via Weighted MinHash 🦀

Language: Rust - Size: 3.53 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

xakalex/esdmd

Language: Python - Size: 49.8 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

axtonsun/awesome-streaming-graphs

Must-read papers on streaming graphs

Size: 101 KB - Last synced at: 18 days ago - Pushed at: 28 days ago - Stars: 5 - Forks: 2

Waikato/moa

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

Language: Java - Size: 30.6 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 651 - Forks: 363

thomasfazzari1/zig-hll

A fast, memory-efficient HyperLogLog cardinality estimation library for Zig

Language: Zig - Size: 688 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

alecmocatta/streaming_algorithms

Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.

Language: Rust - Size: 122 KB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 88 - Forks: 12

AmoliR/rank1-svd-update

Updating Singular Value Decomposition (SVD) for rank-1 perturbed matrix.

Language: MATLAB - Size: 421 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2

bcgsc/ntCard

Estimating k-mer coverage histogram of genomics data

Language: C++ - Size: 1.24 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 77 - Forks: 9

quantfinlib/screamer

Screamingly fast streaming indicators with C++ performance and Python simplicity.

Language: C++ - Size: 22.4 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 3 - Forks: 1

statsim/preview

Preview large text files online

Language: HTML - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 1

qingyaoz/Stock-Market-Simulation

An application of priority queue and streaming algorithms.

Language: C++ - Size: 163 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

MikeJaredS/hermiter

Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Nonparametric Correlation (Bivariate)

Language: R - Size: 8.82 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3

LaurentRDC/npstreams

Streaming operations on NumPy arrays

Language: Python - Size: 2.83 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 36 - Forks: 1

bmoscon/StreamingAlgorithms

A Set of Streaming Algorithms in C++, Python, and Go

Language: C++ - Size: 61.5 KB - Last synced at: 5 months ago - Pushed at: about 8 years ago - Stars: 33 - Forks: 6

alexander-yu/stream

An online statistics library, written in Go

Language: Go - Size: 1.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 32 - Forks: 4

dccuchile/rivertext

RiverText is a framework that standardizes the Incremental Word Embeddings proposed in the state of the art. Please feel welcome to open an issue if you have any questions, or a pull request if you want to contribute to the project!

Language: Python - Size: 9.85 MB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 22 - Forks: 1

ast-al/rangeless

C++ LINQ-like library of higher-order functions for data manipulation

Language: C++ - Size: 1.28 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 195 - Forks: 6

cmdevries/LMW-tree

Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering

Language: C++ - Size: 74.5 MB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 74 - Forks: 20

Erfanm83/delivery-fare-estimation

A delivery fare estimation script that also filters out suspicious deliveries and incorrect data entries caused by faulty GPS devices.

Language: Go - Size: 4.72 MB - Last synced at: 8 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

sjorek/mediacastsegmenter

Create MPEG2-TS encapsulated stream-segments.

Language: C - Size: 31.3 KB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 8 - Forks: 2

usmanm/redis-tdigest

t-digest module for Redis

Language: C - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 73 - Forks: 12

SebastianSchmidl/pyonlinesvr

Python-Wrapper for Francesco Parrella's OnlineSVR C++ implementation with scikit-learn-compatible interface.

Language: C++ - Size: 206 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

alexander-yu/sketch

Approximation streaming algorithms, written in Go

Language: Go - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Shamir-Lab/Faucet

This is the codebase for Faucet, described in our manuscript: https://academic.oup.com/bioinformatics/article/34/1/147/4004871, by Roye Rozov, Gil Goldshlager, Eran Halperin, and Ron Shamir

Language: C++ - Size: 127 MB - Last synced at: 8 months ago - Pushed at: over 8 years ago - Stars: 18 - Forks: 3

vojtechgadurek/SimpleSetSketching

Bachelor thesis: an implementation of streaming algorithms for finding the symmetric difference of very large and very similar sets

Language: C# - Size: 13.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

adriacabeza/ComplexAlgorithms

🧙🏾‍♂️ Complex Algorithms and Complexity Course from the University of San Diego

Language: Python - Size: 7.77 MB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 2

mehrotrasan16/TwitterAnalyser_StormBoi

A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.

Language: Java - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

babakyousefian/Virtual-Reality

Learn to practice midwifery in the operating room of a VR hospital with AI robots, for the first time anywhere in the world

Language: C# - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

rahulptel/aobd17

Size: 1.05 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

drewm8080/data_mining_streaming_data

A project for streaming algorithms: Bloom filtering, Flajolet-Martin Algorithm, Fixed-Size Sampling

Language: Python - Size: 223 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

PaoloPellizzoni/AdaptiveCoreset

A Java implementation of the experiments for the paper "Dimensionality-adaptive k-center in sliding windows"

Language: Java - Size: 19.2 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ivannikitovic/sublinear

Python library implementing a subset of streaming algorithms. Includes variations of these algorithms (e.g. adversarially robust), as well as support for multiple data types.

Language: Python - Size: 123 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DynoGraph/stinger-dynograph

DynoGraph benchmark suite, implemented using the STINGER graph engine

Language: C - Size: 27.9 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 4

jlumbroso/python-random-hash

A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲

Language: Python - Size: 32.2 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 0

RobertDurfee/GreenwaldKhanna

An implementation of the Greenwald-Khanna approximate quantile streaming algorithm as a Spark user-defined aggregate function.

Language: Scala - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

thanhtbt/aFAPI

[IEEE ICASSP 2023] "Robust Subspace Tracking with Contamination Mitigation via Alpha-Divergence". In 48th IEEE International Conference on Acoustics, Speech, & Signal Processing, 2023.

Language: MATLAB - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

statsim/profile

Profile. Generate data profiles in the browser (work in progress)

Language: JavaScript - Size: 1.27 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

andylamp/moses

Streaming, Memory-Limited, r-truncated SVD Revisited!

Language: MATLAB - Size: 4.5 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 1

NikolasTz/flink_bayesian_networks_monitoring

Distributed and Online Maintenance of Bayesian Networks in Apache Flink

Language: Java - Size: 8.8 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

andylamp/federated_pca

Federated Principal Component Analysis Revisited!

Language: MATLAB - Size: 7.19 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 33 - Forks: 6

ppoum/spigot-pi-asm

Simple implementation of the Pi spigot algorithm, written in assembly

Language: Assembly - Size: 472 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ivannikitovic/Count-Min-Sketch

Implementation of Count-Min Sketch using Python. Count-Min Sketch (CMS) is a probabilistic data structure that serves as a frequency table of events in a stream of data.

Language: Python - Size: 70.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
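
To make the "frequency table of events in a stream" idea concrete, here is a minimal Count-Min Sketch in Python. It is an illustrative sketch only, not the code from the repository listed above: the class name, the `width` and `depth` defaults, and the salted BLAKE2b hashing scheme are all assumptions chosen for brevity.

```python
import hashlib


class CountMinSketch:
    """Illustrative Count-Min Sketch (hypothetical, not the listed repo's API)."""

    def __init__(self, width=1024, depth=4):
        # depth rows of width counters; each row uses its own hash function.
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Assumed hashing scheme: salt the item with the row number so that
        # every row behaves like a different hash function.
        data = f"{row}:{item}".encode("utf-8")
        digest = hashlib.blake2b(data, digest_size=8).digest()
        return int.from_bytes(digest, "big") % self.width

    def add(self, item, count=1):
        # Increment one counter per row.
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions can only inflate a counter, so the minimum across rows
        # is the tightest available upper bound on the true count.
        return min(
            self.table[row][self._index(item, row)]
            for row in range(self.depth)
        )


sketch = CountMinSketch()
for event in ["a", "b", "a", "c", "a"]:
    sketch.add(event)
```

With non-negative updates, `estimate` never undercounts: `sketch.estimate("a")` is at least 3 here, and it may exceed the true count only when other items collide with `"a"` in every row. Memory stays fixed at `width * depth` counters no matter how many distinct events arrive.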

panagiotisl/CoEuS

CoEuS: Community Detection via Seed-set Expansion on Graph Streams

Language: Java - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

soliblue/Foundations-Of-Data-Science

Collection of algorithms covering various topics and demonstrating data science principles and techniques, developed while attending the Foundations of Data Science course at RWTH Aachen University.

Language: Python - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

esalini22/gene-hll

HyperLogLog in C++ and OpenMP for computing genome similarity via the Jaccard index

Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

hishamcse/Data-Structures-and-Algorithms-Specialization

This repository contains all the solutions of assignments, starter files and other materials related to this specialization.

Language: Java - Size: 119 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 4

tarquas/asclasit

ASync CLasses + ASync ITerators

Language: JavaScript - Size: 728 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ocramz/optimization-streaming

Language: Haskell - Size: 7.81 KB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

burglarhobbit/AOBD17_1401063

Language: Python - Size: 124 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 4

csieber/pydashsim

Simulates an HTTP Adaptive Streaming (HAS) session based on a throughput pattern and video segment sizes.

Language: Python - Size: 406 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 0

alessiobernardo/RebalanceDataStream

Language: Java - Size: 4.86 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Sarmentor/IncrementalTextRank

Automatic Keyword/Keyphrase Extraction from Text Streams

Language: R - Size: 15.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

panagiotisl/DiCeS

DiCeS: Distributed Community Detection Over Streams

Language: Java - Size: 3.72 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

duncankmckinnon/online_random_forest_resources

A set of links and repos for modern online random forests

Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Keywords
streaming-algorithms 57 streaming-data 7 online-algorithms 6 hyperloglog 6 statistics 5 java 4 cardinality-estimation 4 streaming 4 data-structures 4 python 4 golang 4 cpp 4 go 3 optimization 3 probabilistic-data-structures 3 subspace-tracking 3 machine-learning 3 graph-algorithms 3 python3 3 big-data 2 machine-learning-algorithms 2 moa 2 streaming-graph-processing 2 incremental-learning 2 dimensionality-reduction 2 quantile 2 pca 2 robust-pca 2 probabilistic-pca 2 numpy 2 online-learning 2 stream-processing 2 linear-discriminant-analysis 2 incremental-pca 2 scikit-learn 2 expectation-maximization 2 locality-sensitive-hashing 2 stats 2 spark 2 hash-functions 2 algorithm 2 singular-value-decomposition 2 np-completeness 2 linear-programming 2 minhash 2 data-profile 1 robust-algorithm 1 outliers 1 non-gaussian 1 contamination 1 data-profiling 1 subspace-identification 1 subspace-learning 1 svd 1 apache-flink 1 apache-kafka 1 approximate-distributed-counters 1 bayesian-networks 1 count-tracking-problem 1 generative-adversarial-network 1 decomposition 1 yoloalgoritms 1 virtual-reality 1 touchpad 1 steamvr 1 steam-api 1 single-shot-multibox-detector 1 r-cnn 1 object-detection 1 localization 1 hog 1 haptic-feedback 1 handtracking 1 deep-reinforcement-learning 1 csharp-script 1 k-mer-counting 1 user-defined-functions 1 user-defined-function 1 spark-streaming 1 quantile-estimation 1 randomized-algorithm 1 flajolet-martin 1 flajolet 1 data-streaming 1 analytic-combinatorics 1 analysis-of-algorithms 1 stinger 1 graph 1 benchmark 1 sliding-windows 1 datamining-algorithms 1 online-singular-value-decomposition 1 online-regression 1 numerical-optimization 1 numerical-methods 1 iterator 1 helpers 1 generator 1 framework 1 class 1