GitHub topics: streaming-algorithms
Waikato/moa
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
Language: Java - Size: 30.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 655 - Forks: 366
jianshu93/lash
Fast and Memory Efficient Genome Sketching via HyperLogLog, HyperMinHash and UltraLogLog
Language: Rust - Size: 314 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 1
quantfinlib/screamer
Screamingly fast streaming indicators with C++ performance and Python simplicity.
Language: C++ - Size: 22.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4 - Forks: 1
johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Language: Go - Size: 201 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,521 - Forks: 227
dynatrace-oss/hash4j
Dynatrace hash library for Java
Language: Java - Size: 37.9 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 123 - Forks: 14
logannye/emsqrt
Process any data size with a fixed, small memory footprint. EM-√ is an external-memory ETL/log processing engine with hard peak-RAM guarantees. Unlike traditional systems that "try" to stay within memory limits, EM-√ enforces a strict memory cap, enabling you to process arbitrarily large datasets using small memory footprints.
Language: Rust - Size: 112 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0
jianshu93/DartUniFrac
Approximate UniFrac via Weighted MinHash 🦀
Language: Rust - Size: 3.5 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 3 - Forks: 0
xakalex/esdmd
Language: Python - Size: 51.8 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0
OliverHennhoefer/onad
Online Anomaly Detection
Language: Python - Size: 254 MB - Last synced at: 28 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 0
MikeJaredS/hermiter
Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Nonparametric Correlation (Bivariate)
Language: R - Size: 8.82 MB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3
THEForgiven/srtt-preprint
Preprint + minimal, reproducible code/data for the Structure–Randomness Transfer Theorem (SRTT); headless runner, HTML report, CI, Pages.
Language: Shell - Size: 267 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
axtonsun/awesome-streaming-graphs
Must-read papers on streaming graphs
Size: 101 KB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2
alecmocatta/streaming_algorithms
Performant implementations of various streaming algorithms, including Count–min sketch, Top k, HyperLogLog, Reservoir sampling.
Language: Rust - Size: 122 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 88 - Forks: 12
AmoliR/rank1-svd-update
Updating the Singular Value Decomposition (SVD) of a rank-1 perturbed matrix.
Language: MATLAB - Size: 421 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 2
bcgsc/ntCard
Estimating k-mer coverage histogram of genomics data
Language: C++ - Size: 1.24 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 77 - Forks: 9
statsim/preview
Preview large text files online
Language: HTML - Size: 1.47 MB - Last synced at: 16 days ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1
qingyaoz/Stock-Market-Simulation
An application of priority queue and streaming algorithms.
Language: C++ - Size: 163 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
LaurentRDC/npstreams
Streaming operations on NumPy arrays
Language: Python - Size: 2.83 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 36 - Forks: 1
bmoscon/StreamingAlgorithms
A Set of Streaming Algorithms in C++, Python, and Go
Language: C++ - Size: 61.5 KB - Last synced at: 7 months ago - Pushed at: over 8 years ago - Stars: 33 - Forks: 6
alexander-yu/stream
An online statistics library, written in Go
Language: Go - Size: 1.3 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 32 - Forks: 4
dccuchile/rivertext
RiverText is a framework that standardizes the Incremental Word Embeddings proposed in the state of the art. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Language: Python - Size: 9.85 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 22 - Forks: 1
ast-al/rangeless
C++ LINQ-like library of higher-order functions for data manipulation
Language: C++ - Size: 1.28 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 195 - Forks: 6
cmdevries/LMW-tree
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering
Language: C++ - Size: 74.5 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 74 - Forks: 20
Erfanm83/delivery-fare-estimation
A delivery fare estimation script that also filters out suspicious deliveries and incorrect data entries due to faulty GPS devices.
Language: Go - Size: 4.72 MB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0
sjorek/mediacastsegmenter
Create MPEG2-TS encapsulated stream-segments.
Language: C - Size: 31.3 KB - Last synced at: 7 months ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 2
usmanm/redis-tdigest
t-digest module for Redis
Language: C - Size: 29.3 KB - Last synced at: 7 months ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 12
SebastianSchmidl/pyonlinesvr
Python wrapper for Francesco Parrella's OnlineSVR C++ implementation with a scikit-learn-compatible interface.
Language: C++ - Size: 209 KB - Last synced at: 20 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 1
alexander-yu/sketch
Approximation streaming algorithms, written in Go
Language: Go - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0
Shamir-Lab/Faucet
This is the codebase for Faucet, described in our manuscript: https://academic.oup.com/bioinformatics/article/34/1/147/4004871, by Roye Rozov, Gil Goldshlager, Eran Halperin, and Ron Shamir
Language: C++ - Size: 127 MB - Last synced at: 10 months ago - Pushed at: over 8 years ago - Stars: 18 - Forks: 3
vojtechgadurek/SimpleSetSketching
Bachelor thesis - an implementation of streaming algorithms for finding the symmetric difference of very large and very similar sets
Language: C# - Size: 13.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
adriacabeza/ComplexAlgorithms
🧙🏾♂️ Complex Algorithms and Complexity Course from the University of San Diego
Language: Python - Size: 7.77 MB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 2
mehrotrasan16/TwitterAnalyser_StormBoi
A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.
Language: Java - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1
babakyousefian/Virtual-Reality
Learn midwifery procedures in the operating room of a VR hospital with AI robots, for the first time anywhere in the world
Language: C# - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
rahulptel/aobd17
Size: 1.05 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3
drewm8080/data_mining_streaming_data
A project for streaming algorithms: Bloom filtering, Flajolet-Martin Algorithm, Fixed-Size Sampling
Language: Python - Size: 223 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
PaoloPellizzoni/AdaptiveCoreset
A Java implementation of the experiments for the paper "Dimensionality-adaptive k-center in sliding windows"
Language: Java - Size: 19.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0
ivannikitovic/sublinear
Python library implementing a subset of streaming algorithms. Includes variations of these algorithms (e.g. adversarially robust), as well as support for multiple data types.
Language: Python - Size: 123 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
DynoGraph/stinger-dynograph
DynoGraph benchmark suite, implemented using the STINGER graph engine
Language: C - Size: 27.9 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 4
jlumbroso/python-random-hash
A simple, time-tested family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Language: Python - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 0
RobertDurfee/GreenwaldKhanna
An implementation of the Greenwald-Khanna approximate quantile streaming algorithm as a Spark user-defined aggregate function.
Language: Scala - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
thanhtbt/aFAPI
[IEEE ICASSP 2023] "Robust Subspace Tracking with Contamination Mitigation via Alpha-Divergence". In 48th IEEE International Conference on Acoustics, Speech, & Signal Processing, 2023.
Language: MATLAB - Size: 32.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
statsim/profile
Profile. Generate data profiles in the browser (work in progress)
Language: JavaScript - Size: 1.27 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0
andylamp/moses
Streaming, Memory-Limited, r-truncated SVD Revisited!
Language: MATLAB - Size: 4.5 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 19 - Forks: 1
NikolasTz/flink_bayesian_networks_monitoring
Distributed and Online Maintenance of Bayesian Networks in Apache Flink
Language: Java - Size: 8.8 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
andylamp/federated_pca
Federated Principal Component Analysis Revisited!
Language: MATLAB - Size: 7.19 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 33 - Forks: 6
ppoum/spigot-pi-asm
Simple implementation of the Pi spigot algorithm, written in assembly
Language: Assembly - Size: 472 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
ivannikitovic/Count-Min-Sketch
Implementation of Count-Min Sketch using Python. Count-Min Sketch (CMS) is a probabilistic data structure that serves as a frequency table of events in a stream of data.
Language: Python - Size: 70.3 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0
panagiotisl/CoEuS
CoEuS: Community Detection via Seed-set Expansion on Graph Streams
Language: Java - Size: 3.7 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0
soliblue/Foundations-Of-Data-Science
Collection of algorithms covering various topics and demonstrating data science principles and techniques, developed while attending the Foundations of Data Science course at RWTH Aachen University.
Language: Python - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0
esalini22/gene-hll
HyperLogLog in C++ and OpenMP for computing genome similarity via the Jaccard index
Language: C++ - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
hishamcse/Data-Structures-and-Algorithms-Specialization
This repository contains all the solutions of assignments, starter files and other materials related to this specialization.
Language: Java - Size: 119 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 4
tarquas/asclasit
ASync CLasses + ASync ITerators
Language: JavaScript - Size: 728 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
ocramz/optimization-streaming
Language: Haskell - Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1
burglarhobbit/AOBD17_1401063
Language: Python - Size: 124 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 4
csieber/pydashsim
Simulates an HTTP Adaptive Streaming (HAS) session based on a throughput pattern and video segment sizes.
Language: Python - Size: 406 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 0
alessiobernardo/RebalanceDataStream
Language: Java - Size: 4.86 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
Sarmentor/IncrementalTextRank
Automatic Keyword/Keyphrase Extraction from Text Streams
Language: R - Size: 15.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
panagiotisl/DiCeS
DiCeS: Distributed Community Detection Over Streams
Language: Java - Size: 3.72 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
duncankmckinnon/online_random_forest_resources
A set of links and repos for modern online random forests
Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
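Several entries above mention the Count-Min Sketch, described in the ivannikitovic/Count-Min-Sketch entry as a probabilistic frequency table for stream events. For orientation, here is a minimal sketch of the idea in Python. It is not taken from any of the listed repositories; the class name, the default `width` and `depth`, and the choice of salted BLAKE2b as the per-row hash are all assumptions made for illustration.

```python
import hashlib


class CountMinSketch:
    """Illustrative Count-Min Sketch (hypothetical, not from any listed repo)."""

    def __init__(self, width=256, depth=4):
        # depth rows of width counters; more rows lower the chance of a
        # bad overestimate, wider rows lower its typical size.
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # One hash function per row, derived by salting BLAKE2b with the
        # row number (an assumption; any independent hash family works).
        digest = hashlib.blake2b(
            str(item).encode("utf-8"),
            digest_size=8,
            salt=row.to_bytes(8, "little"),
        ).digest()
        return int.from_bytes(digest, "little") % self.width

    def add(self, item, count=1):
        # Increment one counter in every row.
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions only ever inflate counters, so the minimum across
        # rows is the tightest available upper bound on the true count.
        return min(
            self.table[row][self._index(item, row)]
            for row in range(self.depth)
        )
```

The estimate never undercounts: for non-negative updates it is always at least the true frequency and at most the total number of updates. Memory stays fixed at `width * depth` counters regardless of how many distinct items the stream contains.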