An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-stream

strimzi/strimzi-kafka-operator

Apache Kafka® running on Kubernetes

Language: Java - Size: 91.5 MB - Last synced at: about 19 hours ago - Pushed at: about 23 hours ago - Stars: 5,196 - Forks: 1,355

reugn/go-streams

A lightweight stream processing library for Go

Language: Go - Size: 575 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,020 - Forks: 164

ConduitIO/conduit

Conduit streams data between data stores. Kafka Connect replacement. No JVM required.

Language: Go - Size: 12.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 463 - Forks: 50

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

Size: 843 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 13,568 - Forks: 2,568

OpenSCAP/openscap

NIST Certified SCAP 1.2 toolkit

Language: XSLT - Size: 30.6 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 1,475 - Forks: 394

GridProtectionAlliance/openHistorian

The Open Source Time-Series Data Historian

Language: TypeScript - Size: 2.59 GB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 180 - Forks: 49

tylertreat/BoomFilters

Probabilistic data structures for processing continuous, unbounded streams.

Language: Go - Size: 130 KB - Last synced at: 12 days ago - Pushed at: about 4 years ago - Stars: 1,608 - Forks: 114

Correia-jpv/fucking-awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴

Size: 678 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 10 - Forks: 1

a2u/gh-repos

🗂 List all repositories on Github (separated by language)

Size: 93.9 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 27

MentatInnovations/datastream.io

An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana

Language: Python - Size: 2.47 MB - Last synced at: 10 days ago - Pushed at: about 5 years ago - Stars: 906 - Forks: 168

vladimirvivien/automi

A stream processing API for Go (alpha)

Language: Go - Size: 2.27 MB - Last synced at: 13 days ago - Pushed at: 25 days ago - Stars: 789 - Forks: 62

Bilpapster/stream-DaQ

A highly-configurable, real-time data quality monitoring tool designed for streaming data

Language: Python - Size: 32.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 8 - Forks: 0

wish44165/ntd

CVPR 2024 (Seattle, USA) - CLVision Workshop

Language: Python - Size: 26.4 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

yuvadm/rivulet

Elengant asynchronous data streams

Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Spreads/Spreads

Series and Panels for Real-time and Exploratory Analysis of Data Streams

Language: C# - Size: 18 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 432 - Forks: 39

chandler767/FakeNewsStream

A real-time streaming application designed to detect and display potentially fake news articles

Language: Go - Size: 30 MB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

pjechris/CohesionKit

Single source of truth library

Language: Swift - Size: 1.34 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

E3-JSI/StreamStoryPyClient

StreamStory python client

Language: Python - Size: 291 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

giuliano-macedo/clusopt_core

Clustream, Streamkm++ and metrics utilities C/C++ bindings for python

Language: C++ - Size: 396 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 16 - Forks: 1

daq-tools/lorrystream

A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.

Language: Python - Size: 260 KB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

bakdata/streams-explorer

Explore Apache Kafka data pipelines in Kubernetes.

Language: Python - Size: 3.63 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 45 - Forks: 5

green-coder/cdc

A library for performing Content-Defined Chunking (CDC) on data streams.

Language: Rust - Size: 28.3 KB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 5

GMAP/DSPBench

a suite of benchmark applications for distributed data stream processing systems

Language: Java - Size: 250 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 28 - Forks: 3

dataphos/schema-registry

Schema Registry is a product used for schema management and message validation.

Language: Go - Size: 1.13 MB - Last synced at: 21 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

ertis-research/kafka-ml

Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)

Language: Python - Size: 5.44 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 181 - Forks: 25

mhahsler/rEMM

Language: R - Size: 421 KB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

midassystems/midas-rs

A Rust-based client library for interacting with the midas-server, offering high-performance data streaming and binary file storage using the MBN encoding format.

Language: Rust - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

GridProtectionAlliance/PMUConnectionTester

Verifies data streams from synchrophasor measurement devices

Language: C# - Size: 32.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 9

midassystems/midas-py

A Python client library for interacting with the midas-server, providing data streaming and file storage using MBN binary encoding.

Language: Python - Size: 54.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Y-debug-sys/UCL-sketch

Official Implementation of "Learning-based Sketches for Frequency Estimation in Data Streams without Ground Truth"

Language: Jupyter Notebook - Size: 17 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rijalghodi/linguatube-ui

Learn english while watching youtube

Language: TypeScript - Size: 1.05 MB - Last synced at: 22 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

dataphos/lib-brokers

lib-brokers is a Go library which contains the interfaces used to interact with messaging systems without relying on a specific technology or client library. This library attempts to solve the issue of properly abstracting away the interaction between applications and messaging systems.

Language: Go - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jiowchern/Regulus.Remote

A simple C# network library.

Language: C# - Size: 75.5 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 30 - Forks: 7

alipsgh/tornado

The Tornado :tornado: framework, designed and implemented for adaptive online learning and data stream mining in Python.

Language: Python - Size: 17.2 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 129 - Forks: 30

AliNajafi1998/ComStream

In this project, we implemented a topic detection system on Twitter. This system reads tweets from a data stream and assigns them to one of the existing clusters or a new one. Each cluster acts as an agent, which makes the proposed approach a multi-agent system. There is also a coordinator, who monitors the whole system and coordinates the agent.

Language: Python - Size: 1.66 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 6

sanjeev8988/DAE-Drift-Detection-Based-Ensemble-Classifier

Drift-detection-based Adaptive Ensemble Classifier

Language: Python - Size: 41 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

BCG-X-Official/fluxus

Python framework for concurrent data flows

Language: Python - Size: 1.92 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

ominibyte/richflow

A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.

Language: JavaScript - Size: 122 KB - Last synced at: 24 days ago - Pushed at: over 7 years ago - Stars: 22 - Forks: 1

openfun/ralph

:gear: Ralph, the ultimate Learning Record Store (and more!) for your learning analytics

Language: Python - Size: 14.3 MB - Last synced at: about 24 hours ago - Pushed at: 7 months ago - Stars: 37 - Forks: 14

dataphos/lib-streamproc

A Go library that exposes executors, interfaces, data structures, and utility functions which combined a universal stream processor, invariant to any specific messaging system.

Language: Go - Size: 82 KB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

GustavoHFMO/IDPSO-ELM-S

Algorithms proposed in the following paper: OLIVEIRA, Gustavo HFMO et al. Time series forecasting in the presence of concept drift: A pso-based approach. In: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, 2017. p. 239-246.

Language: Python - Size: 5.49 MB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 0

rezacsedu/Mining-Maximal-Frequent-Pattern-Spark

Implementation of Static mining part of "Mining maximal frequent patterns in transactional databases and dynamic data streams: A spark-based approach" Information Sciences, Volume 432, March 2018, Pages 278-300

Language: Java - Size: 37.1 KB - Last synced at: 28 days ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

grebtsew/Visualize-Realtime-Data-Stream-Chart-in-Flask

Automate Visualization of realtime data streams in chart.JS using Flask in python3.

Language: HTML - Size: 27.5 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 4

scramjetorg/framework-js

Simple yet powerful live data computation framework.

Language: TypeScript - Size: 5.58 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 38 - Forks: 0

tideland/gocells

Event Based Applications [DEPRECATED]

Language: Go - Size: 2.24 MB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 67 - Forks: 6

jgaud/streamndr

Novelty detection for data streams in Python

Language: Python - Size: 1.05 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 11 - Forks: 1

alexkross/AIR

Real-time data stream classification and knowledge generation engine with no dependencies

Language: Pony - Size: 1.08 MB - Last synced at: 13 days ago - Pushed at: almost 8 years ago - Stars: 8 - Forks: 1

GustavoHFMO/SISC

Algorithms proposed in the following master dissertation: OLIVEIRA, Gustavo Henrique Ferreira de Miranda. Previsão de séries temporais na presença de mudança de conceito: uma abordagem baseada em PSO. 2018. Dissertação de Mestrado. Universidade Federal de Pernambuco.

Language: Python - Size: 9.52 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

scramjetorg/scramjet

Public tracker for Scramjet Cloud Platform, a platform that bring data from many environments together.

Size: 2.71 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 253 - Forks: 20

natkaida/missing_k_numbers

Finding missing k numbers in a data stream using symm functions

Language: Python - Size: 4.88 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mitjafelicijan/nanocloudlogger 📦

Simple cloud based logger for microcontrollers.

Language: Python - Size: 57.6 KB - Last synced at: 30 days ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 1

ewpratten/dreamcatcher 📦

A service that aggregates amateur radio data worldwide in real time for public use

Language: Java - Size: 184 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

HawkBitPhp/datastream

Compressed binary data designed for inter-process communication.

Language: PHP - Size: 19.5 KB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

Lefteris-Souflas/Azure-Stream-Analytics-ATM-Transactions

Azure Stream Analytics processes ATM transaction data streams, employing Event Hub, Storage, and Stream Analytics Job. Queries include total amounts and alerts. The setup and query execution process are documented with screenshots.

Language: JavaScript - Size: 3.45 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

openenergi/go-event-hub

Library to connect to the Azure Event Hub via AMQP 1.0 for the Go programming language (Golang) based on Apache Qpid Proton (an AMQP 1.0 C library)

Language: Go - Size: 44.9 KB - Last synced at: 10 months ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 2

zaniors/RxJS

RXJS NOTE

Language: JavaScript - Size: 267 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

jomariya23156/sales-forecast-mlops-at-scale

Full-stack Highly Scalable Cloud-native Machine Learning system for demand forecasting with realtime data streaming, inference, retraining loop, and more

Language: Jupyter Notebook - Size: 9.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MengLinMaker/ESP32-data-stream-comparisons 📦

Comparing data streaming methods from ESP32

Language: C++ - Size: 39.1 KB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

lukablagoje/dynamic-XGBoost-model-data-stream-prediction

XGBoost model on a data stream to predict stock prices

Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MengLinMaker/IMU-Webserial-Visualiser Fork of Autodrop3d/serialTerminal.com

Visualising IMU orientation using "Three.js" via the experimental "Web Serial API".

Language: JavaScript - Size: 95.7 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

adriel1997/Twitter-Sentiment-Analysis

Twitter based sentiment analysis using JAVA and Hadoop. In this project we are doing the sentiment analysis on twitter data to analyse whether the tweets posted by people are positive or negative or neutral by checking the tweets with the AFFIN dictionary which has a set of 2500 words along with the value of each word ranging from -5 to +5 denoting whether tweets are positive or negative.

Language: Java - Size: 340 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 2

ogozuacik/d3-discriminative-drift-detector-concept-drift

unsupervised concept drift detection

Language: Python - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 10

ogozuacik/one-class-drift-detection

unsupervised concept drift detection with one-class classifiers

Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 13 - Forks: 1

waylongo/denstream

Python implementation of DenStream clustering

Language: Jupyter Notebook - Size: 971 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 5

Western-OC2-Lab/OASW-Concept-Drift-Detection-and-Adaptation

An online learning method used to address concept drift and model drift. Code for the paper entitled "A Lightweight Concept Drift Detection and Adaptation Framework for IoT Data Streams" published in IEEE Internet of Things Magazine.

Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 13

EthanYuan/open-transaction-pool

A CKB Open Transaction solution based on memory pool.

Language: Rust - Size: 1.66 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

Ville-Eurometropole-Strasbourg/image_compare

comparaison d'image raster avec openlayers4

Language: JavaScript - Size: 4.26 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 3

stefannesic/streamRHF

Unsupervised anomaly detection for data streams

Language: Cython - Size: 2.16 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

buildersoftio/vortex

Buildersoft Cerebro Project

Language: C# - Size: 351 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

mrdcvlsc/BytePadding

A collection of different byte padding methods

Language: C++ - Size: 473 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

snuspl/pluto

MIST: High-performance IoT Stream Processing

Language: Java - Size: 2.85 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 17 - Forks: 3

xxxnell/flex

Probabilistic deep learning for data streams.

Language: Scala - Size: 41.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 125 - Forks: 14

Western-OC2-Lab/PWPAE-Concept-Drift-Detection-and-Adaptation

Data stream analytics: Implement online learning methods to address concept drift and model drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" published in IEEE GlobeCom 2021.

Language: Jupyter Notebook - Size: 4.91 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 186 - Forks: 44

shayneobrien/text-cluster

Offline and online (i.e., real-time) annotated clustering methods for text data.

Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 5

PetoLau/ClipStream

ClipStream - multiple data streams clustering method

Language: R - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 2

faguzz/streamChaos

R package for analysis of nonlinear data streams

Language: R - Size: 123 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

pratikgajjar/mysql-data-stream-kafka

This project create data stream from mysql using replication protocols and ingest into kafka. You can create event driven system using this.

Language: Python - Size: 7.81 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MengLinMaker/Hip-Motion-Capture

Creating a fall detection device

Language: C++ - Size: 863 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

bakdata/quick

The Fastest Way to Create Live Data Products

Language: Java - Size: 3.16 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

SixingYan/Sketch-for-Data-Stream

Language: Python - Size: 21.3 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

w4k2/if-des-imb-stream

Elsevier Information Fusion Journal submission

Language: Python - Size: 196 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

tideland/go-cells

Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors

Language: Go - Size: 84 KB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Ismailhachimi/Concept-Drift

Concept Drift Detection Through Resampling - Algorithms Implementation

Language: Jupyter Notebook - Size: 141 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 1

doganulus/reelay-codegen

A code generator from high-level formal specifications for monitoring and pattern matching sequential/temporal data.

Language: Python - Size: 7.87 MB - Last synced at: 18 days ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 1

rpj/rpi

RPJiOS: RPJ's RPi OS, a sensor data platform for the Raspberry Pi built with python2.7 and redis.

Language: Python - Size: 71.3 KB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 25 - Forks: 1

ertis-research/DDNN

A low-latency and fault-tolerant framework for Distributed and Deep Neural Networks over the Cloud-to-Things Continuum

Language: Python - Size: 88.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 3

rschlaefli/anomaly-detection

Report on Anomaly Detection Methods for Data Streams

Language: TeX - Size: 3.8 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Western-OC2-Lab/MSANA-Online-Data-Stream-Analytics-And-Concept-Drift-Adaptation

Data stream analytics: Implement online learning methods to address concept drift in dynamic data streams. Code for the paper entitled "A Multi-Stage Automated Online Network Data Stream Analytics Framework for IIoT Systems" published in IEEE Transactions on Industrial Informatics.

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 4

vbernardes/minas

Python implementation of the MINAS novelty detection algorithm for data streams.

Language: Python - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 1

JayKumarr/OSGM

This code belongs to paper entitled "An Online Semantic-enhanced Graphical Model for Evolving Short Text Stream Clustering"

Language: Python - Size: 838 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

JayKumarr/OSDM

This code belongs to ACL conference paper entitled as "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"

Language: Python - Size: 831 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 14 - Forks: 4

aws-samples/analyzing-reddit-sentiment-with-aws

Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level tutorial.

Language: Python - Size: 3.48 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 41 - Forks: 16

Data-Wrangling-with-JavaScript/Chapter-7

Code examples for Chapter 7 of Data Wrangling with JavaScript

Language: JavaScript - Size: 752 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 3

scramjetorg/framework-cpp

Simple yet powerful live data computation framework. C++ port of Scramjet framework.

Language: C++ - Size: 207 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

gabyarte/flink-exercises

Flink exercises for UPM's Master in Data Science's Cloud Computing course

Language: Java - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

TxusLopez/CURIE

Data stream mining extracts information from large quantities of data flowing fast and continuously (data streams). They are usually affected by changes in the data distribution, giving rise to a phenomenon referred to as concept drift. Thus, learning models must detect and adapt to such changes, so as to exhibit a good predictive performance after a drift has occurred. In this regard, the development of effective drift detection algorithms becomes a key factor in data stream mining. In this work we propose CU RIE, a drift detector relying on cellular automata. Specifically, in CU RIE the distribution of the data stream is represented in the grid of a cellular automata, whose neighborhood rule can then be utilized to detect possible distribution changes over the stream. Computer simulations are presented and discussed to show that CU RIE, when hybridized with other base learners, renders a competitive behavior in terms of detection metrics and classification accuracy. CU RIE is compared with well-established drift detectors over synthetic datasets with varying drift characteristics.

Language: Python - Size: 194 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

phanvinh0526/data-stream-analysis-with-storm

My minor research in 10 months | Data Stream Analysis with Storm Apache | Frequent Pattern Mining on Item, Itemset | Sep 2014 - July 2015

Language: HTML - Size: 10.5 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

douglas444/minas-reference-implementation

Reference implementation for MINAS (MultI-class learNing Algorithm for data Streams), an algorithm to address novelty detection in data streams multi-class problems.

Language: Java - Size: 95.5 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 1

gretaivan/crypto-trading-bot

Crypto Currency Trading Bot in Python (desktop app) and API- utilising binance and bitmex APIs. Contains live data streams and multithreading.

Language: Python - Size: 141 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

guyfrancoeur/ciclad

ciclad C++ :: A super fast Streaming, memory ultra-lite, sliding-window Closed Itemset Miner

Language: C++ - Size: 30.2 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 8