Topic: "streaming-processing"
benedekrozemberczki/NestedSubtreeHash
A distributed implementation of "Nested Subtree Hash Kernels for Large-Scale Graph Classification Over Streams" (ICDM 2012).
Language: Python - Size: 151 KB - Last synced at: 9 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 8
OliverHennhoefer/onad
Online Anomaly Detection
Language: Python - Size: 254 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0
judeleonard/Kafka-Streaming-Pipeline
This project aims to optimize a bank marketing model by building an event streaming pipeline around Apache Kafka and its ecosystem that communicates with a machine learning model microservice. The pipeline is used to display the likelihood and status of bank customers in real time.
Language: Python - Size: 2.82 MB - Last synced at: almost 3 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0
tmph2003/Streaming-Project-with-Flink
Language: Python - Size: 30.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
meowpunch/ApacheSparkWithScala
Apache Spark with Scala: hands-on with big data
Language: Scala - Size: 43.9 KB - Last synced at: almost 3 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0
IBMStreams/sample.netflow
Netflow sample
Language: JavaScript - Size: 40.3 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1
YehiaGewily/real-time-arbitrage-engine
High-frequency arbitrage detection system built with Python and Apache Flink. Features a decoupled microservices architecture with Kafka streaming, real-time windowed analytics, a Streamlit executive dashboard, and automated Discord notifications.
Language: Python - Size: 9.69 MB - Last synced at: 14 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0
gerardodavidlopezcastillo/Cloud9KinesisAthena_Public
Streaming data analysis with AWS tools. Events are generated in the cloud from Cloud9 and sent with boto3 to Kinesis Data Firehose, which delivers them to an S3 bucket destination as .parquet files. A data catalog created with Glue enables real-time querying of all records with Athena.
Language: Python - Size: 1.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
gerardodavidlopezcastillo/SimulateDataStreamingIPYNB_Public
Python code that simulates random mobile-app events in two scenarios: Technology E-commerce and Megastore. The simulation generates large-scale data that can be processed with data engineering tools.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0