An open API service providing repository metadata for many open source software ecosystems.

Topic: "streaming-processing"

benedekrozemberczki/NestedSubtreeHash

A distributed implementation of "Nested Subtree Hash Kernels for Large-Scale Graph Classification Over Streams" (ICDM 2012).

Language: Python - Size: 151 KB - Last synced at: 9 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 8

OliverHennhoefer/onad

Online Anomaly Detection

Language: Python - Size: 254 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

judeleonard/Kafka-Streaming-Pipeline

The goal of this project is aimed at optimizing Bank Marketing Model through building an event streaming pipeline around Apache Kafka and its ecosystem that communicates with a Machine learning model microservice. Utilizing this to display the likelihood and status of Bank Customers in real time.

Language: Python - Size: 2.82 MB - Last synced at: almost 3 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

tmph2003/Streaming-Project-with-Flink

Language: Python - Size: 30.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

meowpunch/ApacheSparkWithScala

Apache Spark With Scala - hands on with big data

Language: Scala - Size: 43.9 KB - Last synced at: almost 3 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

IBMStreams/sample.netflow

Netflow sample

Language: JavaScript - Size: 40.3 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

YehiaGewily/real-time-arbitrage-engine

High-frequency arbitrage detection system built with Python and Apache Flink. Features a decoupled microservices architecture with Kafka streaming, real-time windowed analytics, a Streamlit executive dashboard, and automated Discord notifications.

Language: Python - Size: 9.69 MB - Last synced at: 14 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

gerardodavidlopezcastillo/Cloud9KinesisAthena_Public

Streaming data analysis using AWS tools such as Cloud9 to generate events in the cloud, using boto3 to send records to Kinesis Data Firehose to connect to the S3 bucket destination, saving files in .parquet format. With the help of Glue, a data catalog will be created to enable real-time querying of all records with Athena.

Language: Python - Size: 1.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gerardodavidlopezcastillo/SimulateDataStreamingIPYNB_Public

Python code is shared that simulates random events in two scenarios: Technology E-commerce and Megastore in their mobile app. This is done to generate large-scale data that can be processed using Data Engineering tools.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0