An open API service providing repository metadata for many open source software ecosystems.

Topic: "streaming-processing"

benedekrozemberczki/NestedSubtreeHash

A distributed implementation of "Nested Subtree Hash Kernels for Large-Scale Graph Classification Over Streams" (ICDM 2012).

Language: Python - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 8

judeleonard/Kafka-Streaming-Pipeline

The goal of this project is aimed at optimizing Bank Marketing Model through building an event streaming pipeline around Apache Kafka and its ecosystem that communicates with a Machine learning model microservice. Utilizing this to display the likelihood and status of Bank Customers in real time.

Language: Python - Size: 2.82 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

tmph2003/Streaming-Project-with-Flink

Language: Python - Size: 30.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

meowpunch/ApacheSparkWithScala

Apache Spark With Scala - hands on with big data

Language: Scala - Size: 43.9 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

IBMStreams/sample.netflow

Netflow sample

Language: JavaScript - Size: 40.3 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

gerardodavidlopezcastillo/Cloud9KinesisAthena_Public

Streaming data analysis using AWS tools such as Cloud9 to generate events in the cloud, using boto3 to send records to Kinesis Data Firehose to connect to the S3 bucket destination, saving files in .parquet format. With the help of Glue, a data catalog will be created to enable real-time querying of all records with Athena.

Language: Python - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gerardodavidlopezcastillo/SimulateDataStreamingIPYNB_Public

Python code is shared that simulates random events in two scenarios: Technology E-commerce and Megastore in their mobile app. This is done to generate large-scale data that can be processed using Data Engineering tools.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0