GitHub topics: data-streaming
strimzi/strimzi-kafka-operator
Apache Kafka® running on Kubernetes
Language: Java - Size: 91.5 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5,196 - Forks: 1,355

apache/inlong
Apache InLong - a one-stop, full-scenario integration framework for massive data
Language: Java - Size: 57.5 MB - Last synced at: about 5 hours ago - Pushed at: 2 days ago - Stars: 1,432 - Forks: 532

jaehyeon-kim/streaming-demos
Data streaming demo projects
Language: Python - Size: 2.54 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 1

factaxd/Twitch-Chat-Emote-Analyzer
Real-time Twitch chat analysis application using Python (FastAPI, NLTK, VADER) for backend processing and React (TypeScript, WebSockets) for frontend visualization. Analyzes sentiment, keywords, and emotes (Twitch, FFZ, 7TV)
Language: Python - Size: 68.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

TouK/nussknacker
Low-code tool for automating actions on real time data | Stream processing for the users.
Language: Scala - Size: 179 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 683 - Forks: 95

gunnarmorling/streaming-examples
Example projects and demos around data streaming, stream processing, change data capture, and more.
Language: Java - Size: 22.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

apache/doris-kafka-connector
Kafka Connector for Apache Doris
Language: Java - Size: 451 KB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 19 - Forks: 13

ConduitIO/streaming-benchmarks
Benchmarks for Conduit and other data streaming tools.
Language: Shell - Size: 8.14 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4 - Forks: 0

j3-signalroom/ccaf_kickstarter-flight_consolidator_app-lambda
Demonstrates a best practice implementation for using an AWS Lambda function to deploy a Flink Job Graph to Confluent Cloud for Apache Flink.
Language: Python - Size: 2.82 MB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

goldsmocap/mocap-streamer
Goldsmiths Mocap Streamer is a cutting-edge tool that streams live motion capture (BVH) data over the internet, allowing users from multiple remote locations to interact in the same shared digital space.
Language: C - Size: 144 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 8 - Forks: 1

giobbu/CUSUM
Different flavours of CUSUM for change point detection
Language: Jupyter Notebook - Size: 26.8 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

jmgiorgi-10/Stock-App
Data pipeline with LSTM prediction based on implementation of stock prices
Language: Python - Size: 254 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

superstreamlabs/memphis
Memphis.dev is a highly scalable and effortless data streaming platform
Language: Go - Size: 468 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 3,294 - Forks: 223

jaehyeon-kim/kafka-pocs
Apache Kafka and Related Projects
Language: Shell - Size: 63.6 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 30 - Forks: 11

hiabhishek1888/ninjafilesystem
This project is a peer-to-peer decentralized file storage in Go.
Language: Go - Size: 1.78 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

aymane-maghouti/Real-Time-Data-Pipeline-Using-Kafka
This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics data from the local computer, processes it through Kafka brokers, and loads it into a SQL Server database. Additionally, a real-time dashboard is created using Power BI.
Language: Python - Size: 380 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 3

getindata/flink-http-connector
HTTP Connector for Apache Flink. Provides sources and sinks for DataStream, Table and SQL APIs.
Language: Java - Size: 659 KB - Last synced at: 20 days ago - Pushed at: 28 days ago - Stars: 176 - Forks: 45

cning112/fastflight
FastFlight is a high-performance data transfer framework using Apache Arrow Flight for efficient, modular, and pluggable data streaming with optional FastAPI integration for HTTP-based access.
Language: Python - Size: 1.58 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

getindata/dbt-flink-adapter
Adapter for dbt that executes dbt pipelines on Apache Flink
Language: Python - Size: 14 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 12

DagsHub/client
DagsHub client libraries
Language: Python - Size: 3.57 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 93 - Forks: 23

daniel-dqsdatalabs/bigdata-projects
Big Data Stack
Language: Python - Size: 368 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

jaehyeon-kim/flink-demos
Apache Flink (PyFlink) and Related Projects
Language: Python - Size: 2.18 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 35 - Forks: 12

build-on-aws/building-apache-kafka-connectors
Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.
Language: Java - Size: 57.6 KB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 54 - Forks: 14

ZainAli104/distributed-file-system-go
This project offers a peer-to-peer content-addressable distributed file storage in Go with a peer-to-peer library built on top of TCP from scratch. It also supports data encryption during storage and transmission
Language: Go - Size: 70.3 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

janzenisek/dsg
C# .NET data stream generator
Language: C# - Size: 9.57 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 3

pipelane/pipelaner
High-performance and efficient Framework and Agent for creating data pipelines. The core of pipeline descriptions is based on the Configuration As Code concept and the Pkl configuration language by Apple.
Language: Go - Size: 412 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 34 - Forks: 0

daq-tools/lorrystream
A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.
Language: Python - Size: 260 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

ThaiTechTales/kafka
This repository serves as a collection of projects demonstrating expertise in Apache Kafka, a distributed event-streaming platform. The projects aim to highlight real-time data integration and stream processing solutions.
Language: Java - Size: 11.1 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

jlumbroso/affirmative-sampling
Reference implementation of the Affirmative Sampling algorithm by Jérémie Lumbroso and Conrado Martínez (2022). 🍀
Language: Python - Size: 794 KB - Last synced at: 13 days ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

PranavBarthwal/kafka
Kafka is an open-source software platform for storing, processing, and analyzing streaming data in real time. It's used to build data pipelines and applications that can adapt to data streams.
Language: JavaScript - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

turulix/memphis-rust-community
A Memphis SDK written in Rust.
Language: Rust - Size: 256 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

tashi-2004/Apache-Flink-Spark-Data-Streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
Language: Python - Size: 62.6 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mitgar14/etl-workshop-3
Workshop #3 (Machine Learning and Data Streaming) for the ETL course using scikit-learn to develop the ML model and Apache Kafka to manage the data streaming process.
Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ThiagoBarradas/logstash-beats-demo
Elastic Stack with Nginx, Logstash and Beats demo
Size: 19.5 KB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 3

ixian-platform/Ixian-S2
Ixian S2 end to end data streaming network software
Language: C# - Size: 311 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 3

strimzi/strimzi-canary 📦
Strimzi canary
Language: Go - Size: 325 KB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 41 - Forks: 29

nessmahm/SparkKafka-Real-Time-Brand-Trend-Analysis
A real-time data processing project for analyzing e-commerce sales data. It integrates Apache Kafka 📡 and Apache Spark ⚡ to extract brand information and calculate the top N brands. Results are stored in HBase and displayed in real-time ⏱️. This project enables seamless integration with visualization tools 📊, offering insights into brand trends.
Language: Java - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Hive_MQ_Streaming
Streaming data seamlessly using the HiveMQ Broker for efficient communication and IoT integration.
Language: Python - Size: 3.91 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Raspberry_Pi
Python Operations for Raspberry Pi in Internet of Things (IoT) Applications
Language: Python - Size: 28.3 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Telegram_Tool
Utility functions for seamless message streaming and automation with Telegram.
Language: Python - Size: 12.7 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dataphos/lib-brokers
lib-brokers is a Go library which contains the interfaces used to interact with messaging systems without relying on a specific technology or client library. This library attempts to solve the issue of properly abstracting away the interaction between applications and messaging systems.
Language: Go - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SidiahmedHABIB/e2e-data-engineering
This project is an end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using a variety of powerful tools including Apache Airflow, Apache Kafka, Apache Spark and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Language: Python - Size: 1.73 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

codeterrayt/StreamGuard
StreamGuard is a high-performance data management script using Kafka and MongoDB to efficiently handle and process real-time data streams. Ideal for scenarios like live GPS tracking, it features real-time data processing, reduced database load, and bulk data insertion.
Language: JavaScript - Size: 24.4 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

joeycumines/go-longpoll
Package longpoll supports batching, e.g. receiving as many values as possible from a channel.
Language: Go - Size: 9.77 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lupusruber/crypto_stats
A project that provides a cloud-native solution for ingesting, transforming, and visualizing cryptocurrency data, utilizing modern tools and workflows for scalability and automation.
Language: Python - Size: 536 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Pirate-Emperor/K2BQ
K2BQ is a dataflow pipeline that streams data from Kafka to BigQuery. It uses Google Cloud’s managed Kafka, Dataflow for processing, and BigQuery for real-time analytics, offering scalable, automated data integration for fast insights.
Language: Python - Size: 21.5 KB - Last synced at: about 10 hours ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

bdbao/Kafka-VM
This project demonstrates a basic Kafka implementation: using the kafka-python library on an Ubuntu virtual machine, and Change Data Capture (CDC) between two DBMSs via Docker.
Language: Makefile - Size: 27 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

serdaraltin/fusion-bridge
Establishes and manages communication between different hardware components and software layers, ensuring seamless data exchange and synchronization.
Language: C++ - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

CLoaKY233/DataStreams
Real-time cat/dog image classifier using Kafka and CNN. Sender uploads images to Kafka, receiver processes with pre-trained model, returns predictions via Kafka. Demonstrates distributed, scalable image processing with instant feedback. Uses TensorFlow, OpenCV, and Kafka for efficient, asynchronous communication.
Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

linkedin/brooklin
An extensible distributed system for reliable nearline data streaming at scale
Language: Java - Size: 5.61 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 919 - Forks: 137

MasWag/monaa
A Tool for Timed Pattern Matching with Automata-Based Acceleration
Language: C++ - Size: 2.15 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 0

dataphos/lib-streamproc
A Go library that exposes executors, interfaces, data structures, and utility functions which, combined, form a universal stream processor, invariant to any specific messaging system.
Language: Go - Size: 82 KB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

beam-pyio/firehose_pyio
Apache Beam Python I/O connector for Amazon Data Firehose
Language: Python - Size: 2.83 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

beam-pyio/sqs_pyio
Apache Beam Python I/O connector for Amazon SQS
Language: Python - Size: 2.83 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

beam-pyio/dynamodb_pyio
Apache Beam Python I/O connector for Amazon DynamoDB
Language: Python - Size: 2.8 MB - Last synced at: 22 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

jlumbroso/java-random-hash
A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
Language: Java - Size: 726 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

Giangblackk/streaming_benchmark
Streaming Data Pipeline End-to-End Latency Benchmarking
Language: Python - Size: 1.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

night-fury-me/real-time-vehicle-data-processing
A repository that contains implementation of a Real-Time Vehicle Data Processing Pipeline that efficiently manages and analyzes vehicle data through a cohesive system.
Language: Python - Size: 664 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Doch88/aws-firehose-sender-py
AWS Firehose Sender - Sending data securely through a Firehose stream using boto3
Language: Python - Size: 4.88 KB - Last synced at: 8 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

to-infinitee/real-time-data-system-arch
The architecture ingests data via Kafka, processes it in real-time with Spark Streaming, and stores it in Cassandra and Hadoop HDFS. It supports direct data push to apps using WebSockets/HTTP Streaming, with a front-end built on Spring Boot, Bootstrap.js, and Chart.js.
Size: 353 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cjcocokrisp/m3x-data-streaming
Library for interacting with the Myopro 2 device that allows for data streaming and changing the device's control parameters.
Language: Python - Size: 112 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

KentHsu/Udacity-Data-Streaming-Nanodegree
Udacity Data Streaming Nanodegree Program
Language: Python - Size: 624 KB - Last synced at: 18 days ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 12

kanenorman/mobility-ai
Full Stack Machine Learning Engineer Project - Real Time Data Streaming and Predictions
Language: Python - Size: 122 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

AyemunHossain/stream-buffers-in-nodejs
This project focuses on implementing and demonstrating how streams and buffers work together in Node.js.
Language: JavaScript - Size: 28.5 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

nyoungstudios/multiflow
A Python multithreading library for data processing pipelines, data streaming, etc.
Language: Python - Size: 370 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Waiguru254/surveycto
Manipulation of Data Collected through XLSFORM-compliant Platforms
Language: R - Size: 290 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mahdimlo/user_data_streaming
This project demonstrates how to fetch, process, and stream user data using Python and Kafka, with final storage in PostgreSQL. It includes steps for adding timestamps and labels to the data.
Language: Python - Size: 574 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

priyangshupal/cas-distributed-file-system
This project offers a peer-to-peer content-addressable distributed file storage in Go with a peer-to-peer library built on top of TCP from scratch. It also supports data encryption during storage and transmission
Language: Go - Size: 274 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

saveriogzz/weather-collector Fork of goeh/weather-collector
This Java program reads data from a weather station (Davis Vantage Pro2) and stores it in a SQL database
Language: Java - Size: 389 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kamil271e/stock-analysis
Stock data processing and analysis
Language: Jupyter Notebook - Size: 247 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

stancsz/akka-stream-processor 📦
Stream processor that consumes two topics and processes the data
Language: Scala - Size: 7.62 MB - Last synced at: 11 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

tealtools/puls
The simplest way to run a local multi-cluster multi-broker Apache Pulsar instance 🚀
Language: Rust - Size: 146 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

zabir-nabil/picast
A lightweight, fast data streaming library for Raspberry Pi in Python.
Language: Python - Size: 162 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 1

officialarijit/RPWE
Reward Penalty Weighted Ensemble approach for multimodal data stream classification
Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

MauricioVazquezM/Spark_BigData_Architecture_Project
Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM
Language: Python - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

kanenorman/crypto-streaming
Real time data streaming and analysis
Language: Python - Size: 411 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

accoffin12/streaming-03-bonus-acoffin
Demonstrating the application of RabbitMQ and the Pika Library that enables communication through an intermediary using MTA Subway Data from NYC.
Language: Python - Size: 668 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

PubNubDevelopers/Real-Time-Data-Streaming-Demo-Android
Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more (Android)
Language: Kotlin - Size: 31.3 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

AVINESHWAR-KRISHNA/DataStream-SQLServer
DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.
Language: Python - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MengLinMaker/ESP32-data-stream-comparisons 📦
Comparing data streaming methods from ESP32
Language: C++ - Size: 39.1 KB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

tealtools/dekaf
Dekaf is a visual user interface for Apache Pulsar 🛸
Size: 51.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

MengLinMaker/IMU-Webserial-Visualiser Fork of Autodrop3d/serialTerminal.com
Visualising IMU orientation using "Three.js" via the experimental "Web Serial API".
Language: JavaScript - Size: 95.7 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

sabharish-21/covid-analysis-mongoDB-explored
A COVID data analysis application mainly focused on the applications of the MongoDB database
Language: Jupyter Notebook - Size: 4.3 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

marian-nmt/sotastream
A library for data streaming and augmentation
Language: Python - Size: 536 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 3

pravega/pravega-samples
Sample Applications for Pravega.
Language: Java - Size: 51.8 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 63

robertlit/location-data-streaming
Data processing pipeline for detection of motion based events in a stream of real-time location updates
Language: Java - Size: 77.1 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

eserie/wax-ml
A Python library for machine-learning and feedback loops on streaming data
Language: Python - Size: 10.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 57 - Forks: 5

eli64s/pyflink-poc
PyFlink data stream processing utilities 🐿
Language: Python - Size: 32.2 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

sorieux/trino-kafka-demo
Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.
Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jeffreywangzhi/AWS_Kinesis-Data-Streaming
Real-time data streaming implementation with AWS Kinesis.
Language: Java - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

blitzkz23/final-project-end-to-end-banking-campaign-pipeline
Final Project for IYKRA Data Fellowship 8 Program, creating an end-to-end banking campaign pipeline using lambda architecture (providing access to batch and stream processing)
Language: Python - Size: 242 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 2

IsaacMwendwa/Streaming-Data-with-AWS-Kinesis-and-Lambda
Working with real-time data - Streaming Data with Amazon Kinesis and AWS Lambda
Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

gerardodavidlopezcastillo/Apache-KafkaDruidEC2_Public
Process that generates random events over thousands of records from a technology e-commerce site for processing in Apache Kafka and analysis in Druid instantiated from EC2, WSL2 Ubuntu, Key .pem
Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jlumbroso/python-random-hash
A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Language: Python - Size: 32.2 KB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

iamirmasoud/data_streaming
Hands on data streaming
Language: Jupyter Notebook - Size: 14.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

terranovafr/PersonalityClustering
ML clustering techniques for grouping users according to their personality
Language: Jupyter Notebook - Size: 45.8 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DataFloz/Metro
Metro is a pipeline orchestrator platform for stream data.
Language: Python - Size: 642 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

leonardohss0/data-streaming-with-kafka
Keywords: Python, Data Processing, Kafka, Data Streaming, Docker
Language: Python - Size: 60.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Tzur1234/Optimizing-Public-Transportation
I built a real-time event pipeline using Apache Kafka and its ecosystem, simulating and displaying Chicago Transit Authority train line statuses using public data.
Language: Python - Size: 319 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EstevaoUyra/streaming-san-francisco-crime-data
A project developed while learning data streaming
Language: Python - Size: 7.88 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
