An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-streaming

strimzi/strimzi-kafka-operator

Apache Kafka® running on Kubernetes

Language: Java - Size: 91.5 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5,196 - Forks: 1,355

apache/inlong

Apache InLong - a one-stop, full-scenario integration framework for massive data

Language: Java - Size: 57.5 MB - Last synced at: about 5 hours ago - Pushed at: 2 days ago - Stars: 1,432 - Forks: 532

jaehyeon-kim/streaming-demos

Data streaming demo projects

Language: Python - Size: 2.54 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 1

factaxd/Twitch-Chat-Emote-Analyzer

Real-time Twitch chat analysis application using Python (FastAPI, NLTK, VADER) for backend processing and React (TypeScript, WebSockets) for frontend visualization. Analyzes sentiment, keywords, and emotes (Twitch, FFZ, 7TV)

Language: Python - Size: 68.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

TouK/nussknacker

Low-code tool for automating actions on real time data | Stream processing for the users.

Language: Scala - Size: 179 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 683 - Forks: 95

gunnarmorling/streaming-examples

Example projects and demos around data streaming , stream processing, change data capture, and more.

Language: Java - Size: 22.5 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

apache/doris-kafka-connector

Kafka Connector for Apache Doris

Language: Java - Size: 451 KB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 19 - Forks: 13

ConduitIO/streaming-benchmarks

Benchmarks for Conduit and other data streaming tools.

Language: Shell - Size: 8.14 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4 - Forks: 0

j3-signalroom/ccaf_kickstarter-flight_consolidator_app-lambda

Demonstrates a best practice implementation for using an AWS Lambda function to deploy a Flink Job Graph to Confluent Cloud for Apache Flink.

Language: Python - Size: 2.82 MB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

goldsmocap/mocap-streamer

Goldsmiths Mocap Streamer is a cutting-edge tool that streams live motion capture (BVH) data over the internet, allowing users from multiple remote locations to interact in the same shared digital space.

Language: C - Size: 144 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 8 - Forks: 1

giobbu/CUSUM

Different flavours of CUSUM for change point detection

Language: Jupyter Notebook - Size: 26.8 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

jmgiorgi-10/Stock-App

Data pipeline with LSTM prediction based on implementation of stock prices

Language: Python - Size: 254 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

superstreamlabs/memphis

Memphis.dev is a highly scalable and effortless data streaming platform

Language: Go - Size: 468 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 3,294 - Forks: 223

jaehyeon-kim/kafka-pocs

Apache Kafka and Related Projects

Language: Shell - Size: 63.6 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 30 - Forks: 11

hiabhishek1888/ninjafilesystem

This project is a peer-to-peer decentralized file storage in Go.

Language: Go - Size: 1.78 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

aymane-maghouti/Real-Time-Data-Pipeline-Using-Kafka

This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics data from the local computer, processes it through Kafka brokers, and loads it into a SQL Server database. Additionally, a real-time dashboard is created using Power BI.

Language: Python - Size: 380 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 3

getindata/flink-http-connector

Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.

Language: Java - Size: 659 KB - Last synced at: 20 days ago - Pushed at: 28 days ago - Stars: 176 - Forks: 45

cning112/fastflight

FastFlight is a high-performance data transfer framework using Apache Arrow Flight for efficient, modular, and pluggable data streaming with optional FastAPI integration for HTTP-based access.

Language: Python - Size: 1.58 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

getindata/dbt-flink-adapter

Adapter for dbt that executes dbt pipelines on Apache Flink

Language: Python - Size: 14 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 12

DagsHub/client

DagsHub client libraries

Language: Python - Size: 3.57 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 93 - Forks: 23

daniel-dqsdatalabs/bigdata-projects

Big Data Stack

Language: Python - Size: 368 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

jaehyeon-kim/flink-demos

Apache Flink (Pyflink) and Related Projects

Language: Python - Size: 2.18 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 35 - Forks: 12

build-on-aws/building-apache-kafka-connectors

Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.

Language: Java - Size: 57.6 KB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 54 - Forks: 14

ZainAli104/distributed-file-system-go

This project offers a peer-to-peer content-addressable distributed file storage in Go with a peer-to-peer library built on top of TCP from scratch. It also supports data encryption during storage and transmission

Language: Go - Size: 70.3 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

janzenisek/dsg

C# .NET data stream generator

Language: C# - Size: 9.57 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 3

pipelane/pipelaner

High-performance and efficient Framework and Agent for creating data pipelines. The core of pipeline descriptions is based on the Configuration As Code concept and the Pkl configuration language by Apple.

Language: Go - Size: 412 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 34 - Forks: 0

daq-tools/lorrystream

A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.

Language: Python - Size: 260 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

ThaiTechTales/kafka

This repository serves as a collection of projects demonstrating expertise in Apache Kafka, a distributed event-streaming platform. The projects aim to highlight real-time data integration and stream processing solutions.

Language: Java - Size: 11.1 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

jlumbroso/affirmative-sampling

Reference implementation of the Affirmative Sampling algorithm by Jérémie Lumbroso and Conrado Martínez (2022). 🍀

Language: Python - Size: 794 KB - Last synced at: 13 days ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

PranavBarthwal/kafka

Kafka is an open-source software platform for storing, processing, and analyzing streaming data in real time. It's used to build data pipelines and applications that can adapt to data streams.

Language: JavaScript - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

turulix/memphis-rust-community

A Memphis SDK written in rust.

Language: Rust - Size: 256 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

tashi-2004/Apache-Flink-Spark-Data-Streaming

This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.

Language: Python - Size: 62.6 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mitgar14/etl-workshop-3

Workshop #3 (Machine Learning and Data Streaming) for the ETL course using scikit-learn to develop the ML model and Apache Kafka to manage the data streaming process.

Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ThiagoBarradas/logstash-beats-demo

Elastic Stack with Nginx, Logstash and Beats demo

Size: 19.5 KB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 3

ixian-platform/Ixian-S2

Ixian S2 end to end data streaming network software

Language: C# - Size: 311 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 3

strimzi/strimzi-canary 📦

Strimzi canary

Language: Go - Size: 325 KB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 41 - Forks: 29

nessmahm/SparkKafka-Real-Time-Brand-Trend-Analysis

A real-time data processing project for analyzing e-commerce sales data. It integrates Apache Kafka 📡 and Apache Spark ⚡ to extract brand information and calculate the top N brands. Results are stored in HBase and displayed in real-time ⏱️. This project enables seamless integration with visualization tools 📊, offering insights into brand trends.

Language: Java - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Hive_MQ_Streaming

Streaming data seamlessly using the HiveMQ Broker for efficient communication and IoT integration.

Language: Python - Size: 3.91 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Raspberry_Pi

Python Operations for Raspberry Pi in Internet of Things (IoT) Applications

Language: Python - Size: 28.3 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Richardbnk/Telegram_Tool

Utility functions for seamless message streaming and automation with Telegram.

Language: Python - Size: 12.7 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dataphos/lib-brokers

lib-brokers is a Go library which contains the interfaces used to interact with messaging systems without relying on a specific technology or client library. This library attempts to solve the issue of properly abstracting away the interaction between applications and messaging systems.

Language: Go - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SidiahmedHABIB/e2e-data-engineering

This project is an end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using a variety of powerful tools including Apache Airflow, Apache Kafka, Apache Spark and Cassandra. All components are containerized with Docker for easy deployment and scalability.

Language: Python - Size: 1.73 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

codeterrayt/StreamGuard

StreamGuard is a high-performance data management script using Kafka and MongoDB to efficiently handle and process real-time data streams. Ideal for scenarios like live GPS tracking, it features real-time data processing, reduced database load, and bulk data insertion.

Language: JavaScript - Size: 24.4 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

joeycumines/go-longpoll

Package longpoll supports batching e.g. receiving as many values as possible from a channel.

Language: Go - Size: 9.77 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lupusruber/crypto_stats

A project that provides a cloud-native solution for ingesting, transforming, and visualizing cryptocurrency data, utilizing modern tools and workflows for scalability and automation.

Language: Python - Size: 536 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Pirate-Emperor/K2BQ

K2BQ is a dataflow pipeline that streams data from Kafka to BigQuery. It uses Google Cloud’s managed Kafka, Dataflow for processing, and BigQuery for real-time analytics, offering scalable, automated data integration for fast insights.

Language: Python - Size: 21.5 KB - Last synced at: about 10 hours ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

bdbao/Kafka-VM

This project demonstrates a basic Kafka implementation: using the kafka-python library via Ubuntu virtual machine; and Change Data Capture (CDC) between 2 DBMS via Docker.

Language: Makefile - Size: 27 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

serdaraltin/fusion-bridge

Establishes and manages communication between different hardware components and software layers, ensuring seamless data exchange and synchronization.

Language: C++ - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

CLoaKY233/DataStreams

Real-time cat/dog image classifier using Kafka and CNN. Sender uploads images to Kafka, receiver processes with pre-trained model, returns predictions via Kafka. Demonstrates distributed, scalable image processing with instant feedback. Uses TensorFlow, OpenCV, and Kafka for efficient, asynchronous communication.

Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

linkedin/brooklin

An extensible distributed system for reliable nearline data streaming at scale

Language: Java - Size: 5.61 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 919 - Forks: 137

MasWag/monaa

A Tool for Timed Patten Matching with Automata-Based Acceleration

Language: C++ - Size: 2.15 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 0

dataphos/lib-streamproc

A Go library that exposes executors, interfaces, data structures, and utility functions which combined a universal stream processor, invariant to any specific messaging system.

Language: Go - Size: 82 KB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

beam-pyio/firehose_pyio

Apache Beam Python I/O connector for Amazon Data Firehose

Language: Python - Size: 2.83 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

beam-pyio/sqs_pyio

Apache Beam Python I/O connector for Amazon SQS

Language: Python - Size: 2.83 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

beam-pyio/dynamodb_pyio

Apache Beam Python I/O connector for Amazon DynamoDB

Language: Python - Size: 2.8 MB - Last synced at: 22 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

jlumbroso/java-random-hash

A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲

Language: Java - Size: 726 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

Giangblackk/streaming_benchmark

Streaming Data Pipeline End-to-End Latency Benchmarking

Language: Python - Size: 1.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

night-fury-me/real-time-vehicle-data-processing

A repository that contains implementation of a Real-Time Vehicle Data Processing Pipeline that efficiently manages and analyzes vehicle data through a cohesive system.

Language: Python - Size: 664 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Doch88/aws-firehose-sender-py

AWS Firehose Sender - Sending data securely through a Firehose stream using boto3

Language: Python - Size: 4.88 KB - Last synced at: 8 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

to-infinitee/real-time-data-system-arch

The architecture ingests data via Kafka, processes it in real-time with Spark Streaming, and stores it in Cassandra and Hadoop HDFS. It supports direct data push to apps using WebSockets/HTTP Streaming, with a front-end built on Spring Boot, Bootstrap.js, and Chart.js.

Size: 353 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cjcocokrisp/m3x-data-streaming

Library for interacting with the Myopro 2 device that allows for data streaming and changing the device's control parameters.

Language: Python - Size: 112 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

KentHsu/Udacity-Data-Streaming-Nanodegree

Udacity Data Streaming Nanodegree Program

Language: Python - Size: 624 KB - Last synced at: 18 days ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 12

kanenorman/mobility-ai

Full Stack Machine Learning Engineer Project - Real Time Data Streaming and Predictions

Language: Python - Size: 122 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

AyemunHossain/stream-buffers-in-nodejs

This project focuses on implementing and demonstrating how stream and buffer works along together in nodejs.

Language: JavaScript - Size: 28.5 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

nyoungstudios/multiflow

A Python multithreading library for data processing pipelines, data streaming, etc.

Language: Python - Size: 370 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Waiguru254/surveycto

Manipulation of Data Collected through XLSFORM-compliant Platforms

Language: R - Size: 290 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mahdimlo/user_data_streaming

This project demonstrates how to fetch, process, and stream user data using Python and Kafka, with final storage in PostgreSQL. It includes steps for adding timestamps and labels to the data.

Language: Python - Size: 574 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

priyangshupal/cas-distributed-file-system

This project offers a peer-to-peer content-addressable distributed file storage in Go with a peer-to-peer library built on top of TCP from scratch. It also supports data encryption during storage and transmission

Language: Go - Size: 274 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

saveriogzz/weather-collector Fork of goeh/weather-collector

This Java program read data from a weather station (Davis Vantage Pro2) and store it in a SQL database

Language: Java - Size: 389 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kamil271e/stock-analysis

Stock data processing and analysis

Language: Jupyter Notebook - Size: 247 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

stancsz/akka-stream-processor 📦

Stream processor that's consuming two topics and process data

Language: Scala - Size: 7.62 MB - Last synced at: 11 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

tealtools/puls

The simplest way to run a local multi-cluster multi-broker Apache Pulsar instance 🚀

Language: Rust - Size: 146 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

zabir-nabil/picast

A lightweight fast data streaming library for raspberry pi in python.

Language: Python - Size: 162 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 1

officialarijit/RPWE

Reward Penalty Weighted Ensemble approach for multimodal data stream classification

Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

MauricioVazquezM/Spark_BigData_Architecture_Project

Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM

Language: Python - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

kanenorman/crypto-streaming

Real time data streaming and analysis

Language: Python - Size: 411 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

accoffin12/streaming-03-bonus-acoffin

Demonstrating the application of RabbitMQ and the Pika Library that enables communication through an intermediary using MTA Subway Data from NYC.

Language: Python - Size: 668 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

PubNubDevelopers/Real-Time-Data-Streaming-Demo-Android

Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more (Android)

Language: Kotlin - Size: 31.3 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

AVINESHWAR-KRISHNA/DataStream-SQLServer

DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.

Language: Python - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MengLinMaker/ESP32-data-stream-comparisons 📦

Comparing data streaming methods from ESP32

Language: C++ - Size: 39.1 KB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

tealtools/dekaf

Dekaf is a visual user interface for Apache Pulsar 🛸

Size: 51.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

MengLinMaker/IMU-Webserial-Visualiser Fork of Autodrop3d/serialTerminal.com

Visualising IMU orientation using "Three.js" via the experimental "Web Serial API".

Language: JavaScript - Size: 95.7 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

sabharish-21/covid-analysis-mongoDB-explored

A covid data analysis application mainly focused on the applications of mongoDB databse

Language: Jupyter Notebook - Size: 4.3 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

marian-nmt/sotastream

A library for data streaming and augmentation

Language: Python - Size: 536 KB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 3

pravega/pravega-samples

Sample Applications for Pravega.

Language: Java - Size: 51.8 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 63

robertlit/location-data-streaming

Data processing pipeline for detection of motion based events in a stream of real-time location updates

Language: Java - Size: 77.1 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

eserie/wax-ml

A Python library for machine-learning and feedback loops on streaming data

Language: Python - Size: 10.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 57 - Forks: 5

eli64s/pyflink-poc

PyFlink data stream processing utilities 🐿

Language: Python - Size: 32.2 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

sorieux/trino-kafka-demo

Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.

Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jeffreywangzhi/AWS_Kinesis-Data-Streaming

Real-time data streaming implementation with AWS Kinesis.

Language: Java - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

blitzkz23/final-project-end-to-end-banking-campaign-pipeline

Final Project for IYKRA Data Fellowship 8 Program, creating an end-to-end banking campaign pipeline using lambda architecture (providing acess to batch and stream processing)

Language: Python - Size: 242 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 2

IsaacMwendwa/Streaming-Data-with-AWS-Kinesis-and-Lambda

Working with real-time data - Streaming Data with Amazon Kinesis and AWS Lambda

Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

gerardodavidlopezcastillo/Apache-KafkaDruidEC2_Public

Process that generates random events over thousands of records from a technology e-commerce site for processing in Apache Kafka and analysis in Druid instantiated from EC2, WSL2 Ubuntu, Key .pem

Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jlumbroso/python-random-hash

A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲

Language: Python - Size: 32.2 KB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

iamirmasoud/data_streaming

Hands on data streaming

Language: Jupyter Notebook - Size: 14.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

terranovafr/PersonalityClustering

ML clustering techniques for grouping users according to their personality

Language: Jupyter Notebook - Size: 45.8 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DataFloz/Metro

Metro is pipelines orchestrator platform for stream data.

Language: Python - Size: 642 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

leonardohss0/data-streaming-with-kafka

Keywords: Python, Data Processing, Kafka, Data Streaming, Docker

Language: Python - Size: 60.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Tzur1234/Optimizing-Public-Transportation

I built a real-time event pipeline using Apache Kafka and its ecosystem, simulating and displaying Chicago Transit Authority train line statuses using public data.

Language: Python - Size: 319 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EstevaoUyra/streaming-san-francisco-crime-data

A projects developed while learning data streaming

Language: Python - Size: 7.88 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0