Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: data-streaming
officialarijit/RPWE
Reward Penalty Weighted Ensemble approach for multimodal data stream classification
Language: Jupyter Notebook - Size: 1.74 MB - Last synced: about 20 hours ago - Pushed: about 21 hours ago - Stars: 4 - Forks: 0
apache/doris-kafka-connector
Kafka Connector for Apache Doris
Language: Java - Size: 393 KB - Last synced: 10 days ago - Pushed: 14 days ago - Stars: 6 - Forks: 8
MasWag/monaa
A Tool for Timed Pattern Matching with Automata-Based Acceleration
Language: C++ - Size: 2.13 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 9 - Forks: 0
DagsHub/client
DagsHub client libraries
Language: Python - Size: 3.09 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 90 - Forks: 23
daq-tools/lorrystream
A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.
Language: Python - Size: 110 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0
superstreamlabs/memphis
Memphis.dev is a highly scalable and effortless data streaming platform
Language: Go - Size: 465 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 3,169 - Forks: 210
apache/inlong
Apache InLong - a one-stop, full-scenario integration framework for massive data
Language: Java - Size: 52.7 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 1,310 - Forks: 487
marcusmcb/serato-nowplaying-twitch
A chat-bot script for Twitch that allows your viewers to interact with your Serato DJ play history in real-time.
Language: JavaScript - Size: 7.71 MB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0
linkedin/brooklin
An extensible distributed system for reliable nearline data streaming at scale
Language: Java - Size: 5.61 MB - Last synced: 8 days ago - Pushed: 10 days ago - Stars: 891 - Forks: 134
MauricioVazquezM/Spark_BigData_Architecture_Project
Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM
Language: Python - Size: 47.9 KB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0
getindata/dbt-flink-adapter
Adapter for dbt that executes dbt pipelines on Apache Flink
Language: Python - Size: 14 MB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 70 - Forks: 8
kanenorman/crypto-streaming
Real time data streaming and analysis
Language: Python - Size: 411 KB - Last synced: 9 days ago - Pushed: 2 months ago - Stars: 1 - Forks: 1
accoffin12/streaming-03-bonus-acoffin
Demonstrating the application of RabbitMQ and the Pika Library that enables communication through an intermediary using MTA Subway Data from NYC.
Language: Python - Size: 668 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 0 - Forks: 0
getindata/flink-http-connector
HTTP Connector for Apache Flink. Provides sources and sinks for DataStream, Table and SQL APIs.
Language: Java - Size: 533 KB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 126 - Forks: 36
tealtools/puls
The simplest way to run a local multi-cluster multi-broker Apache Pulsar instance 🚀
Language: Rust - Size: 145 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0
strimzi/strimzi-canary
Strimzi canary
Language: Go - Size: 321 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 39 - Forks: 24
PubNubDevelopers/Real-Time-Data-Streaming-Demo-Android
Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more (Android)
Language: Kotlin - Size: 31.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
AVINESHWAR-KRISHNA/DataStream-SQLServer
DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.
Language: Python - Size: 20.5 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
TouK/nussknacker
Low-code tool for automating actions on real time data | Stream processing for the users.
Language: Scala - Size: 139 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 599 - Forks: 89
strimzi/strimzi-kafka-operator
Apache Kafka® running on Kubernetes
Language: Java - Size: 69.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 4,437 - Forks: 1,210
MengLinMaker/ESP32-data-stream-comparisons 📦
Comparing data streaming methods from ESP32
Language: C++ - Size: 39.1 KB - Last synced: about 17 hours ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0
tealtools/dekaf
Dekaf is a visual user interface for Apache Pulsar 🛸
Size: 51.8 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 3 - Forks: 0
kanenorman/mobility-ai
Full Stack Machine Learning Engineer Project - Real Time Data Streaming and Predictions
Language: Python - Size: 122 MB - Last synced: 8 days ago - Pushed: about 1 month ago - Stars: 4 - Forks: 0
goldsmocap/mocap-streamer
Goldsmiths Mocap Streamer is a cutting-edge tool that streams live motion capture (BVH) data over the internet, allowing users from multiple remote locations to interact in the same shared digital space.
Language: TypeScript - Size: 105 MB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1
MengLinMaker/IMU-Webserial-Visualiser Fork of Autodrop3d/serialTerminal.com
Visualising IMU orientation using "Three.js" via the experimental "Web Serial API".
Language: JavaScript - Size: 95.7 KB - Last synced: about 17 hours ago - Pushed: 11 months ago - Stars: 3 - Forks: 2
ppatierno/devday-meet-apache-kafka
Meet Apache Kafka
Size: 223 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
cuihantao/ltbnet Fork of enliten/ltbnet
Large Scale Test Bed Network Emulator (CURENT @ UTK)
Language: Python - Size: 219 KB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 2
sabharish-21/covid-analysis-mongoDB-explored
A COVID data analysis application mainly focused on the applications of the MongoDB database
Language: Jupyter Notebook - Size: 4.3 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
jaehyeon-kim/kafka-pocs
Apache Kafka and Related Projects
Language: Shell - Size: 56.1 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 9 - Forks: 7
joeycumines/go-longpoll
Package longpoll supports batching e.g. receiving as many values as possible from a channel.
Language: Go - Size: 7.81 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
marian-nmt/sotastream
A library for data streaming and augmentation
Language: Python - Size: 536 KB - Last synced: 13 days ago - Pushed: 2 months ago - Stars: 20 - Forks: 2
pravega/pravega-samples
Sample Applications for Pravega.
Language: Java - Size: 51.8 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 54 - Forks: 61
robertlit/location-data-streaming
Data processing pipeline for detection of motion based events in a stream of real-time location updates
Language: Java - Size: 77.1 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
turulix/memphis-rust-community
A Memphis SDK written in Rust.
Language: Rust - Size: 256 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 0
aymane-maghouti/Real-Time-Data-Pipeline-Using-Kafka
This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics data from the local computer, processes it through Kafka brokers, and loads it into a SQL Server database. Additionally, a real-time dashboard is created using Power BI.
Language: Python - Size: 380 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0
eserie/wax-ml
A Python library for machine-learning and feedback loops on streaming data
Language: Python - Size: 10.3 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 57 - Forks: 5
jaehyeon-kim/flink-demos
Apache Flink (Pyflink) and Related Projects
Language: Python - Size: 2.18 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 7 - Forks: 1
sorieux/trino-kafka-demo
Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.
Language: Python - Size: 14.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
jeffreywangzhi/AWS_Kinesis-Data-Streaming
Real-time data streaming implementation with AWS Kinesis.
Language: Java - Size: 17.6 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
blitzkz23/final-project-end-to-end-banking-campaign-pipeline
Final Project for IYKRA Data Fellowship 8 Program, creating an end-to-end banking campaign pipeline using lambda architecture (providing access to batch and stream processing)
Language: Python - Size: 242 MB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 8 - Forks: 2
grc-iit/HFlow
HFlow is a platform for I/O forwarding managed elastically, dynamically, and actively
Language: C++ - Size: 3.61 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
IsaacMwendwa/Streaming-Data-with-AWS-Kinesis-and-Lambda
Working with real-time data - Streaming Data with Amazon Kinesis and AWS Lambda
Size: 3.91 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
gerardodavidlopezcastillo/Apache-KafkaDruidEC2_Public
Process that generates random events over thousands of records from a technology e-commerce site for processing in Apache Kafka and analysis in Druid instantiated from EC2, WSL2 Ubuntu, Key .pem
Language: Jupyter Notebook - Size: 1.93 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
jlumbroso/python-random-hash
A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Language: Python - Size: 32.2 KB - Last synced: 10 days ago - Pushed: almost 2 years ago - Stars: 9 - Forks: 0
iamirmasoud/data_streaming
Hands on data streaming
Language: Jupyter Notebook - Size: 14.1 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
terranovafr/PersonalityClustering
ML clustering techniques for grouping users according to their personality
Language: Jupyter Notebook - Size: 45.8 MB - Last synced: 9 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
DataFloz/Metro
Metro is a pipeline orchestrator platform for stream data.
Language: Python - Size: 642 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
janzenisek/dsg
C# .NET data stream generator
Language: C# - Size: 9.54 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 3
leonardohss0/data-streaming-with-kafka
Keywords: Python, Data Processing, Kafka, Data Streaming, Docker
Language: Python - Size: 60.5 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
ProjectIxian/Ixian-S2
Ixian S2 end to end data streaming network software
Language: C# - Size: 297 KB - Last synced: 30 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 3
Tzur1234/Optimizing-Public-Transportation
I built a real-time event pipeline using Apache Kafka and its ecosystem, simulating and displaying Chicago Transit Authority train line statuses using public data.
Language: Python - Size: 319 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
EstevaoUyra/streaming-san-francisco-crime-data
A project developed while learning data streaming
Language: Python - Size: 7.88 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
vhtnguyen/Kafka_Youtube-watcher
An app that keeps track of YouTube videos and sends a notification to a Telegram bot when anyone comments on them
Language: Python - Size: 561 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0
minhloannguyen/postgres-cdc
Change Data Capture with Kafka, Debezium, and Apache Flink for a Postgres database
Language: Java - Size: 337 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
ppatierno/kafka-hybrid-iot
Apache Kafka for the Hybrid IoT
Language: JavaScript - Size: 6.34 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 10 - Forks: 0
AlexImb/automl-streams-research-paper
AutoML Techniques for Data Streams - Research Paper
Language: TeX - Size: 9.23 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0
build-on-aws/building-apache-kafka-connectors
Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.
Language: Java - Size: 56.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 24 - Forks: 4
dmpalyvos/erebus
Erebus: Explaining the Outputs of Data Streaming Queries
Language: Java - Size: 257 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1
enliten/ltbnet
Large Scale Test Bed Network Emulator
Language: Python - Size: 250 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 5
AbdullahMu/Data-Streaming-Nanodegree-Project_02-Evaluate-Human-Balance-with-Spark-Streaming
Design data streaming architecture and API for a real-life application called the Step Trending Electronic Data Interface (STEDI). It is a working application used to assess fall risk for seniors. When a senior takes a test, they are scored using an index which reflects the likelihood of falling, and potentially sustaining an injury in the course of walking. STEDI uses a Redis datastore for risk score and other data. The Data Science team has completed a working graph for population risk at a STEDI clinic. The problem is the data is not populated yet. You will work with Kafka Connect Redis Source events and Business Events to create a Kafka topic containing anonymized risk scores of seniors in the clinic.
Language: Python - Size: 827 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
PubNubDevelopers/Real-Time-Data-Streaming-Demo
Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more
Language: JavaScript - Size: 1.53 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 0
hnjm/memphis-broker Fork of memphisdev/memphis
Next-Generation Real-Time Data Processing Platform
Size: 183 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
KentHsu/Udacity-Data-Streaming-Nanodegree
Udacity Data Streaming Nanodegree Program
Language: Python - Size: 624 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 14 - Forks: 11
bcgov/data-stream
Subscribe to datasets and be notified of changes via webhook
Language: Python - Size: 10.1 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2
oelin/flare
Progressive streaming of large datasets.
Size: 47.9 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
stancsz/akka-stream-processor 📦
Stream processor that consumes two topics and processes the data
Language: Scala - Size: 7.62 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0
jlumbroso/java-random-hash
A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
Language: Java - Size: 726 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 9 - Forks: 0
fernandito77777/AWSDataAnalyticsPostgreWorkshop
Workshop on RDS PostgreSQL database integration, offloading to a data lake, and visualizing the data in QuickSight
Size: 5.63 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
maulanaakbardj/Tigergraph_Streams
Example TigerGraph implementation of TG data streaming with Kafka
Size: 1.57 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
officialarijit/Fed-ReMECS-mqtt
A Federated Learning Method for Real-time Emotion State Classification from Multi-modal Streaming
Language: Python - Size: 146 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1
PubNubDevelopers/Real-Time-Data-Streaming-Tutorial
Tutorial for PubNub's real-time data streaming capabilities including HTTP Streaming
Language: Python - Size: 1.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
darkfennertrader/Optimizing-Public-Trasportation
Udacity Data Streaming project based on Apache Kafka
Language: Python - Size: 348 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
iamniklas/Server-Client-Dual-Antenna
Language: Kotlin - Size: 82 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
Waiguru254/surveycto
Manipulation of Data Collected through XLSFORM-compliant Platforms
Language: R - Size: 285 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
the-timoye/songstreams
Streams simulated events using Kafka & Spark, from a music application to a data lake (AWS S3), and then a warehouse (AWS Redshift)
Language: Python - Size: 21.9 MB - Last synced: 12 months ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1
akurgat/energy_trading_prediction
Language: Jupyter Notebook - Size: 13.2 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1
jlumbroso/affirmative-sampling
Reference implementation of the Affirmative Sampling algorithm by Jérémie Lumbroso and Conrado Martínez (2022). 🍀
Language: Python - Size: 794 KB - Last synced: 21 days ago - Pushed: almost 2 years ago - Stars: 3 - Forks: 0
ThiagoBarradas/logstash-beats-demo
Elastic Stack with Nginx, Logstash and Beats demo
Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 4 - Forks: 3
nyoungstudios/multiflow
A Python multithreading library for data processing pipelines, data streaming, etc.
Language: Python - Size: 370 KB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
lpraat/SoccerStreams
Fault-tolerant streaming pipeline for real-time soccer match analysis.
Language: Scala - Size: 2.68 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1
chaitanyakasaraneni/streamingDataPipeline
Example of building a data streaming pipeline using GCP
Language: Python - Size: 1.08 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
saveriogzz/weather-collector Fork of goeh/weather-collector
This Java program reads data from a weather station (Davis Vantage Pro2) and stores it in a SQL database
Language: Java - Size: 389 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
jerryloyn/tweet-crypto-kafka-spark-streaming
To stream real-time Twitter feeds and visualize the real-time crypto hashtag count
Language: Jupyter Notebook - Size: 50.8 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
jo-jstrm/rime-data-streaming-iot
Code for the conference paper "Streaming Data through the IoT via Actor-Based Semantic Routing Trees" from VLIOT@VLDB 2021.
Language: Jupyter Notebook - Size: 30.3 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
rahiakela/data-structures-and-algorithms-specialization
Data Structures and Algorithms Specialization | Coursera
Language: Java - Size: 1.25 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
ManikHossain08/Kafka-Stream-Data-Pipeline-Near-Real-Time
Stream data into pipeline in near-real-time using Kafka
Language: Scala - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0
dattranm/tweetspark
Twitter Sentiment Analysis using Spark Streaming
Language: Python - Size: 202 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
luhlitt/data-streaming-with-kafka
Public Transit Status with Apache Kafka
Language: Python - Size: 389 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
AbdullahMu/Data-Streaming-Nanodegree-Project_01-Optimizing-Public-Transportation
Construct a streaming event pipeline around Apache Kafka and its ecosystem. Using public data from the Chicago Transit Authority, we will construct an event pipeline around Kafka that allows us to simulate and display the status of train lines in real time.
Language: Python - Size: 512 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1
zabir-nabil/picast
A lightweight fast data streaming library for raspberry pi in python.
Language: Python - Size: 162 KB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 5 - Forks: 1
Agrivatehq/Kafka
Kafka development and production repo for all data streaming
Language: Python - Size: 7.81 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
eshnil2000/streaming_nd Fork of shabie/streaming_nd
Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions
Size: 9.37 MB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
OmalPerera/thermoSensor-data-streamer
Language: Scala - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0
7erry/jetliner
The missing command line interface for Jet:
Language: Java - Size: 8.33 MB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0