Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-streaming

officialarijit/RPWE

Reward Penalty Weighted Ensemble approach for multimodal data stream classification

Language: Jupyter Notebook - Size: 1.74 MB - Last synced: about 20 hours ago - Pushed: about 21 hours ago - Stars: 4 - Forks: 0

apache/doris-kafka-connector

Kafka Connector for Apache Doris

Language: Java - Size: 393 KB - Last synced: 10 days ago - Pushed: 14 days ago - Stars: 6 - Forks: 8

MasWag/monaa

A Tool for Timed Patten Matching with Automata-Based Acceleration

Language: C++ - Size: 2.13 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 9 - Forks: 0

DagsHub/client

DagsHub client libraries

Language: Python - Size: 3.09 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 90 - Forks: 23

daq-tools/lorrystream

A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.

Language: Python - Size: 110 KB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

superstreamlabs/memphis

Memphis.dev is a highly scalable and effortless data streaming platform

Language: Go - Size: 465 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 3,169 - Forks: 210

apache/inlong

Apache InLong - a one-stop, full-scenario integration framework for massive data

Language: Java - Size: 52.7 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 1,310 - Forks: 487

marcusmcb/serato-nowplaying-twitch

A chat-bot script for Twitch that allows your viewers to interact with your Serato DJ play history in real-time.

Language: JavaScript - Size: 7.71 MB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0

linkedin/brooklin

An extensible distributed system for reliable nearline data streaming at scale

Language: Java - Size: 5.61 MB - Last synced: 8 days ago - Pushed: 10 days ago - Stars: 891 - Forks: 134

MauricioVazquezM/Spark_BigData_Architecture_Project

Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM

Language: Python - Size: 47.9 KB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0

getindata/dbt-flink-adapter

Adapter for dbt that executes dbt pipelines on Apache Flink

Language: Python - Size: 14 MB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 70 - Forks: 8

kanenorman/crypto-streaming

Real time data streaming and analysis

Language: Python - Size: 411 KB - Last synced: 9 days ago - Pushed: 2 months ago - Stars: 1 - Forks: 1

accoffin12/streaming-03-bonus-acoffin

Demonstrating the application of RabbitMQ and the Pika Library that enables communication through an intermediary using MTA Subway Data from NYC.

Language: Python - Size: 668 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 0 - Forks: 0

getindata/flink-http-connector

Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.

Language: Java - Size: 533 KB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 126 - Forks: 36

tealtools/puls

The simplest way to run a local multi-cluster multi-broker Apache Pulsar instance 🚀

Language: Rust - Size: 145 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

strimzi/strimzi-canary

Strimzi canary

Language: Go - Size: 321 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 39 - Forks: 24

PubNubDevelopers/Real-Time-Data-Streaming-Demo-Android

Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more (Android)

Language: Kotlin - Size: 31.3 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

AVINESHWAR-KRISHNA/DataStream-SQLServer

DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.

Language: Python - Size: 20.5 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

TouK/nussknacker

Low-code tool for automating actions on real time data | Stream processing for the users.

Language: Scala - Size: 139 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 599 - Forks: 89

strimzi/strimzi-kafka-operator

Apache Kafka® running on Kubernetes

Language: Java - Size: 69.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 4,437 - Forks: 1,210

MengLinMaker/ESP32-data-stream-comparisons 📦

Comparing data streaming methods from ESP32

Language: C++ - Size: 39.1 KB - Last synced: about 17 hours ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0

tealtools/dekaf

Dekaf is a visual user interface for Apache Pulsar 🛸

Size: 51.8 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 3 - Forks: 0

kanenorman/mobility-ai

Full Stack Machine Learning Engineer Project - Real Time Data Streaming and Predictions

Language: Python - Size: 122 MB - Last synced: 8 days ago - Pushed: about 1 month ago - Stars: 4 - Forks: 0

goldsmocap/mocap-streamer

Goldsmiths Mocap Streamer is a cutting-edge tool that streams live motion capture (BVH) data over the internet, allowing users from multiple remote locations to interact in the same shared digital space.

Language: TypeScript - Size: 105 MB - Last synced: 10 days ago - Pushed: over 1 year ago - Stars: 8 - Forks: 1

MengLinMaker/IMU-Webserial-Visualiser Fork of Autodrop3d/serialTerminal.com

Visualising IMU orientation using "Three.js" via the experimental "Web Serial API".

Language: JavaScript - Size: 95.7 KB - Last synced: about 17 hours ago - Pushed: 11 months ago - Stars: 3 - Forks: 2

ppatierno/devday-meet-apache-kafka

Meet Apache Kafka

Size: 223 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

cuihantao/ltbnet Fork of enliten/ltbnet

Large Scale Test Bed Network Emulator (CURENT @ UTK)

Language: Python - Size: 219 KB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 2

sabharish-21/covid-analysis-mongoDB-explored

A covid data analysis application mainly focused on the applications of mongoDB databse

Language: Jupyter Notebook - Size: 4.3 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

jaehyeon-kim/kafka-pocs

Apache Kafka and Related Projects

Language: Shell - Size: 56.1 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 9 - Forks: 7

joeycumines/go-longpoll

Package longpoll supports batching e.g. receiving as many values as possible from a channel.

Language: Go - Size: 7.81 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

marian-nmt/sotastream

A library for data streaming and augmentation

Language: Python - Size: 536 KB - Last synced: 13 days ago - Pushed: 2 months ago - Stars: 20 - Forks: 2

pravega/pravega-samples

Sample Applications for Pravega.

Language: Java - Size: 51.8 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 54 - Forks: 61

robertlit/location-data-streaming

Data processing pipeline for detection of motion based events in a stream of real-time location updates

Language: Java - Size: 77.1 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

turulix/memphis-rust-community

A Memphis SDK written in rust.

Language: Rust - Size: 256 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 0

aymane-maghouti/Real-Time-Data-Pipeline-Using-Kafka

This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics data from the local computer, processes it through Kafka brokers, and loads it into a SQL Server database. Additionally, a real-time dashboard is created using Power BI.

Language: Python - Size: 380 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

eserie/wax-ml

A Python library for machine-learning and feedback loops on streaming data

Language: Python - Size: 10.3 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 57 - Forks: 5

jaehyeon-kim/flink-demos

Apache Flink (Pyflink) and Related Projects

Language: Python - Size: 2.18 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 7 - Forks: 1

sorieux/trino-kafka-demo

Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.

Language: Python - Size: 14.6 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

jeffreywangzhi/AWS_Kinesis-Data-Streaming

Real-time data streaming implementation with AWS Kinesis.

Language: Java - Size: 17.6 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

blitzkz23/final-project-end-to-end-banking-campaign-pipeline

Final Project for IYKRA Data Fellowship 8 Program, creating an end-to-end banking campaign pipeline using lambda architecture (providing acess to batch and stream processing)

Language: Python - Size: 242 MB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 8 - Forks: 2

grc-iit/HFlow

HFlow is a platform for I/O forwarding managed elastically, dynamically, and actively

Language: C++ - Size: 3.61 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

IsaacMwendwa/Streaming-Data-with-AWS-Kinesis-and-Lambda

Working with real-time data - Streaming Data with Amazon Kinesis and AWS Lambda

Size: 3.91 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

gerardodavidlopezcastillo/Apache-KafkaDruidEC2_Public

Process that generates random events over thousands of records from a technology e-commerce site for processing in Apache Kafka and analysis in Druid instantiated from EC2, WSL2 Ubuntu, Key .pem

Language: Jupyter Notebook - Size: 1.93 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

jlumbroso/python-random-hash

A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲

Language: Python - Size: 32.2 KB - Last synced: 10 days ago - Pushed: almost 2 years ago - Stars: 9 - Forks: 0

iamirmasoud/data_streaming

Hands on data streaming

Language: Jupyter Notebook - Size: 14.1 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

terranovafr/PersonalityClustering

ML clustering techniques for grouping users according to their personality

Language: Jupyter Notebook - Size: 45.8 MB - Last synced: 9 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

DataFloz/Metro

Metro is pipelines orchestrator platform for stream data.

Language: Python - Size: 642 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

janzenisek/dsg

C# .NET data stream generator

Language: C# - Size: 9.54 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 3

leonardohss0/data-streaming-with-kafka

Keywords: Python, Data Processing, Kafka, Data Streaming, Docker

Language: Python - Size: 60.5 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

ProjectIxian/Ixian-S2

Ixian S2 end to end data streaming network software

Language: C# - Size: 297 KB - Last synced: 30 days ago - Pushed: about 2 months ago - Stars: 5 - Forks: 3

Tzur1234/Optimizing-Public-Transportation

I built a real-time event pipeline using Apache Kafka and its ecosystem, simulating and displaying Chicago Transit Authority train line statuses using public data.

Language: Python - Size: 319 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

EstevaoUyra/streaming-san-francisco-crime-data

A projects developed while learning data streaming

Language: Python - Size: 7.88 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

vhtnguyen/Kafka_Youtube-watcher

An app to keep track of Youtube videos and sends the notification to a Telegram bot to inform you if anyone comments on those

Language: Python - Size: 561 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

minhloannguyen/postgres-cdc

Change Data Capture with kafka, debezium, apache flink for posgres database

Language: Java - Size: 337 KB - Last synced: 29 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ppatierno/kafka-hybrid-iot

Apache Kafka for the Hybrid IoT

Language: JavaScript - Size: 6.34 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 10 - Forks: 0

AlexImb/automl-streams-research-paper

AutoML Techniques for Data Streams - Research Paper

Language: TeX - Size: 9.23 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

build-on-aws/building-apache-kafka-connectors

Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.

Language: Java - Size: 56.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 24 - Forks: 4

dmpalyvos/erebus

Erebus: Explaining the Outputs of Data Streaming Queries

Language: Java - Size: 257 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1

enliten/ltbnet

Large Scale Test Bed Network Emulator

Language: Python - Size: 250 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 5

AbdullahMu/Data-Streaming-Nanodegree-Project_02-Evaluate-Human-Balance-with-Spark-Streaming

Design data streaming architecture and API for a real-life application called the Step Trending Electronic Data Interface (STEDI). It is a working application used to assess fall risk for seniors. When a senior takes a test, they are scored using an index which reflects the likelihood of falling, and potentially sustaining an injury in the course of walking. STEDI uses a Redis datastore for risk score and other data. The Data Science team has completed a working graph for population risk at a STEDI clinic. The problem is the data is not populated yet. You will work with Kafka Connect Redis Source events and Business Events to create a Kafka topic containing anonymized risk scores of seniors in the clinic.

Language: Python - Size: 827 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0

PubNubDevelopers/Real-Time-Data-Streaming-Demo

Demonstration of PubNub's real-time data streaming capabilities from Twitter, Wikipedia & more

Language: JavaScript - Size: 1.53 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 0

hnjm/memphis-broker Fork of memphisdev/memphis

Next-Generation Real-Time Data Processing Platform

Size: 183 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

KentHsu/Udacity-Data-Streaming-Nanodegree

Udacity Data Streaming Nanodegree Program

Language: Python - Size: 624 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 14 - Forks: 11

bcgov/data-stream

Subscribe to datasets and be notified of changes via webhook

Language: Python - Size: 10.1 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2

oelin/flare

Progressive streaming of large datasets.

Size: 47.9 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

stancsz/akka-stream-processor 📦

Stream processor that's consuming two topics and process data

Language: Scala - Size: 7.62 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0

jlumbroso/java-random-hash

A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲

Language: Java - Size: 726 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 9 - Forks: 0

fernandito77777/AWSDataAnalyticsPostgreWorkshop

Workshop Database RDS Postgre Integration and offloading to Data Lake, and visualize the data to QuickSight

Size: 5.63 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

maulanaakbardj/Tigergraph_Streams

Example Tigergraph implementation of the TG data Streaming with Kafka

Size: 1.57 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

officialarijit/Fed-ReMECS-mqtt

A Federated Learning Method for Real-time Emotion State Classification from Multi-modal Streaming

Language: Python - Size: 146 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1

PubNubDevelopers/Real-Time-Data-Streaming-Tutorial

Tutorial for PubNub's real-time data streaming capabilities including HTTP Streaming

Language: Python - Size: 1.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

darkfennertrader/Optimizing-Public-Trasportation

Udacity Data Streaming project based on Apache Kafka

Language: Python - Size: 348 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

iamniklas/Server-Client-Dual-Antenna

Language: Kotlin - Size: 82 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

Waiguru254/surveycto

Manipulation of Data Collected through XLSFORM-compliant Platforms

Language: R - Size: 285 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

the-timoye/songstreams

Streams simulated events using Kafka & Spark, from a music application to a data lake (AWS S3), and then a warehouse (AWS Redshift)

Language: Python - Size: 21.9 MB - Last synced: 12 months ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1

akurgat/energy_trading_prediction

Language: Jupyter Notebook - Size: 13.2 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1

jlumbroso/affirmative-sampling

Reference implementation of the Affirmative Sampling algorithm by Jérémie Lumbroso and Conrado Martínez (2022). 🍀

Language: Python - Size: 794 KB - Last synced: 21 days ago - Pushed: almost 2 years ago - Stars: 3 - Forks: 0

ThiagoBarradas/logstash-beats-demo

Elastic Stack with Nginx, Logstash and Beats demo

Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 4 - Forks: 3

nyoungstudios/multiflow

A Python multithreading library for data processing pipelines, data streaming, etc.

Language: Python - Size: 370 KB - Last synced: 10 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

lpraat/SoccerStreams

Fault-tolerant streaming pipeline for real-time soccer match analysis.

Language: Scala - Size: 2.68 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1

chaitanyakasaraneni/streamingDataPipeline

Example to build a data streaming using GCP

Language: Python - Size: 1.08 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

saveriogzz/weather-collector Fork of goeh/weather-collector

This Java program read data from a weather station (Davis Vantage Pro2) and store it in a SQL database

Language: Java - Size: 389 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

jerryloyn/tweet-crypto-kafka-spark-streaming

To stream real-time Twitter feeds and visualize the real-time crypto hashtag count

Language: Jupyter Notebook - Size: 50.8 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

jo-jstrm/rime-data-streaming-iot

Code for the conference paper "Streaming Data through the IoT via Actor-Based Semantic Routing Trees" from VLIOT@VLDB 2021.

Language: Jupyter Notebook - Size: 30.3 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

rahiakela/data-structures-and-algorithms-specialization

Data Structures and Algorithms Specialization | Coursera

Language: Java - Size: 1.25 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

ManikHossain08/Kafka-Stream-Data-Pipeline-Near-Real-Time

Stream data into pipeline in near-real-time using Kafka

Language: Scala - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

dattranm/tweetspark

Twitter Sentiment Analysis using Spark Streaming

Language: Python - Size: 202 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

luhlitt/data-streaming-with-kafka

Public Transit Status with Apache Kafka

Language: Python - Size: 389 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

AbdullahMu/Data-Streaming-Nanodegree-Project_01-Optimizing-Public-Transportation

Construct a streaming event pipeline around Apache Kafka and its ecosystem. Using public data from the Chicago Transit Authority, we will construct an event pipeline around Kafka that allows us to simulate and display the status of train lines in real time.

Language: Python - Size: 512 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 1

zabir-nabil/picast

A lightweight fast data streaming library for raspberry pi in python.

Language: Python - Size: 162 KB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 5 - Forks: 1

Agrivatehq/Kafka

Kafka Devlopment and Production repo for all data streamings

Language: Python - Size: 7.81 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

eshnil2000/streaming_nd Fork of shabie/streaming_nd

Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions

Size: 9.37 MB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

OmalPerera/thermoSensor-data-streamer

Language: Scala - Size: 18.6 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

7erry/jetliner

The missing command line interface for Jet:

Language: Java - Size: 8.33 MB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0