GitHub topics: streaming-data
kafbat/kafka-ui
Open-Source Web UI for managing Apache Kafka clusters
Language: Java - Size: 34.9 MB - Last synced at: about 22 hours ago - Pushed at: 5 days ago - Stars: 1,181 - Forks: 148

GridProtectionAlliance/openPDC
Open Source Phasor Data Concentrator
Language: C# - Size: 2.48 GB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 138 - Forks: 57

johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Language: Go - Size: 201 MB - Last synced at: about 10 hours ago - Pushed at: 9 days ago - Stars: 9,329 - Forks: 224

MinaEssam16/stream-to-river
Streams to River is a microservice system designed for effective English learning, utilizing the Hertz and Kitex frameworks. This project integrates features like real-time chat and speech recognition to enhance the learning experience. 🛠️🌊
Language: Go - Size: 5.93 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

readysettech/readyset
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.
Language: Rust - Size: 229 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4,998 - Forks: 143

GridProtectionAlliance/gsf
Grid Solutions Framework
Language: C# - Size: 249 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 165 - Forks: 70

goodboy/tractor
A distributed, structured concurrency runtime for Python (and friends)
Language: Python - Size: 2.99 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 287 - Forks: 11

bytewax/bytewax
Python Stream Processing
Language: Python - Size: 12 MB - Last synced at: about 1 hour ago - Pushed at: 3 months ago - Stars: 1,765 - Forks: 82

jsa-aerial/aerobio
Extensible full DAG streaming computation server with services and jobs for RNA-Seq, Tn-Seq, WG-Seq and Term-Seq.
Language: Clojure - Size: 1.04 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 1

certeu/morio
Connect - Stream - Observe - Respond | Morio provides the plumbing for your observability needs
Language: JavaScript - Size: 27.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 27 - Forks: 3

memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
Language: C++ - Size: 43.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,999 - Forks: 164

luisKING2008/Stream-Omni
Stream-Omni enables seamless interactions across text, vision, and speech using a large language model. This repository includes the model, datasets, and tools for developers to explore multimodal capabilities. 🌟🌐
Language: Python - Size: 10.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

SamarthKulkarni20/keyed-batched-items-accumulator
A lightweight utility for Node.js projects that accumulates items into fixed-size batches per key, preserving insertion order within each key. Streams items directly into their respective batches at runtime, eliminating the overhead of post-processing 1D arrays into chunks. It abstracts key-based batch management.
Language: TypeScript - Size: 52.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

redpanda-data/docs
Open source content for the Redpanda documentation
Language: Gherkin - Size: 28 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 8 - Forks: 45

redpanda-data/cloud-docs
Redpanda Cloud documentation
Language: JavaScript - Size: 8.47 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 4

MaterializeInc/materialize
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
Language: Rust - Size: 260 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6,024 - Forks: 472

Correia-jpv/fucking-awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴
Size: 655 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 1

infoslack/awesome-kafka
A list about Apache Kafka
Size: 96.7 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 579 - Forks: 164

Sinotrade/Shioaji
Shioaji: an all-new cross-platform API for securities trading
Language: Dockerfile - Size: 67.4 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 189 - Forks: 15

aramisfacchinetti/streaming-json-parser
Streaming JSON parser designed to process JSON data incrementally. The primary goal is to handle potentially incomplete JSON data streams, such as those produced by Large Language Models (LLMs), and return the current state of the parsed object at any time.
Language: Python - Size: 27.3 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 2

infinyon/fluvio
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Language: Rust - Size: 34.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,937 - Forks: 517

oxnr/awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Size: 843 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 13,660 - Forks: 2,574

symongpt/twitter-analysis-app
This repository hosts a web application that analyzes tweets to recommend movies and songs based on public sentiment. Users can easily access real-time insights without scrolling through endless Twitter feeds. 🐙🌟
Language: Python - Size: 276 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

ParaGroup/WindFlow
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
Language: C++ - Size: 48.9 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 84 - Forks: 19

piskvorky/smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Language: Python - Size: 1.58 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 3,325 - Forks: 384

redpanda-data/connect
Fancy stream processing made operationally mundane
Language: Go - Size: 35 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8,378 - Forks: 877

karafka/karafka-web
Web UI for monitoring and managing Karafka consumers
Language: Ruby - Size: 7.47 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 55 - Forks: 7

DoneDeal0/superdiff
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, and is super fast.
Language: TypeScript - Size: 471 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 918 - Forks: 8

gyselle-marques/ScreenMatch-CommandLineRunner
TV series streaming from the command line.
Language: Java - Size: 18.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

python-streamz/streamz
Real-time stream processing for python
Language: Python - Size: 2.86 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 1,266 - Forks: 150

quixio/quix-streams
Python Streaming DataFrames for Kafka
Language: Python - Size: 8.51 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,388 - Forks: 80

joshday/OnlineStats.jl
⚡ Single-pass algorithms for statistics
Language: Julia - Size: 91.7 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 867 - Forks: 64

kLabUM/rrcf
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Language: Python - Size: 4.45 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 509 - Forks: 113

AndGeo69/StreamingCotiles
A streaming implementation of the COTILES algorithm using Apache Spark's Structured Streaming API
Language: Python - Size: 2.74 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

maraisr/meros
🪢 A fast utility that makes reading multipart responses simple
Language: TypeScript - Size: 590 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 186 - Forks: 11

online-ml/river
🌊 Online machine learning in Python
Language: Python - Size: 317 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5,373 - Forks: 578

tealtools/awesome-apache-pulsar
A curated list of resources about Apache Pulsar.
Size: 367 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 30 - Forks: 3

scikit-multiflow/scikit-multiflow
A machine learning package for streaming data in Python. The other ancestor of River.
Language: Python - Size: 60.9 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 778 - Forks: 188

Seddryck/Streamistry
Streamistry is a lightweight library designed to support pipeline, streaming, and ETL development for data engineering and integration. Its versatility makes it an excellent tool for building robust, scalable data workflows and optimizing data processing tasks. With features such as accumulators, windows, and sinks, it efficiently handles streaming
Language: C# - Size: 2.24 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

zpl-c/zpl
📐 Pushing the boundaries of simplicity
Language: C - Size: 3.99 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 1,026 - Forks: 48

ekrich/exip
Efficient XML Interchange (EXI) Embeddable C API
Language: C - Size: 5.4 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 7 - Forks: 0

saidsef/aws-kinesis-local
AWS Kinesis local for building applications with streaming data
Language: Dockerfile - Size: 120 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 21 - Forks: 4

selimfirat/pysad
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
Language: Python - Size: 438 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 262 - Forks: 25

maki-nage/rxsci
ReactiveX for data science
Language: Python - Size: 409 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 14 - Forks: 2

NathanP23/Big-Data-Mining-52002
Midterm and Final assignments of the course "Big Data Mining (52002)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science. Focuses on analyzing massive datasets using Python, SQL, cloud computing, and network analysis. Includes project guidelines for scalable data mining techniques and distributed computing.
Language: Jupyter Notebook - Size: 21 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

ori88c/keyed-batched-items-accumulator
A lightweight utility for Node.js projects that accumulates items into fixed-size batches per key, preserving insertion order within each key. Streams items directly into their respective batches at runtime, eliminating the overhead of post-processing 1D arrays into chunks. It abstracts key-based batch management.
Language: TypeScript - Size: 93.8 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

ori88c/batched-items-accumulator
A lightweight utility for Node.js projects that accumulates items into fixed-size batches (number-of-items wise), preserving insertion order. It abstracts batch management, allowing users to focus on application logic. Ideal for delayed processing tasks such as bulk write/publish operations to kafka, databases, blob storage, etc.
Language: TypeScript - Size: 50.8 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Language: Python - Size: 10.4 MB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 476 - Forks: 108

microsoft/Trill
Trill is a single-node query processor for temporal or streaming data.
Language: C# - Size: 10.2 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 1,249 - Forks: 131

davidgao7/research_assistant
your personal research assistant!
Language: Python - Size: 1.05 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

HimanshuMohanty-Git24/StreamLineIRCTC
A real-time data pipeline simulating IRCTC bookings using GCP. It streams mock data via Pub/Sub, transforms it with Dataflow (Python UDF), stores results in BigQuery, and powers live dashboards. Includes error handling and schema validation.
Language: Python - Size: 10.7 KB - Last synced at: 15 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

streamdal/streamdal
Code-Native Data Privacy
Language: TypeScript - Size: 292 MB - Last synced at: 25 days ago - Pushed at: 7 months ago - Stars: 602 - Forks: 15

tinybirdco/mockingbird
Mockingbird is a mock streaming data generator
Language: TypeScript - Size: 2.57 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 120 - Forks: 18

systemaccounting/mxfactorial
A payment application intended for deployment by the United States Treasury that replaces banking with accounting
Language: Rust - Size: 6.83 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 53 - Forks: 26

pathwaycom/pathway-benchmarks
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Language: Python - Size: 4.7 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 70 - Forks: 4

slimandslam/schwab-client-js
A modern wrapper around the Schwab financial API for NodeJS Typescript and Javascript projects. Join our Discord: https://discord.gg/Q9z8EnB8xD
Language: TypeScript - Size: 2.59 MB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 10 - Forks: 7

quantfinlib/screamer
Screamingly fast streaming indicators with C++ performance and Python simplicity.
Language: C++ - Size: 22.4 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

silverton-io/buz
Serverless multi-protocol + multi-destination event collection system.
Language: Go - Size: 30.1 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 205 - Forks: 26

Lucchh/eilof
Efficient Incremental Local Outlier Factor (EILOF) for Online Anomaly Detection
Language: Jupyter Notebook - Size: 8.12 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

stensoosaar/IBKit
Interactive Brokers TWS/Gateway API in Swift
Language: Swift - Size: 1.62 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 20 - Forks: 9

swimos/swim
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Language: Java - Size: 22.8 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 493 - Forks: 41

provectus/kafka-ui
Open-Source Web UI for Apache Kafka Management
Language: Java - Size: 29 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 10,773 - Forks: 1,272

EgorSborschikov/conferences_api
Video conferencing API using REST and WebSocket
Language: Python - Size: 254 KB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

reugn/go-streams
A lightweight stream processing library for Go
Language: Go - Size: 598 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,036 - Forks: 164

joshday/OnlineStatsBase.jl
Base types for OnlineStats.
Language: Julia - Size: 310 KB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 32 - Forks: 14

ylem-co/ylem
Ylem is an open-source platform for real-time data streaming orchestration
Language: JavaScript - Size: 5.87 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 0

Stratio/sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Language: Scala - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 526 - Forks: 196

hstreamdb/hstream
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Language: Haskell - Size: 6.28 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 722 - Forks: 55

thammo4/uvatradier
Python wrapper for the Tradier brokerage API
Language: Python - Size: 827 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 23 - Forks: 14

pravega/pravega
Pravega - Streaming as a new software defined storage primitive
Language: Java - Size: 47 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1,998 - Forks: 409

whitaker-io/machine
Machine is a workflow/pipeline library for processing data
Language: Go - Size: 1.41 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 162 - Forks: 12

igopalakrishna/nyc-subway-foot-traffic-prediction-and-forecasting
Designed and implemented a scalable real-time analytics pipeline using Apache Kafka, Spark Structured Streaming, and MongoDB to simulate NYC MTA turnstile data and forecast real-time subway foot traffic using SparkML Random Forest models.
Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

Menziess/slipstream-async
Slipstream provides a data-flow model to simplify development of stateful streaming applications.
Language: Python - Size: 628 KB - Last synced at: 18 days ago - Pushed at: 2 months ago - Stars: 36 - Forks: 0

SaiRanjithReddyK/aws-retail-realtime-analytics
A real-time data pipeline built with AWS Kinesis, Lambda, S3, Athena, and QuickSight, simulating and analyzing 70+ retail transactions. This project demonstrates how to stream, store, query, and visualize transactional data using fully managed AWS services, along with Python for simulation and Lambda logic, and SQL for analytics in Athena.
Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

zelros/cinnamon 📦
CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system
Language: Python - Size: 2.02 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 7

icicle-lang/icicle
Icicle Streaming Query Language
Language: Haskell - Size: 14.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 3

exajobs/data-engineering-collection
A collection of awesome software, libraries, Learning Tutorials, documents, books, resources and interesting stuff about Big Data Science & Engineering
Size: 241 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

7wilightxdev/market-stream
MarketStream, my personal Flutter playground, is a real-time market data visualization app. It delivers a minimalist, high-performance UI for tracking market prices with a custom candlestick chart. Every pixel and every interaction had to be manually calculated and optimized.
Language: Dart - Size: 974 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 6 - Forks: 2

sonhmai/data-systems-design
System Design, Solution Architecture, Data Systems Practice
Size: 24.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 45 - Forks: 11

maki-nage/makinage
Stream Processing Made Easy
Language: Python - Size: 227 KB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 41 - Forks: 1

wso2/product-streaming-integrator
A stream processing runtime that allows connecting any streaming data source to any destination and acting on it
Language: Python - Size: 152 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 113 - Forks: 52

embetrix/bmap-writer
bmaptool alternative written in C++
Language: C++ - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 12 - Forks: 3

microsoft/data-accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with creation, editing and management of Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Language: C# - Size: 401 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 302 - Forks: 90

iresil/FowlFlightForensics
A Kafka-based CSV parser for bird-related airplane accidents
Language: Java - Size: 5.23 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

MikeJaredS/hermiter
Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Nonparametric Correlation (Bivariate)
Language: R - Size: 8.82 MB - Last synced at: 11 minutes ago - Pushed at: 10 months ago - Stars: 16 - Forks: 3

SAFZZ/real-time-product-activity-pyspark
Real-time Product Activity Tracker using Apache Kafka, PySpark, and PostgreSQL | Built for high-throughput streaming analytics
Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

bakansm/ViTHSD
Vietnamese Hate Speech Detection with real-time data from streaming platforms such as YouTube, Facebook, and TikTok.
Language: Jupyter Notebook - Size: 946 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

microsoft/FabricRTIWorkshop
How to build a Medallion design pattern using Fabric Real-Time Intelligence
Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 12 - Forks: 14

redislabs-training/ru202
This is the repository for the Redis Streams learning path on Redis University.
Language: HTML - Size: 1000 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 37 - Forks: 31

keithknott26/datadash
Visualize and graph data in the terminal
Language: Go - Size: 96.8 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 14

fajarnugraha37/turborepo-nestjs
Full-stack multi-service application using Turborepo, TypeScript, NestJS, Next.js, Prisma, MongoDB, and RabbitMQ.
Language: TypeScript - Size: 579 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

evadne/packmatic
Zipping on the fly — Generate downloadable Zip streams by aggregating File or URL Sources
Language: Elixir - Size: 154 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 160 - Forks: 15

grahman20/ADF
Adaptive Decision Forest (ADF) is an incremental machine learning framework that produces a decision forest to classify new records. ADF can classify new records even if they are associated with previously unseen classes. It can also identify and handle concept drift without forgetting previously gained knowledge. Moreover, ADF can handle big data if the data can be divided into batches.
Language: Java - Size: 1.63 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

vertica/PSTL
Parallel Streaming Transformation Loader
Language: Java - Size: 106 MB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 9 - Forks: 6

ParaGroup/StreamBenchmarks
Suite of Benchmark Applications for Stream Processing Systems
Language: C++ - Size: 44.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 4

Nasruddin/realtime-streaming-kafka-flink-pinot-postgres-superset
Setup for realtime data streaming using Kafka, Flink, Pinot, MySQL, Postgres and Superset
Language: Python - Size: 4.67 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

anhtt2211/social-api
Language: TypeScript - Size: 46.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

antonmry/galiglobal
Blog about Apache Kafka, Apache Flink and Java
Language: HTML - Size: 48.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

seznam/euphoria
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
Language: Java - Size: 3.9 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 81 - Forks: 11

AubainMbk/RealTime-BikeStation-Tracker
This project is a way to demonstrate my data collection skills using APIs. The goal is to build a web app that provides real-time information on bike stations.
Language: Jupyter Notebook - Size: 1.45 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0
