An open API service providing repository metadata for many open source software ecosystems.

Topic: "stream-processing"

pathwaycom/pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 50,738 - Forks: 1,454

google-ai-edge/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language: C++ - Size: 578 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 32,560 - Forks: 5,666

vectordotdev/vector

A high-performance observability data pipeline.

Language: Rust - Size: 124 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 20,946 - Forks: 1,950

zhisheng17/flink-learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Language: Java - Size: 41.6 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 14,800 - Forks: 3,940

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

Size: 845 KB - Last synced at: 5 days ago - Pushed at: 28 days ago - Stars: 14,102 - Forks: 2,589

redpanda-data/redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

Language: C++ - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11,469 - Forks: 704

madd86/awesome-system-design

A curated list of awesome System Design (A.K.A. Distributed Systems) resources.

Size: 1.71 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 11,304 - Forks: 1,235

ThreeDotsLabs/watermill

Building event-driven applications the easy way in Go.

Language: Go - Size: 26.7 MB - Last synced at: 24 days ago - Pushed at: 26 days ago - Stars: 9,280 - Forks: 479

risingwavelabs/risingwave

Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.

Language: Rust - Size: 186 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 8,592 - Forks: 714

redpanda-data/connect

Fancy stream processing made operationally mundane

Language: Go - Size: 36.6 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 8,536 - Forks: 901

fluent/fluent-bit

Fast and Lightweight Logs, Metrics and Traces processor for Linux, BSD, OSX and Windows

Language: C - Size: 128 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 7,521 - Forks: 1,843

robinhood/faust

Python Stream Processing

Language: Python - Size: 8.3 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 6,823 - Forks: 536

hazelcast/hazelcast

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

Language: Java - Size: 389 MB - Last synced at: about 8 hours ago - Pushed at: 2 days ago - Stars: 6,540 - Forks: 1,874

MaterializeInc/materialize

The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL

Language: Rust - Size: 288 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6,192 - Forks: 487

apache/hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java - Size: 2.36 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 6,047 - Forks: 2,454

online-ml/river

🌊 Online machine learning in Python

Language: Python - Size: 325 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 5,641 - Forks: 603

fluvio-community/fluvio

🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.

Language: Rust - Size: 34.6 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5,115 - Forks: 522

javascriptdata/danfojs

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

Language: TypeScript - Size: 79.1 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 5,034 - Forks: 222

ag2ai/faststream

FastStream is a powerful and easy-to-use asynchronous Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

Language: Python - Size: 48.3 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 4,846 - Forks: 309

ArroyoSystems/arroyo

Distributed stream processing engine in Rust

Language: Rust - Size: 15.6 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 4,705 - Forks: 325

memgraph/memgraph

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

Language: C++ - Size: 83.4 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3,531 - Forks: 193

manuzhang/awesome-streaming

a curated list of awesome streaming frameworks, applications, etc

Size: 2.72 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 2,927 - Forks: 311

PeerDB-io/peerdb

Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage

Language: Go - Size: 19.9 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 2,850 - Forks: 163

douban/dpark 📦

Python clone of Spark, a MapReduce alike framework in Python

Language: Python - Size: 2.65 MB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 2,679 - Forks: 530

pipelinedb/pipelinedb

High-performance time-series aggregation for PostgreSQL

Language: C - Size: 46.7 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 2,643 - Forks: 243

numaproj/numaflow

Kubernetes-native platform to run massively parallel data/streaming jobs

Language: Rust - Size: 53.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2,419 - Forks: 147

francoispqt/gojay

high performance JSON encoder/decoder with stream API for Golang

Language: Go - Size: 5.61 MB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 2,130 - Forks: 116

reugn/go-streams

A lightweight stream processing library for Go

Language: Go - Size: 561 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,125 - Forks: 172

timeplus-io/proton

⚡ Fastest SQL ETL pipeline in a single C++ binary, built for stream processing, observability, analytics and AI/ML

Language: C++ - Size: 566 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 2,103 - Forks: 97

yomorun/yomo

🦖 Serverless AI Agent Framework with Geo-distributed Edge AI Infra.

Language: Go - Size: 66 MB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 1,873 - Forks: 139

bytewax/bytewax

Python Stream Processing

Language: Python - Size: 12 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 1,861 - Forks: 96

nerevu/riko

A Python stream processing engine modeled after Yahoo! Pipes

Language: Python - Size: 2.58 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1,599 - Forks: 75

siddhi-io/siddhi

Stream Processing and Complex Event Processing Engine

Language: Java - Size: 33.6 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 1,554 - Forks: 531

quixio/quix-streams

Python Streaming DataFrames for Kafka

Language: Python - Size: 9.64 MB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 1,494 - Forks: 91

WallarooLabs/wally

Distributed Stream Processing

Language: Pony - Size: 268 MB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 1,481 - Forks: 68

halaxa/json-machine

Efficient, easy-to-use, and fast PHP JSON stream parser

Language: PHP - Size: 1.51 MB - Last synced at: 16 days ago - Pushed at: 20 days ago - Stars: 1,271 - Forks: 73

spring-attic/spring-cloud-dataflow 📦

A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes

Language: Java - Size: 68.3 MB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 1,136 - Forks: 591

hazelcast/hazelcast-jet 📦

Distributed Stream and Batch Processing

Language: Java - Size: 171 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1,110 - Forks: 207

uhop/stream-json

The micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API.

Language: JavaScript - Size: 899 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 1,108 - Forks: 50

spring-cloud/spring-cloud-stream

Framework for building Event-Driven Microservices

Language: Java - Size: 19.3 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 1,056 - Forks: 629

clj-commons/manifold

A compatibility layer for event-driven abstractions

Language: Clojure - Size: 1.5 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1,039 - Forks: 109

edp963/wormhole

Wormhole is a SPaaS (Stream Processing as a Service) Platform

Language: JavaScript - Size: 45.5 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 978 - Forks: 479

ververica/flink-sql-cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

Language: Dockerfile - Size: 14.1 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 907 - Forks: 209

nodefluent/kafka-streams

equivalent to kafka-streams :octopus: for nodejs :sparkles::turtle::rocket::sparkles:

Language: TypeScript - Size: 4.14 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 839 - Forks: 111

vladimirvivien/automi

A stream processing API for Go (now with generic type support)

Language: Go - Size: 2.72 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 799 - Forks: 63

gazette/core

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

Language: Go - Size: 5.02 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 780 - Forks: 57

pipeless-ai/pipeless

An open-source computer vision framework to build and deploy apps in minutes

Language: Rust - Size: 142 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 774 - Forks: 41

gearpump/gearpump

Lightweight real-time big data streaming engine over Akka

Language: Scala - Size: 26.2 MB - Last synced at: 5 days ago - Pushed at: almost 4 years ago - Stars: 758 - Forks: 152

lw-lin/streaming-readings

Streaming System 相关的论文读物

Size: 6.84 KB - Last synced at: 7 months ago - Pushed at: almost 4 years ago - Stars: 733 - Forks: 154

hstreamdb/hstream

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

Language: Haskell - Size: 6.28 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 727 - Forks: 55

TouK/nussknacker

Low-code tool for automating actions on real time data | Stream processing for the users.

Language: Scala - Size: 206 MB - Last synced at: about 6 hours ago - Pushed at: 1 day ago - Stars: 703 - Forks: 103

apache/streampipes

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

Language: Java - Size: 315 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 695 - Forks: 215

infoslack/awesome-kafka

A list about Apache Kafka

Size: 96.7 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 583 - Forks: 165

microsoft/psi

Platform for Situated Intelligence

Language: C# - Size: 37.2 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 565 - Forks: 103

leesf/hudi-resources

汇总Apache Hudi相关资料

Size: 32.2 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 561 - Forks: 160

deep5050/radio-active

Play any radios around the globe right from the terminal :zap:

Language: Python - Size: 553 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 557 - Forks: 29

constellation-rs/amadeus

Harmonious distributed data analysis in Rust.

Language: Rust - Size: 2.46 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 482 - Forks: 25

nmaquet/kasper 📦

Kasper is a lightweight library for processing Kafka topics.

Language: Go - Size: 292 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 438 - Forks: 23

FutureSDR/FutureSDR

An Async SDR Runtime for Heterogeneous Architectures

Language: Rust - Size: 28.1 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 412 - Forks: 67

Kurento/kurento

Kurento WebRTC Media Server

Language: C - Size: 132 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 412 - Forks: 76

smooks/smooks

An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration

Language: Java - Size: 29.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 406 - Forks: 360

RedisGears/RedisGears

Dynamic execution framework for your Redis data

Language: Rust - Size: 4.92 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 380 - Forks: 67

Point72/csp

csp is a high performance reactive stream processing library, written in C++ and Python

Language: Python - Size: 5.36 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 373 - Forks: 75

graphform/swim-rust

Self-contained distributed software platform for building stateful, massively real-time streaming applications in Rust.

Language: Rust - Size: 25.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 352 - Forks: 12

p2panda/p2panda

All the things a panda needs

Language: Rust - Size: 3.94 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 348 - Forks: 25

AICoE/log-anomaly-detector 📦

Log Anomaly Detection - Machine learning to detect abnormal events logs

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 339 - Forks: 137

memgraph/mage

MAGE - Memgraph Advanced Graph Extensions :crystal_ball:

Language: C++ - Size: 22.5 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 325 - Forks: 34

AxisCommunications/media-stream-library-js 📦

JavaScript library to handle media streams on the command line (Node.js) and in the browser.

Language: TypeScript - Size: 872 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 311 - Forks: 105

vectorengine/vectorsql 📦

VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.

Language: Go - Size: 1.36 MB - Last synced at: 8 months ago - Pushed at: about 4 years ago - Stars: 292 - Forks: 53

Chulong-Li/Real-time-Sentiment-Tracking-on-Twitter-for-Brand-Improvement-and-Trend-Recognition

A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)

Language: Jupyter Notebook - Size: 5.78 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 283 - Forks: 127

streamnative/pulsar-flink 📦

Elastic data processing with Apache Pulsar and Apache Flink

Language: Java - Size: 2.16 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 279 - Forks: 120

raystack/dagger

Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.

Language: Java - Size: 12 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 277 - Forks: 42

confluentinc/ksql

The database purpose-built for stream processing applications.

Language: Java - Size: 226 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 273 - Forks: 1,042

bbejeck/kafka-streams-in-action

Source code for the Kafka Streams in Action Book

Language: Java - Size: 176 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 251 - Forks: 177

cross-platform/dspatch

The Refreshingly Simple Cross-Platform C++ Dataflow / Patching / Pipelining / Graph Processing / Stream Processing / Reactive Programming Framework

Language: C++ - Size: 6.15 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 249 - Forks: 45

talariadb/talaria

TalariaDB is a distributed, highly available, and low latency time-series database for Presto

Language: Go - Size: 12.8 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 227 - Forks: 32

streamnative/function-mesh

The serverless framework purpose-built for event streaming applications.

Language: Go - Size: 55.8 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 224 - Forks: 29

schollz/pluck

Pluck text in a fast and intuitive way :rooster:

Language: Go - Size: 6.55 MB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 215 - Forks: 6

TomasMikula/libretto

Declarative concurrency and stream processing library for Scala

Language: Scala - Size: 15.5 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 213 - Forks: 7

logrange/logrange

High performance data aggregating storage

Language: Go - Size: 69.2 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 191 - Forks: 16

miroslavpejic85/mirotalkbro

📡 MiroTalk WebRTC Live Broadcast allows to broadcast live video, audio and screen stream to all connected users (viewers).

Language: JavaScript - Size: 1.89 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 186 - Forks: 47

neo4j-contrib/neo4j-streams

Neo4j Kafka Connector

Language: Kotlin - Size: 3.82 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 179 - Forks: 74

bytewatch/dolphinbeat

A server that pulls and parses MySQL binlog, pushs change data into different sinks like Kafka.

Language: Go - Size: 2.97 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 176 - Forks: 30

Claviz/xlstream

Turns XLSX into a readable stream.

Language: TypeScript - Size: 641 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 175 - Forks: 19

pemrouz/fero

light, fast, scalable, streaming microservices made easy

Language: JavaScript - Size: 1.05 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 174 - Forks: 4

akka/akka-stream-contrib

Add-ons to Akka Stream

Language: Scala - Size: 315 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 173 - Forks: 78

vsouza/go-kafka-example

Golang Kafka consumer and producer example

Language: Go - Size: 505 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 172 - Forks: 59

fluxus-labs/fluxus

Fluxus Stream Processing Engine

Language: Rust - Size: 5.07 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 169 - Forks: 22

GridProtectionAlliance/gsf

Grid Solutions Framework

Language: C# - Size: 249 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 168 - Forks: 73

mariomac/gostream

A Go port of the Java Streams API. Type-safe and functional Go Streams processing

Language: Go - Size: 297 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 167 - Forks: 11

whitaker-io/machine

Machine is a workflow/pipeline library for processing data

Language: Go - Size: 1.39 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 164 - Forks: 12

LearningJournal/Kafka-Streams-Real-time-Stream-Processing

This is the central repository for all materials related to Kafka Streams : Real-time Stream Processing! Book by Prashant Pandey.

Language: Java - Size: 5.25 MB - Last synced at: 9 months ago - Pushed at: over 5 years ago - Stars: 164 - Forks: 163

synacker/daggy

Daggy - Data Aggregation Utility and C/C++ developer library for data streams catching

Language: C++ - Size: 7.36 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 157 - Forks: 16

psila-ai/callysto

Stream processing & Service framework.

Language: Rust - Size: 3.67 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 156 - Forks: 9

GridProtectionAlliance/openPDC

Open Source Phasor Data Concentrator

Language: C# - Size: 2.66 GB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 150 - Forks: 58

ElAlev/Wayeb

Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.

Language: Scala - Size: 8.17 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 147 - Forks: 6

samsara/samsara

Samsara is a real-time analytics platform

Language: Clojure - Size: 22.8 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 147 - Forks: 17

runreveal/kawa

A blazingly fast event stream processing library powering the reveald event processing daemon.

Language: Go - Size: 1 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 144 - Forks: 3

zakariamaaraki/RemoteCodeCompiler

An online code compiler supporting 11 programming languages (Java, Kotlin, Scala, C, C++, C#, Golang, Python, Ruby, Rust and Haskell) for competitive programming and coding interviews.

Language: Java - Size: 3.34 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 135 - Forks: 47

mjwestcott/runnel

Distributed event processing for Python based on Redis Streams

Language: Python - Size: 6.58 MB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 133 - Forks: 11