An open API service providing repository metadata for many open source software ecosystems.

Topic: "stream-processing"

google-ai-edge/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language: C++ - Size: 575 MB - Last synced at: about 16 hours ago - Pushed at: about 21 hours ago - Stars: 29,453 - Forks: 5,320

pathwaycom/pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python - Size: 132 MB - Last synced at: about 4 hours ago - Pushed at: about 6 hours ago - Stars: 24,616 - Forks: 359

vectordotdev/vector

A high-performance observability data pipeline.

Language: Rust - Size: 115 MB - Last synced at: about 5 hours ago - Pushed at: about 19 hours ago - Stars: 19,209 - Forks: 1,725

zhisheng17/flink-learning

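Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, internals, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.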

Language: Java - Size: 41.6 MB - Last synced at: 15 minutes ago - Pushed at: about 1 month ago - Stars: 14,744 - Forks: 3,931

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

Size: 843 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 13,568 - Forks: 2,568

redpanda-data/redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

Language: C++ - Size: 107 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 10,229 - Forks: 617

madd86/awesome-system-design

A curated list of awesome System Design (A.K.A. Distributed Systems) resources.

Size: 1.71 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 10,180 - Forks: 1,140

redpanda-data/connect

Fancy stream processing made operationally mundane

Language: Go - Size: 34 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 8,309 - Forks: 863

ThreeDotsLabs/watermill

Building event-driven applications the easy way in Go.

Language: Go - Size: 25.4 MB - Last synced at: about 4 hours ago - Pushed at: 4 days ago - Stars: 8,306 - Forks: 430

risingwavelabs/risingwave

Stream processing and management platform.

Language: Rust - Size: 158 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 7,667 - Forks: 626

fluent/fluent-bit

Fast and Lightweight Logs, Metrics and Traces processor for Linux, BSD, OSX and Windows

Language: C - Size: 104 MB - Last synced at: 24 minutes ago - Pushed at: 25 minutes ago - Stars: 6,791 - Forks: 1,668

robinhood/faust

Python Stream Processing

Language: Python - Size: 8.3 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 6,783 - Forks: 535

hazelcast/hazelcast

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

Language: Java - Size: 393 MB - Last synced at: about 4 hours ago - Pushed at: about 5 hours ago - Stars: 6,303 - Forks: 1,861

MaterializeInc/materialize

Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.

Language: Rust - Size: 256 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 5,954 - Forks: 470

apache/hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java - Size: 1.71 GB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5,740 - Forks: 2,393

online-ml/river

🌊 Online machine learning in Python

Language: Python - Size: 315 MB - Last synced at: 26 minutes ago - Pushed at: about 2 months ago - Stars: 5,294 - Forks: 568

javascriptdata/danfojs

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

Language: TypeScript - Size: 79 MB - Last synced at: 4 days ago - Pushed at: 13 days ago - Stars: 4,898 - Forks: 214

infinyon/fluvio

Lean and mean distributed stream processing system written in Rust and WebAssembly. Alternative to Kafka + Flink in one.

Language: Rust - Size: 34.2 MB - Last synced at: about 4 hours ago - Pushed at: about 12 hours ago - Stars: 4,613 - Forks: 511

ag2ai/faststream

FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

Language: Python - Size: 455 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3,778 - Forks: 219

ArroyoSystems/arroyo

Distributed stream processing engine in Rust

Language: Rust - Size: 13.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3,732 - Forks: 215

manuzhang/awesome-streaming

a curated list of awesome streaming frameworks, applications, etc

Size: 2.7 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 2,800 - Forks: 305

memgraph/memgraph

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

Language: C++ - Size: 42.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,755 - Forks: 144

douban/dpark 📦

Python clone of Spark, a MapReduce-like framework in Python

Language: Python - Size: 2.65 MB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 2,682 - Forks: 530

pipelinedb/pipelinedb

High-performance time-series aggregation for PostgreSQL

Language: C - Size: 46.7 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 2,643 - Forks: 243

PeerDB-io/peerdb

A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage

Language: Go - Size: 13.8 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 2,491 - Forks: 111

francoispqt/gojay

high performance JSON encoder/decoder with stream API for Golang

Language: Go - Size: 5.61 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 2,116 - Forks: 111

reugn/go-streams

A lightweight stream processing library for Go

Language: Go - Size: 575 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,020 - Forks: 164

numaproj/numaflow

Kubernetes-native platform to run massively parallel data/streaming jobs

Language: Go - Size: 39.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,862 - Forks: 130

timeplus-io/proton

High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful features like JOIN, CDC, UPSERT, and LOOKUP, enabling real-time analytics and ETL at scale.

Language: C++ - Size: 466 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,770 - Forks: 81

yomorun/yomo

🦖 Stateful Serverless Framework for Geo-distributed Edge AI Infra. with function calling support, write once, run on any model.

Language: Go - Size: 65.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,745 - Forks: 132

bytewax/bytewax

Python Stream Processing

Language: Python - Size: 12 MB - Last synced at: 2 days ago - Pushed at: 26 days ago - Stars: 1,711 - Forks: 78

nerevu/riko

A Python stream processing engine modeled after Yahoo! Pipes

Language: Python - Size: 2.58 MB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 1,603 - Forks: 76

siddhi-io/siddhi

Stream Processing and Complex Event Processing Engine

Language: Java - Size: 33.6 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1,541 - Forks: 530

WallarooLabs/wally

Distributed Stream Processing

Language: Pony - Size: 268 MB - Last synced at: 13 days ago - Pushed at: about 4 years ago - Stars: 1,479 - Forks: 68

quixio/quix-streams

Python Streaming DataFrames for Kafka

Language: Python - Size: 8.13 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,354 - Forks: 77

spring-cloud/spring-cloud-dataflow

Microservices-based streaming and batch data processing on Cloud Foundry and Kubernetes

Language: Java - Size: 68.3 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1,131 - Forks: 588

hazelcast/hazelcast-jet 📦

Distributed Stream and Batch Processing

Language: Java - Size: 171 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 1,101 - Forks: 207

halaxa/json-machine

Efficient, easy-to-use, and fast PHP JSON stream parser

Language: PHP - Size: 1.48 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1,089 - Forks: 65

uhop/stream-json

The micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API.

Language: JavaScript - Size: 899 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,041 - Forks: 47

clj-commons/manifold

A compatibility layer for event-driven abstractions

Language: Clojure - Size: 1.56 MB - Last synced at: about 9 hours ago - Pushed at: 11 months ago - Stars: 1,028 - Forks: 108

spring-cloud/spring-cloud-stream

Framework for building Event-Driven Microservices

Language: Java - Size: 18.1 MB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 1,019 - Forks: 621

edp963/wormhole

Wormhole is a SPaaS (Stream Processing as a Service) Platform

Language: JavaScript - Size: 45.5 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 976 - Forks: 482

ververica/flink-sql-cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

Language: Dockerfile - Size: 14.1 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 879 - Forks: 202

nodefluent/kafka-streams

Equivalent to kafka-streams 🐙 for Node.js ✨🐢🚀✨

Language: TypeScript - Size: 4.14 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 821 - Forks: 111

vladimirvivien/automi

A stream processing API for Go (alpha)

Language: Go - Size: 2.27 MB - Last synced at: 12 days ago - Pushed at: 24 days ago - Stars: 789 - Forks: 62

gearpump/gearpump

Lightweight real-time big data streaming engine over Akka

Language: Scala - Size: 26.2 MB - Last synced at: 3 days ago - Pushed at: about 3 years ago - Stars: 762 - Forks: 152

pipeless-ai/pipeless

An open-source computer vision framework to build and deploy apps in minutes

Language: Rust - Size: 142 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 749 - Forks: 38

gazette/core

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

Language: Go - Size: 4.81 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 746 - Forks: 53

lw-lin/streaming-readings

Papers and readings on streaming systems

Size: 6.84 KB - Last synced at: 18 days ago - Pushed at: about 3 years ago - Stars: 731 - Forks: 155

hstreamdb/hstream

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

Language: Haskell - Size: 6.28 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 722 - Forks: 55

TouK/nussknacker

Low-code tool for automating actions on real time data | Stream processing for the users.

Language: Scala - Size: 179 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 683 - Forks: 95

apache/streampipes

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

Language: Java - Size: 310 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 639 - Forks: 195

infoslack/awesome-kafka

A list about Apache Kafka

Size: 96.7 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 580 - Forks: 163

microsoft/psi

Platform for Situated Intelligence

Language: C# - Size: 37.2 MB - Last synced at: 9 days ago - Pushed at: 29 days ago - Stars: 552 - Forks: 101

leesf/hudi-resources

A collection of Apache Hudi resources

Size: 23.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 550 - Forks: 160

deep5050/radio-active

Play any radio station around the globe right from the terminal ⚡

Language: Python - Size: 553 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 526 - Forks: 25

constellation-rs/amadeus

Harmonious distributed data analysis in Rust.

Language: Rust - Size: 2.46 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 474 - Forks: 26

nmaquet/kasper

Kasper is a lightweight library for processing Kafka topics.

Language: Go - Size: 288 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 439 - Forks: 24

smooks/smooks

An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration

Language: Java - Size: 29.2 MB - Last synced at: 10 days ago - Pushed at: 15 days ago - Stars: 400 - Forks: 360

RedisGears/RedisGears

Dynamic execution framework for your Redis data

Language: Rust - Size: 4.76 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 373 - Forks: 66

AICoE/log-anomaly-detector 📦

Log Anomaly Detection - Machine learning to detect abnormal events logs

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 328 - Forks: 132

AxisCommunications/media-stream-library-js

JavaScript library to handle media streams on the command line (Node.js) and in the browser.

Language: TypeScript - Size: 872 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 311 - Forks: 104

vectorengine/vectorsql 📦

VectorSQL is a free analytics DBMS for IoT & Big Data, compatible with ClickHouse.

Language: Go - Size: 1.36 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 292 - Forks: 53

Chulong-Li/Real-time-Sentiment-Tracking-on-Twitter-for-Brand-Improvement-and-Trend-Recognition

A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL and PostgreSQL databases (deployed on Heroku)

Language: Jupyter Notebook - Size: 5.78 MB - Last synced at: 6 months ago - Pushed at: almost 5 years ago - Stars: 283 - Forks: 127

memgraph/mage

MAGE - Memgraph Advanced Graph Extensions 🔮

Language: C++ - Size: 21.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 279 - Forks: 29

streamnative/pulsar-flink 📦

Elastic data processing with Apache Pulsar and Apache Flink

Language: Java - Size: 2.16 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 279 - Forks: 119

raystack/dagger

Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.

Language: Java - Size: 12 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 273 - Forks: 41

Point72/csp

csp is a high performance reactive stream processing library, written in C++ and Python

Language: Python - Size: 4.72 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 270 - Forks: 45

FutureSDR/FutureSDR

An Async SDR Runtime for Heterogeneous Architectures

Language: Rust - Size: 25.1 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 262 - Forks: 46

bbejeck/kafka-streams-in-action

Source code for the Kafka Streams in Action Book

Language: Java - Size: 176 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 251 - Forks: 177

cross-platform/dspatch

The Refreshingly Simple Cross-Platform C++ Dataflow / Patching / Pipelining / Graph Processing / Stream Processing / Reactive Programming Framework

Language: C++ - Size: 6.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 236 - Forks: 43

Kurento/kurento

Kurento WebRTC Media Server

Language: C - Size: 131 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 222 - Forks: 49

talariadb/talaria

TalariaDB is a distributed, highly available, and low latency time-series database for Presto

Language: Go - Size: 12.8 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 219 - Forks: 31

streamnative/function-mesh

The serverless framework purpose-built for event streaming applications.

Language: Go - Size: 55.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 215 - Forks: 27

schollz/pluck

Pluck text in a fast and intuitive way 🐓

Language: Go - Size: 6.55 MB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 215 - Forks: 6

TomasMikula/libretto

Declarative concurrency and stream processing library for Scala

Language: Scala - Size: 15.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 210 - Forks: 7

logrange/logrange

High performance data aggregating storage

Language: Go - Size: 69.2 MB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 191 - Forks: 16

confluentinc/ksql

The database purpose-built for stream processing applications.

Language: Java - Size: 205 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 186 - Forks: 1,042

bytewatch/dolphinbeat

A server that pulls and parses the MySQL binlog and pushes change data into different sinks such as Kafka.

Language: Go - Size: 2.97 MB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 176 - Forks: 30

pemrouz/fero

light, fast, scalable, streaming microservices made easy

Language: JavaScript - Size: 1.05 MB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 174 - Forks: 10

neo4j-contrib/neo4j-streams

Neo4j Kafka Connector

Language: Kotlin - Size: 3.74 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 173 - Forks: 72

akka/akka-stream-contrib

Add-ons to Akka Stream

Language: Scala - Size: 315 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 172 - Forks: 77

Claviz/xlstream

Turns XLSX into a readable stream.

Language: TypeScript - Size: 709 KB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 171 - Forks: 17

miroslavpejic85/mirotalkbro

📡 MiroTalk WebRTC Live Broadcast lets you broadcast live video, audio, and screen streams to all connected users (viewers).

Language: JavaScript - Size: 1.37 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 169 - Forks: 43

vsouza/go-kafka-example

Golang Kafka consumer and producer example

Language: Go - Size: 505 KB - Last synced at: about 11 hours ago - Pushed at: over 1 year ago - Stars: 169 - Forks: 59

LearningJournal/Kafka-Streams-Real-time-Stream-Processing

This is the central repository for all materials related to the book "Kafka Streams: Real-time Stream Processing!" by Prashant Pandey.

Language: Java - Size: 5.25 MB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 164 - Forks: 163

GridProtectionAlliance/gsf

Grid Solutions Framework

Language: C# - Size: 248 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 163 - Forks: 70

mariomac/gostream

A Go port of the Java Streams API. Type-safe and functional Go Streams processing for Go 1.18+.

Language: Go - Size: 211 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 161 - Forks: 9

whitaker-io/machine

Machine is a workflow/pipeline library for processing data

Language: Go - Size: 1.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 157 - Forks: 12

synacker/daggy

Daggy - Data Aggregation Utility and C/C++ developer library for capturing data streams

Language: C++ - Size: 7.28 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 154 - Forks: 15

Jet-Engine/callysto

Stream processing & Service framework.

Language: Rust - Size: 3.67 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 147 - Forks: 8

ElAlev/Wayeb

Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.

Language: Scala - Size: 8.17 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 147 - Forks: 6

samsara/samsara

Samsara is a real-time analytics platform

Language: Clojure - Size: 22.8 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 147 - Forks: 17

zakariamaaraki/RemoteCodeCompiler

An online code compiler supporting 11 programming languages (Java, Kotlin, Scala, C, C++, C#, Golang, Python, Ruby, Rust and Haskell) for competitive programming and coding interviews.

Language: Java - Size: 3.34 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 135 - Forks: 47

GridProtectionAlliance/openPDC

Open Source Phasor Data Concentrator

Language: C# - Size: 2.47 GB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 134 - Forks: 56

mjwestcott/runnel

Distributed event processing for Python based on Redis Streams

Language: Python - Size: 6.58 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 133 - Forks: 11

runreveal/kawa

A blazingly fast event stream processing library powering the reveald event processing daemon.

Language: Go - Size: 1010 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 129 - Forks: 3

devshawn/kafka-shell

⚡A supercharged, interactive Kafka shell built on top of the existing Kafka CLI tools.

Language: Python - Size: 82 KB - Last synced at: 11 days ago - Pushed at: about 5 years ago - Stars: 126 - Forks: 11

ExpediaGroup/stream-registry

Stream Discovery and Stream Orchestration

Language: Java - Size: 5.52 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 122 - Forks: 54

xerial/streamdb-readings

Readings in Stream Processing

Size: 265 KB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 122 - Forks: 13