An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: streaming-data

kafbat/kafka-ui

Open-Source Web UI for managing Apache Kafka clusters

Language: Java - Size: 34.9 MB - Last synced at: about 22 hours ago - Pushed at: 5 days ago - Stars: 1,181 - Forks: 148

GridProtectionAlliance/openPDC

Open Source Phasor Data Concentrator

Language: C# - Size: 2.48 GB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 138 - Forks: 57

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: about 10 hours ago - Pushed at: 9 days ago - Stars: 9,329 - Forks: 224

MinaEssam16/stream-to-river

Streams to River is a microservice system designed for effective English learning, utilizing the Hertz and Kitex frameworks. This project integrates features like real-time chat and speech recognition to enhance the learning experience. 🛠️🌊

Language: Go - Size: 5.93 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

readysettech/readyset

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.

Language: Rust - Size: 229 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4,998 - Forks: 143

GridProtectionAlliance/gsf

Grid Solutions Framework

Language: C# - Size: 249 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 165 - Forks: 70

goodboy/tractor

A distributed, structured concurrency runtime for Python (and friends)

Language: Python - Size: 2.99 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 287 - Forks: 11

bytewax/bytewax

Python Stream Processing

Language: Python - Size: 12 MB - Last synced at: about 1 hour ago - Pushed at: 3 months ago - Stars: 1,765 - Forks: 82

jsa-aerial/aerobio

Extensible full DAG streaming computation server with services and jobs for RNA-Seq, Tn-Seq, WG-Seq and Term-Seq.

Language: Clojure - Size: 1.04 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 1

certeu/morio

Connect - Stream - Observe - Respond | Morio provides the plumbing for your observability needs

Language: JavaScript - Size: 27.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 27 - Forks: 3

memgraph/memgraph

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

Language: C++ - Size: 43.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,999 - Forks: 164

luisKING2008/Stream-Omni

Stream-Omni enables seamless interactions across text, vision, and speech using a large language model. This repository includes the model, datasets, and tools for developers to explore multimodal capabilities. 🌟🌐

Language: Python - Size: 10.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

SamarthKulkarni20/keyed-batched-items-accumulator

A lightweight utility for Node.js projects that accumulates items into fixed-size batches per key, preserving insertion order within each key. Streams items directly into their respective batches at runtime, eliminating the overhead of post-processing 1D arrays into chunks. It abstracts key-based batch management.

Language: TypeScript - Size: 52.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

redpanda-data/docs

Open source content for the Redpanda documentation

Language: Gherkin - Size: 28 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 8 - Forks: 45

redpanda-data/cloud-docs

Redpanda Cloud documentation

Language: JavaScript - Size: 8.47 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 4

MaterializeInc/materialize

Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.

Language: Rust - Size: 260 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6,024 - Forks: 472

Correia-jpv/fucking-awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴

Size: 655 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 1

infoslack/awesome-kafka

A list about Apache Kafka

Size: 96.7 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 579 - Forks: 164

Sinotrade/Shioaji

Shioaji, an all-new cross-platform API for securities trading

Language: Dockerfile - Size: 67.4 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 189 - Forks: 15

aramisfacchinetti/streaming-json-parser

Streaming JSON parser designed to process JSON data incrementally. The primary goal is to handle potentially incomplete JSON data streams, such as those produced by Large Language Models (LLMs), and return the current state of the parsed object at any time.

Language: Python - Size: 27.3 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 2

infinyon/fluvio

🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.

Language: Rust - Size: 34.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4,937 - Forks: 517

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

Size: 843 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 13,660 - Forks: 2,574

symongpt/twitter-analysis-app

This repository hosts a web application that analyzes tweets to recommend movies and songs based on public sentiment. Users can easily access real-time insights without scrolling through endless Twitter feeds. 🐙🌟

Language: Python - Size: 276 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

ParaGroup/WindFlow

A C++17 Data Stream Processing Parallel Library for Multicores and GPUs

Language: C++ - Size: 48.9 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 84 - Forks: 19

piskvorky/smart_open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Language: Python - Size: 1.58 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 3,325 - Forks: 384

redpanda-data/connect

Fancy stream processing made operationally mundane

Language: Go - Size: 35 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8,378 - Forks: 877

karafka/karafka-web

Web UI for monitoring and managing Karafka consumers

Language: Ruby - Size: 7.47 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 55 - Forks: 7

DoneDeal0/superdiff

Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, and is super fast.

Language: TypeScript - Size: 471 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 918 - Forks: 8

gyselle-marques/ScreenMatch-CommandLineRunner

TV series streaming from the command line.

Language: Java - Size: 18.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

python-streamz/streamz

Real-time stream processing for python

Language: Python - Size: 2.86 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 1,266 - Forks: 150

quixio/quix-streams

Python Streaming DataFrames for Kafka

Language: Python - Size: 8.51 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,388 - Forks: 80

joshday/OnlineStats.jl

⚡ Single-pass algorithms for statistics

Language: Julia - Size: 91.7 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 867 - Forks: 64

kLabUM/rrcf

🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams

Language: Python - Size: 4.45 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 509 - Forks: 113

AndGeo69/StreamingCotiles

A streaming implementation of COTILES algorithm using Apache Spark's Structured Streaming API

Language: Python - Size: 2.74 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

maraisr/meros

🪢 A fast utility that makes reading multipart responses simple

Language: TypeScript - Size: 590 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 186 - Forks: 11

online-ml/river

🌊 Online machine learning in Python

Language: Python - Size: 317 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5,373 - Forks: 578

tealtools/awesome-apache-pulsar

A curated list of resources about Apache Pulsar.

Size: 367 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 30 - Forks: 3

scikit-multiflow/scikit-multiflow

A machine learning package for streaming data in Python. The other ancestor of River.

Language: Python - Size: 60.9 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 778 - Forks: 188

Seddryck/Streamistry

Streamistry is a lightweight library designed to support pipeline, streaming, and ETL development for data engineering and integration. Its versatility makes it an excellent tool for building robust, scalable data workflows and optimizing data processing tasks. With features such as accumulators, windows, and sinks, it efficiently handles streaming

Language: C# - Size: 2.24 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

zpl-c/zpl

📐 Pushing the boundaries of simplicity

Language: C - Size: 3.99 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 1,026 - Forks: 48

ekrich/exip

Efficient XML Interchange (EXI) Embeddable C API

Language: C - Size: 5.4 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 7 - Forks: 0

saidsef/aws-kinesis-local

AWS Kinesis local for building applications with streaming data

Language: Dockerfile - Size: 120 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 21 - Forks: 4

selimfirat/pysad

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

Language: Python - Size: 438 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 262 - Forks: 25

maki-nage/rxsci

ReactiveX for data science

Language: Python - Size: 409 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 14 - Forks: 2

NathanP23/Big-Data-Mining-52002

Midterm and Final assignments of the course "Big Data Mining (52002)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science. Focuses on analyzing massive datasets using Python, SQL, cloud computing, and network analysis. Includes project guidelines for scalable data mining techniques and distributed computing.

Language: Jupyter Notebook - Size: 21 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

ori88c/keyed-batched-items-accumulator

A lightweight utility for Node.js projects that accumulates items into fixed-size batches per key, preserving insertion order within each key. Streams items directly into their respective batches at runtime, eliminating the overhead of post-processing 1D arrays into chunks. It abstracts key-based batch management.

Language: TypeScript - Size: 93.8 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

ori88c/batched-items-accumulator

A lightweight utility for Node.js projects that accumulates items into fixed-size batches (number-of-items wise), preserving insertion order. It abstracts batch management, allowing users to focus on application logic. Ideal for delayed processing tasks such as bulk write/publish operations to kafka, databases, blob storage, etc.

Language: TypeScript - Size: 50.8 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

guillermo-navas-palencia/optbinning

Optimal binning: monotonic binning with constraints. Supports batch and stream optimal binning, scorecard modelling, and counterfactual explanations.

Language: Python - Size: 10.4 MB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 476 - Forks: 108

microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

Language: C# - Size: 10.2 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 1,249 - Forks: 131

davidgao7/research_assistant

your personal research assistant!

Language: Python - Size: 1.05 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

HimanshuMohanty-Git24/StreamLineIRCTC

A real-time data pipeline simulating IRCTC bookings using GCP. It streams mock data via Pub/Sub, transforms it with Dataflow (Python UDF), stores results in BigQuery, and powers live dashboards. Includes error handling and schema validation.

Language: Python - Size: 10.7 KB - Last synced at: 15 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

streamdal/streamdal

Code-Native Data Privacy

Language: TypeScript - Size: 292 MB - Last synced at: 25 days ago - Pushed at: 7 months ago - Stars: 602 - Forks: 15

tinybirdco/mockingbird

Mockingbird is a mock streaming data generator

Language: TypeScript - Size: 2.57 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 120 - Forks: 18

systemaccounting/mxfactorial

a payment application intended for deployment by the united states treasury that replaces banking with accounting

Language: Rust - Size: 6.83 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 53 - Forks: 26

pathwaycom/pathway-benchmarks

Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams

Language: Python - Size: 4.7 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 70 - Forks: 4

slimandslam/schwab-client-js

A modern wrapper around the Schwab financial API for NodeJS Typescript and Javascript projects. Join our Discord: https://discord.gg/Q9z8EnB8xD

Language: TypeScript - Size: 2.59 MB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 10 - Forks: 7

quantfinlib/screamer

Screamingly fast streaming indicators with C++ performance and Python simplicity.

Language: C++ - Size: 22.4 MB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

silverton-io/buz

Serverless multi-protocol + multi-destination event collection system.

Language: Go - Size: 30.1 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 205 - Forks: 26

Lucchh/eilof

Efficient Incremental Local Outlier Factor (EILOF) for Online Anomaly Detection

Language: Jupyter Notebook - Size: 8.12 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

stensoosaar/IBKit

Interactive Brokers TWS/Gateway API in Swift

Language: Swift - Size: 1.62 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 20 - Forks: 9

swimos/swim

Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs

Language: Java - Size: 22.8 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 493 - Forks: 41

provectus/kafka-ui

Open-Source Web UI for Apache Kafka Management

Language: Java - Size: 29 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 10,773 - Forks: 1,272

EgorSborschikov/conferences_api

Video conferencing API using REST and WebSocket

Language: Python - Size: 254 KB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

reugn/go-streams

A lightweight stream processing library for Go

Language: Go - Size: 598 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,036 - Forks: 164

joshday/OnlineStatsBase.jl

Base types for OnlineStats.

Language: Julia - Size: 310 KB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 32 - Forks: 14

ylem-co/ylem

Ylem is an open-source platform for real-time data streaming orchestration

Language: JavaScript - Size: 5.87 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 0

Stratio/sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

Language: Scala - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 526 - Forks: 196

hstreamdb/hstream

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

Language: Haskell - Size: 6.28 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 722 - Forks: 55

thammo4/uvatradier

Python wrapper for the Tradier brokerage API

Language: Python - Size: 827 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 23 - Forks: 14

pravega/pravega

Pravega - Streaming as a new software defined storage primitive

Language: Java - Size: 47 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1,998 - Forks: 409

whitaker-io/machine

Machine is a workflow/pipeline library for processing data

Language: Go - Size: 1.41 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 162 - Forks: 12

igopalakrishna/nyc-subway-foot-traffic-prediction-and-forecasting

Designed and implemented a scalable real-time analytics pipeline using Apache Kafka, Spark Structured Streaming, and MongoDB to simulate NYC MTA turnstile data and forecast real-time subway foot traffic using SparkML Random Forest models.

Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

Menziess/slipstream-async

Slipstream provides a data-flow model to simplify development of stateful streaming applications.

Language: Python - Size: 628 KB - Last synced at: 18 days ago - Pushed at: 2 months ago - Stars: 36 - Forks: 0

SaiRanjithReddyK/aws-retail-realtime-analytics

A real-time data pipeline built with AWS Kinesis, Lambda, S3, Athena, and QuickSight, simulating and analyzing 70+ retail transactions. This project demonstrates how to stream, store, query, and visualize transactional data using fully managed AWS services, along with Python for simulation and Lambda logic, and SQL for analytics in Athena.

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

zelros/cinnamon 📦

CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system

Language: Python - Size: 2.02 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 7

icicle-lang/icicle

Icicle Streaming Query Language

Language: Haskell - Size: 14.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 3

exajobs/data-engineering-collection

A collection of awesome software, libraries, Learning Tutorials, documents, books, resources and interesting stuff about Big Data Science & Engineering

Size: 241 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

7wilightxdev/market-stream

My personal playground in Flutter. MarketStream is a real-time market data visualization app that delivers a minimalist, high-performance UI for tracking market prices, with a custom candlestick chart. Every pixel and every interaction had to be manually calculated and optimized.

Language: Dart - Size: 974 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 6 - Forks: 2

sonhmai/data-systems-design

System Design, Solution Architecture, Data Systems Practice

Size: 24.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 45 - Forks: 11

maki-nage/makinage

Stream Processing Made Easy

Language: Python - Size: 227 KB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 41 - Forks: 1

wso2/product-streaming-integrator

A stream processing runtime that allows connecting any streaming data source to any destination and acting on it

Language: Python - Size: 152 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 113 - Forks: 52

embetrix/bmap-writer

bmaptool alternative written in C++

Language: C++ - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 12 - Forks: 3

microsoft/data-accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Language: C# - Size: 401 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 302 - Forks: 90

iresil/FowlFlightForensics

A Kafka-based CSV parser for bird-related airplane accidents

Language: Java - Size: 5.23 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

MikeJaredS/hermiter

Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Nonparametric Correlation (Bivariate)

Language: R - Size: 8.82 MB - Last synced at: 11 minutes ago - Pushed at: 10 months ago - Stars: 16 - Forks: 3

SAFZZ/real-time-product-activity-pyspark

Real-time Product Activity Tracker using Apache Kafka, PySpark, and PostgreSQL | Built for high-throughput streaming analytics

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

bakansm/ViTHSD

Vietnamese Hate Speech Detection with real-time data from streaming platforms such as YouTube, Facebook, and TikTok.

Language: Jupyter Notebook - Size: 946 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

microsoft/FabricRTIWorkshop

How to build a Medallion design pattern using Fabric Real-Time Intelligence

Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 12 - Forks: 14

redislabs-training/ru202

This is the repository for the Redis Streams learning path on Redis University.

Language: HTML - Size: 1000 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 37 - Forks: 31

keithknott26/datadash

Visualize and graph data in the terminal

Language: Go - Size: 96.8 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 14

fajarnugraha37/turborepo-nestjs

Fullstack multiple service application using turborepo typescript, nestjs, nextjs, prisma, mongodb and rabbitmq.

Language: TypeScript - Size: 579 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

evadne/packmatic

Zipping on the fly — Generate downloadable Zip streams by aggregating File or URL Sources

Language: Elixir - Size: 154 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 160 - Forks: 15

grahman20/ADF

Adaptive Decision Forest (ADF) is an incremental machine learning framework that produces a decision forest to classify new records. ADF can classify new records even if they are associated with previously unseen classes. It can also identify and handle concept drift without forgetting previously gained knowledge. Moreover, ADF can handle big data if the data can be divided into batches.

Language: Java - Size: 1.63 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

vertica/PSTL

Parallel Streaming Transformation Loader

Language: Java - Size: 106 MB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 9 - Forks: 6

ParaGroup/StreamBenchmarks

Suite of Benchmark Applications for Stream Processing Systems

Language: C++ - Size: 44.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 4

Nasruddin/realtime-streaming-kafka-flink-pinot-postgres-superset

Setup for realtime data streaming using Kafka, Flink, Pinot, MySQL, Postgres and Superset

Language: Python - Size: 4.67 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

anhtt2211/social-api

Language: TypeScript - Size: 46.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

antonmry/galiglobal

Blog about Apache Kafka, Apache Flink and Java

Language: HTML - Size: 48.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

seznam/euphoria

Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.

Language: Java - Size: 3.9 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 81 - Forks: 11

AubainMbk/RealTime-BikeStation-Tracker

This project is a way to demonstrate my data collection skills using APIs. The goal is to build a web app that provides access to real-time information on bike stations.

Language: Jupyter Notebook - Size: 1.45 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0