An open API service providing repository metadata for many open source software ecosystems.

Topic: "google-cloud-dataflow"

GoogleCloudPlatform/professional-services

Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.

Language: Python - Size: 390 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 2,890 - Forks: 1,359

GoogleCloudPlatform/DataflowTemplates

Cloud Dataflow Google-provided templates for solving in-Cloud data tasks

Language: Java - Size: 23.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,198 - Forks: 1,003

GoogleCloudPlatform/DataflowJavaSDK 📦

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Size: 12.9 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 856 - Forks: 320

Fematich/mlengine-boilerplate

Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):

Language: Python - Size: 54.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 63 - Forks: 18

snowplow-archive/google-cloud-dataflow-example-project 📦

Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow

Language: Scala - Size: 481 KB - Last synced at: 9 days ago - Pushed at: about 8 years ago - Stars: 30 - Forks: 10

asaharland/beam-pipeline-examples

Apache Beam examples for running on Google Cloud Dataflow.

Language: Java - Size: 59.5 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 14

jeremylorino/gcp-dataprep-bigquery-twitter-stream

Stream Twitter Data into BigQuery with Cloud Dataprep

Language: JavaScript - Size: 1.53 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 7

RajeshHegde/apache-beam-example

Apache Beam example project

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 14

topgate/retail-demo

Google Cloud Dataflow Demo Application. デモ用アプリのため更新(依存関係の更新・脆弱性対応)は行っていません。参考にされる方はご注意ください。

Language: Java - Size: 548 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 12 - Forks: 2

google/exposure-notifications-private-analytics-ingestion

This repository contains implementation to process private data shares collected according to the Exposure Notification Private Analytics protocol. It assumes private data shares uploaded as done in the Exposure Notification Express template app. These documents contain encrypted packets using the Prio protocol. The pipeline implementation converts them into the format that downstream Prio data processing servers expect.

Language: Java - Size: 13.9 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 10 - Forks: 11

sanderploegsma/beam-scheduling-kubernetes

Scheduled Dataflow pipelines using Kubernetes Cronjobs

Language: Kotlin - Size: 59.6 KB - Last synced at: about 11 hours ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 2

spotify/limbo 📦

Language: Scala - Size: 136 KB - Last synced at: 6 days ago - Pushed at: over 8 years ago - Stars: 6 - Forks: 2

JonnyDaenen/ZUNA

Cloud native system to decommission Google Cloud resources when they aren't needed anymore.

Language: Python - Size: 16.6 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 0

vicenteg/dataflow-example

Language: Java - Size: 250 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 6

sb2nov/beam Fork of apache/beam

Mirror of Apache Beam

Language: Java - Size: 40.9 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

mponce/google-cloud-dataflow-pipeline

Google Cloud DataFlow - Load CSV Files to BigQuery Tables

Language: Java - Size: 32.2 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

swapnil3597/dataflow-tfrecord

This repository is a reference to build Custom ETL Pipeline for creating TF-Records using Apache Beam Python SDK on Google Cloud Dataflow

Language: Python - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

goatcheesesaladwithpeanutoildressing/beam-amazon-batch-example

A practical example of batch processing on Google Cloud Dataflow using the Go SDK for Apache Beam :fire:

Language: Go - Size: 455 KB - Last synced at: 5 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

ryanmcdowell/dataflow-pubsub-event-router

An example pipeline which re-publishes events to different topics based a message attribute.

Language: Java - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

viveknaskar/google-dataflow-redis-example Fork of arun-james/dataflow-example

Cloud dataflow pipeline code that processes data from a cloud storage bucket, transforms it and stores in Google's highly scalable, reduced latency in-memory database, memorystore which is an implementation of Redis.

Language: Java - Size: 80.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

GoogleCloudPlatform/dataflow-metrics-exporter

CLI tool to collect dataflow resource & execution metrics and export to either BigQuery or Google Cloud Storage. Tool will be useful to compare & visualize the metrics while benchmarking the dataflow pipelines using various data formats, resource configurations etc

Language: Java - Size: 65.4 KB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 2 - Forks: 3

viveknaskar/triggering-dataflow-pipeline-function

Google Cloud function to trigger cloud-dataflow pipeline when a file is uploaded into a cloud storage bucket

Language: JavaScript - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

rm3l/apache-beam-java-firestore-batch-dataflow 📦

Companion Repo for blog post : https://rm3l.org/batch-writes-to-google-cloud-firestore-using-the-apache-beam-java-sdk-on-google-cloud-dataflow/

Language: Java - Size: 101 KB - Last synced at: 25 days ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

JonnyDaenen/dissi-bq

Distributed schema inference and data loader for BigQuery written in Apache Beam

Language: Python - Size: 88.9 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

ryanmcdowell/dataflow-bigquery-dynamic-destinations

An example pipeline for dynamically routing events from Pub/Sub to different BigQuery tables based on a message attribute.

Language: Java - Size: 14.6 KB - Last synced at: 24 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 5

sinmetal/pug2pug

Cloud Dataflowを使って、Cloud DatastoreのMigrationを行う

Language: Java - Size: 21.5 KB - Last synced at: 8 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

EmediongFrancis/Enhancing-Data-Quality-and-Consistency-GCP-Kafka-Airflow-Snowflake

This project focuses on maintaining data quality and consistency across different data sources. This project features Google Cloud Dataflow for data cataloging, Apache Airflow for ETL, Google Cloud Data Catalog for visual data preparation, and Snowflake for high-quality data storage and analysis.

Language: HCL - Size: 641 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

theterminalguy/beamer

Language: Go - Size: 23.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

CuriousDima/dataflowclass1 Fork of joegagliardo/dataflowclass1

Size: 5.51 MB - Last synced at: 28 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

goatcheesesaladwithpeanutoildressing/hands-on-apache-beam

Work In Progress - Une explication simple de qu'est-ce que c'est que le traitement par lots (batch) et le traitement par flux (stream) avec Apache Beam et Cloud Dataflow.

Size: 44.9 KB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

kurikei/gcp-playground

Language: Java - Size: 119 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0