Topic: "bigquery"
hasura/graphql-engine
Blazing fast, instant realtime GraphQL APIs on all your data with fine-grained access control. Also triggers webhooks on database events.
Language: TypeScript - Size: 4.81 GB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 31,485 - Forks: 2,801

getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Language: Python - Size: 26.4 MB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 27,229 - Forks: 4,461

beekeeper-studio/beekeeper-studio
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
Language: TypeScript - Size: 90.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 18,531 - Forks: 1,199

cube-js/cube
📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics
Language: Rust - Size: 341 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 18,451 - Forks: 1,830

airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Language: Python - Size: 660 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 17,937 - Forks: 4,468

apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
Language: Java - Size: 1000 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 13,553 - Forks: 3,407

oceanbase/oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Language: C++ - Size: 646 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 9,061 - Forks: 1,722

Canner/WrenAI
🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, dashboards and BI. 📈📊📋🧑💻
Language: TypeScript - Size: 19.3 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 7,588 - Forks: 721

tobymao/sqlglot
Python SQL Parser and Transpiler
Language: Python - Size: 489 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,526 - Forks: 830

growthbook/growthbook
Open Source Feature Flagging and A/B Testing Platform
Language: TypeScript - Size: 108 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6,509 - Forks: 550

cloudquery/cloudquery
The developer-first cloud governance platform
Language: Go - Size: 171 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 6,080 - Forks: 527

ibis-project/ibis
the portable Python dataframe library
Language: Python - Size: 173 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 5,703 - Forks: 633

jitsucom/jitsu
Jitsu is an open-source Segment alternative. Fully scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days.
Language: TypeScript - Size: 42.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4,279 - Forks: 312

rudderlabs/rudder-server
Privacy and Security focused Segment-alternative, in Golang and React
Language: Go - Size: 308 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 4,172 - Forks: 329

HVF/franchise
🍟 a notebook sql client. what you get when you have a lot of sequels.
Language: JavaScript - Size: 1.96 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 4,011 - Forks: 262

briefercloud/briefer
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Language: TypeScript - Size: 65.1 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,979 - Forks: 248

k1LoW/tbls
tbls is a CI-Friendly tool to document a database, written in Go.
Language: Go - Size: 125 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,751 - Forks: 174

blockchain-etl/ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Language: Python - Size: 1.81 MB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 3,010 - Forks: 877

bruin-data/ingestr
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Language: Python - Size: 166 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,942 - Forks: 78

GoogleCloudPlatform/professional-services
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
Language: Python - Size: 390 MB - Last synced at: about 6 hours ago - Pushed at: 1 day ago - Stars: 2,890 - Forks: 1,359

swirlai/swirl-search
AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
Language: Python - Size: 214 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,745 - Forks: 252

spotify/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Language: Scala - Size: 77.8 MB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 2,589 - Forks: 517

PeerDB-io/peerdb
A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage
Language: Go - Size: 13.7 MB - Last synced at: about 9 hours ago - Pushed at: about 9 hours ago - Stars: 2,501 - Forks: 112

elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Language: HTML - Size: 205 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 2,050 - Forks: 183

EvgSkv/logica
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Language: Jupyter Notebook - Size: 6.56 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,960 - Forks: 102

Multiwoven/multiwoven
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
Language: Ruby - Size: 6.17 MB - Last synced at: 37 minutes ago - Pushed at: about 8 hours ago - Stars: 1,601 - Forks: 71

GoogleCloudPlatform/bigquery-utils
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
Language: Jupyter Notebook - Size: 28.8 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 1,202 - Forks: 303

GoogleCloudPlatform/DataflowTemplates
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Language: Java - Size: 23.9 MB - Last synced at: about 14 hours ago - Pushed at: about 14 hours ago - Stars: 1,198 - Forks: 1,003

scratchdata/scratchdata
Scratch is a swiss army knife for big data.
Language: Go - Size: 13.6 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 1,113 - Forks: 54

madnight/githut
Github Language Statistics
Language: JavaScript - Size: 38.4 MB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 991 - Forks: 130

goccy/bigquery-emulator
BigQuery emulator server implemented in Go
Language: Go - Size: 413 KB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 921 - Forks: 129

bruin-data/bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Language: Go - Size: 68.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 917 - Forks: 34

quarylabs/sqruff
Fast SQL formatter/linter
Language: Rust - Size: 7.37 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 785 - Forks: 30

raystack/optimus
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 12.2 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 748 - Forks: 155

unytics/bigfunctions
Supercharge BigQuery with BigFunctions
Language: Python - Size: 28.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 739 - Forks: 70

Canner/vulcan-sql
Data API Framework for AI Agents and Data Apps
Language: TypeScript - Size: 70.6 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 677 - Forks: 33

artie-labs/transfer
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Language: Go - Size: 3.85 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 648 - Forks: 32

dbt-checkpoint/dbt-checkpoint
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Language: Python - Size: 1.38 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 640 - Forks: 132

HTTPArchive/almanac.httparchive.org
HTTP Archive's annual "State of the Web" report made by the web community
Language: HTML - Size: 396 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 639 - Forks: 187

ploomber/jupysql (fork of catherinedevlin/ipython-sql)
Better SQL in Jupyter. 📊
Language: Python - Size: 12.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 588 - Forks: 70

synmetrix/synmetrix
Synmetrix – production-ready open source semantic layer on Cube
Language: JavaScript - Size: 4.38 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 546 - Forks: 31

r-dbi/bigrquery
An interface to Google's BigQuery from R.
Language: R - Size: 7.63 MB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 521 - Forks: 185

googleapis/nodejs-bigquery
Node.js client for Google Cloud BigQuery: A fast, economical and fully-managed enterprise data warehouse for large-scale data analytics.
Language: TypeScript - Size: 8.02 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 482 - Forks: 214

googleapis/python-bigquery-pandas
Google BigQuery connector for pandas
Language: Python - Size: 1.58 MB - Last synced at: 7 days ago - Pushed at: 22 days ago - Stars: 466 - Forks: 125

tylertreat/BigQuery-Python
Simple Python client for interacting with Google BigQuery.
Language: Python - Size: 1.13 MB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 457 - Forks: 176

HariSekhon/SQL-scripts
100+ SQL Scripts - PostgreSQL, MySQL, Oracle, Google BigQuery, MariaDB, AWS Athena. DBA, Analytics, DevOps, performance engineering. Google BigQuery ML machine learning classification.
Language: Shell - Size: 620 KB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 437 - Forks: 120

ofek/pypinfo
Easily view PyPI download statistics via Google's BigQuery.
Language: Python - Size: 199 KB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 430 - Forks: 33

basedosdados/sdk
⚙️ Maintenance code for the data lake (metadata and access packages) | 📖 Docs: https://basedosdados.github.io/sdk/
Language: SQL - Size: 35.2 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 403 - Forks: 86

GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Language: Java - Size: 6.07 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 393 - Forks: 203

astronomer/astro-sdk
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Language: Python - Size: 7.54 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 369 - Forks: 48

tellery/tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Language: TypeScript - Size: 6.18 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 355 - Forks: 26

spotify/ratatool
A tool for data sampling, data generation, and data diffing
Language: Scala - Size: 1.27 MB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 342 - Forks: 54

GoogleCloudPlatform/security-analytics
Community Security Analytics provides a set of community-driven audit & threat queries for Google Cloud
Language: Python - Size: 965 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 338 - Forks: 70

raystack/firehose
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Language: Java - Size: 15.2 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 336 - Forks: 62

machine-learning-apps/Issue-Label-Bot 📦
Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
Language: SCSS - Size: 10.3 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 326 - Forks: 84

mprove-io/mprove
Open Source Self-service Business Intelligence with Version Control 🎉
Language: TypeScript - Size: 24.9 MB - Last synced at: 19 days ago - Pushed at: about 2 years ago - Stars: 323 - Forks: 26

data-drift/data-drift
Metrics Observability & Troubleshooting
Language: HTML - Size: 11.7 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 320 - Forks: 11

scale8/scale8-tag-manager-and-analytics 📦
Website analytics, JavaScript error tracking + analytics, tag manager, data ingest endpoint creation (tracking pixels). GDPR + CCPA compliant.
Language: TypeScript - Size: 3.92 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 301 - Forks: 17

GoogleCloudDataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Language: Java - Size: 11.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 285 - Forks: 249

yoshidan/google-cloud-rust
Google Cloud Client Libraries for Rust.
Language: Rust - Size: 2.41 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 276 - Forks: 112

wix-incubator/quix
Quix Notebook Manager
Language: TypeScript - Size: 11.4 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 273 - Forks: 36

datacoves/dbt-coves
CLI tool for dbt users to simplify creation of staging models (yml and sql) files
Language: Python - Size: 2.41 MB - Last synced at: 11 days ago - Pushed at: 18 days ago - Stars: 261 - Forks: 16

tuva-health/tuva
Main repo including core data model, data marts, data quality tests, and terminology sets.
Language: Shell - Size: 43.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 249 - Forks: 83

lynnlangit/gcp-essentials
Sample code and notes for my GCP courses on LinkedIn Learning
Language: Jupyter Notebook - Size: 245 MB - Last synced at: about 22 hours ago - Pushed at: 21 days ago - Stars: 248 - Forks: 175

GoogleCloudPlatform/data-analytics-golden-demo
An end to end demo of Google's Cloud data and analytic stack.
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 243 - Forks: 78

bxparks/bigquery-schema-generator
Generates the BigQuery schema from newline-delimited JSON or CSV data records.
Language: Python - Size: 5.52 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 243 - Forks: 50

doitintl/bigquery-grafana 📦
Google BigQuery Datasource Plugin for Grafana. (NO LONGER MAINTAINED)
Language: TypeScript - Size: 280 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 242 - Forks: 78

googleapis/python-bigquery-dataframes
BigQuery DataFrames
Language: Python - Size: 17.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 239 - Forks: 48

cuebook/CueObserve
Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases
Language: Python - Size: 6.15 MB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 229 - Forks: 24

digitalghost-dev/premier-league 📦
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
Language: Python - Size: 487 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 224 - Forks: 17

GoogleCloudPlatform/fraudfinder
Fraudfinder: A comprehensive lab series on how to build a real-time fraud detection system on Google Cloud
Language: Jupyter Notebook - Size: 13 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 220 - Forks: 84

thinkingmachines/geomancer 📦
Automated feature engineering for geospatial data
Language: Python - Size: 1.17 MB - Last synced at: 30 days ago - Pushed at: about 4 years ago - Stars: 216 - Forks: 16

xnuinside/simple-ddl-parser
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
Language: Python - Size: 2.04 MB - Last synced at: 19 days ago - Pushed at: 7 months ago - Stars: 195 - Forks: 42

lots-of-things/gpt2-bert-reddit-bot
a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models
Language: Jupyter Notebook - Size: 137 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 193 - Forks: 28

CartoDB/analytics-toolbox-core
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
Language: JavaScript - Size: 12.6 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 192 - Forks: 44

omnata-labs/dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Language: Python - Size: 1.65 MB - Last synced at: 25 days ago - Pushed at: almost 2 years ago - Stars: 183 - Forks: 17

GoogleCloudPlatform/cortex-data-foundation
Data Foundation - Google Cloud Cortex Framework
Language: Python - Size: 34.8 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 182 - Forks: 103

mara/mara-example-project-2
An example mini data warehouse for python project stats, template for new projects
Language: Python - Size: 24 MB - Last synced at: 17 days ago - Pushed at: almost 5 years ago - Stars: 178 - Forks: 39

spotify/magnolify
A collection of Magnolia add-on modules
Language: Scala - Size: 5.33 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 173 - Forks: 26

unytics/airbyte_serverless
Airbyte made simple (no UI, no database, no cluster)
Language: Python - Size: 3.43 MB - Last synced at: about 18 hours ago - Pushed at: 15 days ago - Stars: 171 - Forks: 13

google/starthinker 📦
Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline: "The framework for professionals with deadlines."
Language: Python - Size: 30.7 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 170 - Forks: 50

GoogleCloudPlatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
Language: Python - Size: 6.64 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 160 - Forks: 70

ScalefreeCOM/datavault4dbt
Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
Language: PLSQL - Size: 31.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 152 - Forks: 29

google/vscode-bigquery 📦
A Visual Studio Code plugin for running BigQuery queries.
Language: TypeScript - Size: 207 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 149 - Forks: 22

sambacha/dune-snippets
dune snippets is a collection of sql queries for duneanalytics.com / Google BigQuery
Language: PLpgSQL - Size: 23.9 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 148 - Forks: 20

gojekfarm/beast 📦
[Deprecated] Load data from Kafka to any data warehouse. BQ sink is being supported in Firehose now. https://github.com/odpf/firehose
Language: Java - Size: 628 KB - Last synced at: 11 months ago - Pushed at: about 3 years ago - Stars: 147 - Forks: 23

GoogleCloudPlatform/bigquery-data-lineage 📦
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Language: Java - Size: 356 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 143 - Forks: 41

googlegenomics/gcp-variant-transforms
GCP Variant Transforms
Language: Python - Size: 20.5 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 139 - Forks: 55

google/megalista 📦
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products (Google Ads, Campaign Manager, Google Analytics).
Language: Python - Size: 1.34 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 137 - Forks: 55

kikinteractive/go-bqstreamer 📦
Stream data into Google BigQuery concurrently using InsertAll()
Language: Go - Size: 97.7 KB - Last synced at: 11 months ago - Pushed at: over 7 years ago - Stars: 133 - Forks: 19

GoogleCloudPlatform/dataproc-templates
Dataproc templates and pipelines for solving in-cloud data tasks
Language: Python - Size: 18.6 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 127 - Forks: 97

embulk/embulk-output-bigquery
Embulk output plugin to load/insert data into Google BigQuery
Language: Ruby - Size: 518 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 126 - Forks: 60

GoogleCloudPlatform/bigquery-geo-viz
Visualize Google BigQuery geospatial data using Google Maps Platform APIs
Language: TypeScript - Size: 3.95 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 118 - Forks: 45

allegro/bigflow
A Python framework for data processing on GCP.
Language: Python - Size: 107 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 117 - Forks: 23

datainsider-co/rocket-bi
A free, open-source, web-based self-service BI tailor-made for ClickHouse, Google BigQuery, MySQL, PostgreSQL, and Vertica
Language: TypeScript - Size: 69.5 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 107 - Forks: 30

cata-network/cata_database
CATA.Search. Blockchain database, cata metadata query
Language: Java - Size: 57.6 KB - Last synced at: 16 days ago - Pushed at: over 3 years ago - Stars: 107 - Forks: 0

blockchain-etl/polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Language: Python - Size: 1.46 MB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 100 - Forks: 76

mikeghen/airflow-tutorial
Use Airflow to move data from multiple MySQL databases to BigQuery
Language: PLpgSQL - Size: 879 KB - Last synced at: 18 days ago - Pushed at: almost 5 years ago - Stars: 100 - Forks: 21

autotraderuk/dbt-dry-run
Dry run capability for dbt projects using BigQuery
Language: Python - Size: 852 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 96 - Forks: 13

GoogleCloudPlatform/dlp-dataflow-deidentification
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
Language: Java - Size: 47.5 MB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 92 - Forks: 51
