GitHub topics: trino
zncdatadev/trino-operator
Operator for Trino, the distributed SQL query engine for big data
Language: Go - Size: 699 KB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 4 - Forks: 7

tobymao/sqlglot
Python SQL Parser and Transpiler
Language: Python - Size: 489 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 7,526 - Forks: 830

ibis-project/ibis
the portable Python dataframe library
Language: Python - Size: 173 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 5,698 - Forks: 631

snowlift/trino-storage
Storage connector for Trino
Language: Java - Size: 2.67 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 109 - Forks: 34

wvlet/wvlet
A flow-style query language for SQL engines
Language: Scala - Size: 17.4 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 131 - Forks: 8

DNAstack/data-connect-trino
Cloned from https://github.com/DNAstack/ga4gh-search-adapter-presto
Language: Java - Size: 3.24 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 3

ac-gomes/spark-iceberg-hive
Language: Jupyter Notebook - Size: 639 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Language: Java - Size: 270 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11,137 - Forks: 3,170

kestra-io/plugin-jdbc
Language: Java - Size: 2.4 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 16 - Forks: 16

SonicEXEDVP/real-time-data-pipeline
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

tuannvm/mcp-trino
A high-performance Model Context Protocol (MCP) server for Trino implemented in Go.
Language: Go - Size: 112 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 0

davidkhala/database
the databases index
Language: PowerShell - Size: 98.7 MB - Last synced at: 1 day ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

datafold/data-diff 📦
Compare tables within or across databases
Language: Python - Size: 3.98 MB - Last synced at: about 17 hours ago - Pushed at: 11 months ago - Stars: 2,965 - Forks: 278

wgzhao/Addax
A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly
Language: Java - Size: 43.9 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 1,257 - Forks: 309

trinodb/trino-js-client
TypeScript client library for Trino
Language: TypeScript - Size: 47 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 38 - Forks: 19

devlive-community/datacap
DataCap is integrated software for data transformation, integration, and visualization. It supports a variety of data sources, file types, big data databases, relational databases, NoSQL databases, etc. The software can manage multiple data sources and perform various operations and conversions on the data under each source ...
Language: Java - Size: 114 MB - Last synced at: 7 days ago - Pushed at: 27 days ago - Stars: 933 - Forks: 102

regadas/trino-pubsub-event-listener
Trino Google Pub/Sub event listener
Language: Java - Size: 427 KB - Last synced at: 8 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 6

DTStack/dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Language: TypeScript - Size: 45.8 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 336 - Forks: 101

EvgSkv/logica
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Language: Jupyter Notebook - Size: 6.56 MB - Last synced at: 11 days ago - Pushed at: 29 days ago - Stars: 1,958 - Forks: 102

zsvoboda/ngods
New generation opensource data stack
Language: Dockerfile - Size: 1.62 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 66 - Forks: 9

CybercentreCanada/trino Fork of trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Language: Java - Size: 265 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 1

Turakulov/datalakehouse
A project to create a Data Lake House
Language: Python - Size: 151 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

wAVeckx/trino-jtopen
JTOpen IBMi Db2 Trino Connector
Language: Java - Size: 254 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 9 - Forks: 4

zsvoboda/ngods-stocks
New Generation Opensource Data Stack Demo
Language: Jupyter Notebook - Size: 22.1 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 429 - Forks: 100

ebyhr/puffin-tools
Language: Java - Size: 62.5 KB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 9 - Forks: 0

aerospike/trino-aerospike.docker
Trino with the Aerospike connector Docker Image
Language: Shell - Size: 28.3 KB - Last synced at: 1 day ago - Pushed at: 20 days ago - Stars: 5 - Forks: 2

trannhatnguyen2/NYC_Taxi_Data_Pipeline
Nyc_Taxi_Data_Pipeline - DE Project
Language: Python - Size: 6.58 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 104 - Forks: 21

regadas/sqltools-trino-driver
SQLTools driver for Trino
Language: TypeScript - Size: 1.74 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9 - Forks: 7

starburstdata/dbt-trino
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
Language: Python - Size: 721 KB - Last synced at: 5 days ago - Pushed at: 25 days ago - Stars: 232 - Forks: 61

bitsondatadev/trino-getting-started
Language: Python - Size: 40.2 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 261 - Forks: 101

criccomini/hive-metastore-standalone
Apache Hive Metastore in Standalone Mode With Docker
Language: Dockerfile - Size: 15.6 KB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 11 - Forks: 3

abeltavares/real-time-data-pipeline
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
Language: Python - Size: 1010 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 41 - Forks: 3

wix-incubator/quix
Quix Notebook Manager
Language: TypeScript - Size: 11.4 MB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 273 - Forks: 36

stackabletech/trino-lb
Trino load balancer with support for routing, queueing and auto-scaling
Language: Rust - Size: 2.02 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 26 - Forks: 8

dain55788/End-To-End-Streaming-Big-Data
End-To-End Streaming Big Data Project makes processing big data easy.
Language: Python - Size: 12.7 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

1ambda/lakehouse
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
Language: Kotlin - Size: 3.28 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 9

yuhexiong/doris-to-trino-data-pipeline-spark-python
Language: Python - Size: 1000 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alberttwong/databasecomparison
Comparison of batch and real time OLAP databases
Size: 78.1 KB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 17 - Forks: 0

zookage/zookage
Hadoop on Kubernetes on Docker Desktop.
Language: Shell - Size: 247 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 6

PeiFengBin/trino-db2
Db2 JDBC connector for Trino
Language: Java - Size: 16.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

nooberfsh/prusto
A presto/trino client library written in rust.
Language: Rust - Size: 167 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 41 - Forks: 24

CybercentreCanada/jupyterlab-sql-editor
A JupyterLab extension providing a SQL formatter, auto-completion, syntax highlighting, and support for Spark SQL and Trino
Language: Jupyter Notebook - Size: 90.5 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 86 - Forks: 14

wgzhao/trino-event-logger
Trino query log plugin that saves all Trino query statements and related information
Language: Java - Size: 29.3 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 2

os-climate/osc-ingest-tools
python tools to assist with standardized data ingestion workflows
Language: Python - Size: 328 KB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 7 - Forks: 10

sutrolabs/iceberg-fyi
Iceberg FYI: Verified test results for 100+ combinations of tools in the Apache Iceberg ecosystem
Language: Python - Size: 319 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yanagishima/yanagishima
Web UI for Trino, Hive and SparkSQL
Language: Java - Size: 44.9 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 632 - Forks: 199

starburstdata/metabase-driver
Starburst Metabase driver
Language: Clojure - Size: 127 KB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 65 - Forks: 11

Pirate-Emperor/BigData-Pipeline
BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.
Language: Dockerfile - Size: 7.95 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
Language: Python - Size: 2.42 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 84 - Forks: 2

tcd93/invoice-data-pipeline
A sample data pipeline for transforming invoice images and CSV files into beautiful numbers
Language: Shell - Size: 10.3 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

josephmachado/analytical_dp_with_sql
Code for my "Efficient Data Processing in SQL" book.
Language: Python - Size: 398 KB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 56 - Forks: 17

eneserdogan/trino
Trino: Master your translations with command line!
Language: JavaScript - Size: 60.5 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 137 - Forks: 12

akarce/RedditDataPipeline
Data Engineering with Reddit Api, Airflow, Hive, Postgres, MinIO, Nifi, Trino, Tableau and Superset
Language: Python - Size: 137 MB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 3

immuta/trino-artifacts
Immuta Trino Plugin Releases
Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

BlazejNowicki/datalake
On-premise data lake architecture with Trino, Delta Tables and Hive Metastore
Language: Dockerfile - Size: 1.11 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rogeriomm/labtools-k8s
Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta Lake, Postgres + Debezium CDC, MySQL, Airflow, Kafka Strimzi, Datahub, OpenMetadata, Zeppelin, Jupyter, JFrog Container Registry
Language: Shell - Size: 12.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 27 - Forks: 5

Ren294/SmartTraffic_Lakehouse_for_HCMC
A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.
Language: Python - Size: 163 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

VuBacktracking/stream-data-processing
Streaming data processing pipeline using Spark, PostgreSQL, Debezium, Kafka, Minio, Delta Lake, Trino and DBeaver
Language: Python - Size: 1.71 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yaooqinn/itachi
A library that brings useful functions from various modern database management systems to Apache Spark
Language: Scala - Size: 167 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 6

1ambda/lakehouse-cdc
Playground for Lakehouse CDC (Flink, Iceberg, Kafka and Debezium)
Language: Java - Size: 138 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Jsarde/iceberg-lakehouse
Apache Iceberg Lakehouse using MinIO, Trino and Nessie.
Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gmrqs/lasagna
A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Language: Jupyter Notebook - Size: 11.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 36 - Forks: 11

nabilseid/athenaSQL
SQL builder for AWS Athena, inspired by sparkSQL
Language: Python - Size: 202 KB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

tomkat-cr/data_lakehouse_local_stack
Data Lakehouse local stack with PySpark, Trino, and Minio. Includes an example to process Raygun error data and the IP address occurrence.
Language: Python - Size: 1.37 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

sudohainguyen/mini-lakehouse
Data lakehouse at home with docker compose
Language: Jupyter Notebook - Size: 531 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

apache/pulsar-sql
Pulsar SQL extracted from apache/pulsar
Language: Java - Size: 1.3 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

apostolis1/trino-study
A study of the Trino distributed execution engine using the TPCDS benchmark
Language: Python - Size: 137 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

aakashnand/trino-ranger-demo
Tutorial on how to set up Trino and Apache Ranger using Docker
Language: Shell - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 41 - Forks: 23

JinsYin/awesome-datalake
📚 Awesome list for Data Lake
Size: 11.7 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

alex1on/Information-Systems-NTUA
Distributed execution of SQL queries over Trino
Language: Python - Size: 7.09 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

kentarokamiyajp/crypto-prediction-infra
Devops for DWH which is for Crypto data analysis (hadoop, hive, spark, kafka, cassandra, trino, etc.)
Language: Dockerfile - Size: 165 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

OKDP/charts
Collection of OKDP helm charts
Language: Smarty - Size: 259 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 1

dbsystel/datalake-graphql-wrapper
The DataLake GraphQL Wrapper provides a GraphQL API for presto/trino.
Language: TypeScript - Size: 294 KB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 0

Tom-Fynes/sql-101
A beginner's guide to SQL
Language: TSQL - Size: 237 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

permitta/permitta
Permitta - RABAC for Trino and OPA
Language: HTML - Size: 11.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

naushadh/hive-metastore
Apache Hive Metastore as a Standalone server in Docker
Language: Python - Size: 30.3 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 62 - Forks: 25

aliavni/docker
Language: Jupyter Notebook - Size: 195 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yuokada/presto-query-formatter Fork of kokosing/trino-query-formatter
Presto SQL query formatter
Language: Java - Size: 229 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yuokada/presto-cli-rpm
Language: Shell - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

mchien15/datascience
Soccer Players Data Analyst and Similar Players Finder
Language: Jupyter Notebook - Size: 44.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

kentarokamiyajp/crypto-prediction-etl
DataWareHouse system for crypto data analysis using hive, cassandra, trino, kafka, spark, etc.
Language: Python - Size: 518 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

1ambda/query-gateway
Gateway for Query Engines
Language: Java - Size: 77.1 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

tooptoop4/awesome-prestosql
A list of Presto/Trino resources
Size: 188 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 3

delta-io/connectors 📦
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Language: Java - Size: 4.87 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 383 - Forks: 159

nyu-rbda-2023-fall/user-data
ETL for yelp public dataset
Language: Java - Size: 3.67 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

devlive-community/dbm
Full platform database management tool, supports ClickHouse, Presto, Trino, MySQL, PostgreSQL, Apache Druid, ElasticSearch...
Language: TypeScript - Size: 14.8 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 387 - Forks: 49

keyhong/datalake-playground
Playground for DataLake (Hadoop, Hive, Kudu, Trino, Hue, Airflow, DBT)
Language: Dockerfile - Size: 6.15 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

dungdm93/sqlalchemy-trino 📦
Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.
Language: Python - Size: 1.33 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 16

lepetitprinz/wikipedia-pageview-analysis
Wikipedia Pageview Analysis
Language: Python - Size: 2.89 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Quocc1/OpenStack
An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.
Language: Jupyter Notebook - Size: 6.97 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

rchukh/trino-history-server
History Server for Trino
Language: TypeScript - Size: 285 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Language: Python - Size: 109 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 409 - Forks: 54

ploomber/jupysql Fork of catherinedevlin/ipython-sql
Better SQL in Jupyter. 📊
Language: Python - Size: 12.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 588 - Forks: 70

wseaton/starproxy
HTTP Proxy based solution for real-time interception and prioritization of SQL queries.
Language: Rust - Size: 41 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

IvanWoo/trino-on-kubernetes
Language: Shell - Size: 58.6 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 1

rchukh/trino-querylog
Trino plugin for logging query events into a separate log file.
Language: Java - Size: 40 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 37 - Forks: 35

duhanmin/bigdata-sql-parser Fork of 0xqq/bigdata-sql-parser
Data lineage: supports Spark SQL, Hive SQL, PG SQL, Presto SQL, MySQL SQL, TiDB SQL, Flink SQL, and DataX lineage, plus lineage parsing of spark/flink jar run commands; supports the WITH syntax
Language: Java - Size: 1.95 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 3

WesleyJw/modern-data-stack
Creating a modern data stack in Kubernetes with open-source products, both on-premises and cloud-agnostic, is an increasingly popular approach. By leveraging Kubernetes for container orchestration, you can deploy and manage data processing, storage, and analysis tools more efficiently.
Language: Python - Size: 28 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

grihabor/trino-query-parser
Provides a parser for trino queries
Language: ANTLR - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

borgettas/trino
Language: Makefile - Size: 12.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
