An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: trino

zncdatadev/trino-operator

Operator for Trino, the distributed SQL query engine for big data

Language: Go - Size: 699 KB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 4 - Forks: 7

tobymao/sqlglot

Python SQL Parser and Transpiler

Language: Python - Size: 489 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 7,526 - Forks: 830

ibis-project/ibis

the portable Python dataframe library

Language: Python - Size: 173 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 5,698 - Forks: 631

snowlift/trino-storage

Storage connector for Trino

Language: Java - Size: 2.67 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 109 - Forks: 34

wvlet/wvlet

A flow-style query language for SQL engines

Language: Scala - Size: 17.4 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 131 - Forks: 8

DNAstack/data-connect-trino

Cloned from https://github.com/DNAstack/ga4gh-search-adapter-presto

Language: Java - Size: 3.24 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 3

ac-gomes/spark-iceberg-hive

Language: Jupyter Notebook - Size: 639 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

trinodb/trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language: Java - Size: 270 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11,137 - Forks: 3,170

kestra-io/plugin-jdbc

Language: Java - Size: 2.4 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 16 - Forks: 16

SonicEXEDVP/real-time-data-pipeline

📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.

Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

tuannvm/mcp-trino

A high-performance Model Context Protocol (MCP) server for Trino implemented in Go.

Language: Go - Size: 112 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 7 - Forks: 0

davidkhala/database

the databases index

Language: PowerShell - Size: 98.7 MB - Last synced at: 1 day ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

datafold/data-diff 📦

Compare tables within or across databases

Language: Python - Size: 3.98 MB - Last synced at: about 17 hours ago - Pushed at: 11 months ago - Stars: 2,965 - Forks: 278

wgzhao/Addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

Language: Java - Size: 43.9 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 1,257 - Forks: 309

trinodb/trino-js-client

TypeScript client library for Trino

Language: TypeScript - Size: 47 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 38 - Forks: 19

devlive-community/datacap

DataCap is integrated software for data transformation, integration, and visualization. Support a variety of data sources, file types, big data related database, relational database, NoSQL database, etc. Through the software can realize the management of multiple data sources, the data under the source of various operations conversion ...

Language: Java - Size: 114 MB - Last synced at: 7 days ago - Pushed at: 27 days ago - Stars: 933 - Forks: 102

regadas/trino-pubsub-event-listener

Trino Google Pub/Sub event listener

Language: Java - Size: 427 KB - Last synced at: 8 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 6

DTStack/dt-sql-parser

SQL Parsers for BigData, built with antlr4.

Language: TypeScript - Size: 45.8 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 336 - Forks: 101

EvgSkv/logica

Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.

Language: Jupyter Notebook - Size: 6.56 MB - Last synced at: 11 days ago - Pushed at: 29 days ago - Stars: 1,958 - Forks: 102

zsvoboda/ngods

New generation opensource data stack

Language: Dockerfile - Size: 1.62 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 66 - Forks: 9

CybercentreCanada/trino Fork of trinodb/trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language: Java - Size: 265 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 1

Turakulov/datalakehouse

A project to create a Data Lake House

Language: Python - Size: 151 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

wAVeckx/trino-jtopen

JTOpen IBMi Db2 Trino Connector

Language: Java - Size: 254 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 9 - Forks: 4

zsvoboda/ngods-stocks

New Generation Opensource Data Stack Demo

Language: Jupyter Notebook - Size: 22.1 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 429 - Forks: 100

ebyhr/puffin-tools

Language: Java - Size: 62.5 KB - Last synced at: 9 days ago - Pushed at: 20 days ago - Stars: 9 - Forks: 0

aerospike/trino-aerospike.docker

Trino with the Aeropsike connector Docker Image

Language: Shell - Size: 28.3 KB - Last synced at: 1 day ago - Pushed at: 20 days ago - Stars: 5 - Forks: 2

trannhatnguyen2/NYC_Taxi_Data_Pipeline

Nyc_Taxi_Data_Pipeline - DE Project

Language: Python - Size: 6.58 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 104 - Forks: 21

regadas/sqltools-trino-driver

SQLTools driver for Trino

Language: TypeScript - Size: 1.74 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9 - Forks: 7

starburstdata/dbt-trino

The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)

Language: Python - Size: 721 KB - Last synced at: 5 days ago - Pushed at: 25 days ago - Stars: 232 - Forks: 61

bitsondatadev/trino-getting-started

Language: Python - Size: 40.2 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 261 - Forks: 101

criccomini/hive-metastore-standalone

Apache Hive Metastore in Standalone Mode With Docker

Language: Dockerfile - Size: 15.6 KB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 11 - Forks: 3

abeltavares/real-time-data-pipeline

📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.

Language: Python - Size: 1010 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 41 - Forks: 3

wix-incubator/quix

Quix Notebook Manager

Language: TypeScript - Size: 11.4 MB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 273 - Forks: 36

stackabletech/trino-lb

Trino load balancer with support for routing, queueing and auto-scaling

Language: Rust - Size: 2.02 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 26 - Forks: 8

dain55788/End-To-End-Streaming-Big-Data

End-To-End Streaming Big Data Project makes processing big data easy.

Language: Python - Size: 12.7 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

1ambda/lakehouse

Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)

Language: Kotlin - Size: 3.28 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 9

yuhexiong/doris-to-trino-data-pipeline-spark-python

Language: Python - Size: 1000 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alberttwong/databasecomparison

Comparison of batch and real time OLAP databases

Size: 78.1 KB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 17 - Forks: 0

zookage/zookage

Hadoop on Kubernetes on Docker Desktop.

Language: Shell - Size: 247 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 6

PeiFengBin/trino-db2

Db2 JDBC connector for Trino

Language: Java - Size: 16.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

nooberfsh/prusto

A presto/trino client library written in rust.

Language: Rust - Size: 167 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 41 - Forks: 24

CybercentreCanada/jupyterlab-sql-editor

A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino

Language: Jupyter Notebook - Size: 90.5 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 86 - Forks: 14

wgzhao/trino-event-logger

Trino 查询日志保存插件,用于保存所有Trino的查询语句以及相关信息

Language: Java - Size: 29.3 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 2

os-climate/osc-ingest-tools

python tools to assist with standardized data ingestion workflows

Language: Python - Size: 328 KB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 7 - Forks: 10

sutrolabs/iceberg-fyi

Iceberg FYI: Verified test results for 100+ combinations of tools in the Apache Iceberg ecosystem

Language: Python - Size: 319 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yanagishima/yanagishima

Web UI for Trino, Hive and SparkSQL

Language: Java - Size: 44.9 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 632 - Forks: 199

starburstdata/metabase-driver

Starburst Metabase driver

Language: Clojure - Size: 127 KB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 65 - Forks: 11

Pirate-Emperor/BigData-Pipeline

BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.

Language: Dockerfile - Size: 7.95 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

runprism/prism

Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.

Language: Python - Size: 2.42 MB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 84 - Forks: 2

tcd93/invoice-data-pipeline

A sample data pipeline for transforming invoice images and CSV files into beautiful numbers

Language: Shell - Size: 10.3 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

josephmachado/analytical_dp_with_sql

Code for my "Efficient Data Processing in SQL" book.

Language: Python - Size: 398 KB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 56 - Forks: 17

eneserdogan/trino

Trino: Master your translations with command line!

Language: JavaScript - Size: 60.5 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 137 - Forks: 12

akarce/RedditDataPipeline

Data Engineering with Reddit Api, Airflow, Hive, Postgres, MinIO, Nifi, Trino, Tableau and Superset

Language: Python - Size: 137 MB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 3

immuta/trino-artifacts

Immuta Trino Plugin Releases

Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

BlazejNowicki/datalake

On-premise data lake architecture with Trino, Delta Tables and Hive Metastore

Language: Dockerfile - Size: 1.11 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rogeriomm/labtools-k8s

Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,Airflow, Kafka Strimzi, Datahub, OpenMetadata,Zeppelin, Jupyter, JFrog Container Registry

Language: Shell - Size: 12.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 27 - Forks: 5

Ren294/SmartTraffic_Lakehouse_for_HCMC

A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.

Language: Python - Size: 163 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

VuBacktracking/stream-data-processing

Streaming data processing pipeline using Spark, PostgreSQL, Debezium, Kafka, Minio, Delta Lake, Trino and DBeaver

Language: Python - Size: 1.71 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yaooqinn/itachi

A library that brings useful functions from various modern database management systems to Apache Spark

Language: Scala - Size: 167 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 6

1ambda/lakehouse-cdc

Playground for Lakehouse CDC (Flink, Iceberg, Kafka and Debezium)

Language: Java - Size: 138 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Jsarde/iceberg-lakehouse

Apache Iceberg Lakehouse using MinIO, Trino and Nessie.

Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gmrqs/lasagna

A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

Language: Jupyter Notebook - Size: 11.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 36 - Forks: 11

nabilseid/athenaSQL

SQL builder for AWS Athena, inspired by sparkSQL

Language: Python - Size: 202 KB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

tomkat-cr/data_lakehouse_local_stack

Data Lakehouse local stack with PySpark, Trino, and Minio. Includes an example to process Raygun error data and the IP address occurrence.

Language: Python - Size: 1.37 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

sudohainguyen/mini-lakehouse

Data lakehouse at home with docker compose

Language: Jupyter Notebook - Size: 531 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

apache/pulsar-sql

Pulsar SQL extracted from apache/pulsar

Language: Java - Size: 1.3 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

apostolis1/trino-study

A study of the Trino distributed execution engine using the TPCDS benchmark

Language: Python - Size: 137 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

aakashnand/trino-ranger-demo

Tutorial on how to setup Trino and Apache Ranger using docker

Language: Shell - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 41 - Forks: 23

JinsYin/awesome-datalake

📚 Awesome list for Data Lake

Size: 11.7 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

alex1on/Information-Systems-NTUA

Distributed execution of SQL queries over Trino

Language: Python - Size: 7.09 MB - Last synced at: 28 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

kentarokamiyajp/crypto-prediction-infra

Devops for DWH which is for Crypto data analysis (hadoop, hive, spark, kafka, cassandra, trino, etc.)

Language: Dockerfile - Size: 165 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

OKDP/charts

Collection of OKDP helm charts

Language: Smarty - Size: 259 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 1

dbsystel/datalake-graphql-wrapper

The DataLake GraphQL Wrapper provides a GraphQL API for presto/trino.

Language: TypeScript - Size: 294 KB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 0

Tom-Fynes/sql-101

A beginners guide to SQL

Language: TSQL - Size: 237 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

permitta/permitta

Permitta - RABAC for Trino and OPA

Language: HTML - Size: 11.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

naushadh/hive-metastore

Apache Hive Metastore as a Standalone server in Docker

Language: Python - Size: 30.3 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 62 - Forks: 25

aliavni/docker

Language: Jupyter Notebook - Size: 195 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yuokada/presto-query-formatter Fork of kokosing/trino-query-formatter

Presto SQL query formatter

Language: Java - Size: 229 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yuokada/presto-cli-rpm

Language: Shell - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

mchien15/datascience

Soccer Players Data Analyst and Similar Players Finder

Language: Jupyter Notebook - Size: 44.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

kentarokamiyajp/crypto-prediction-etl

DataWareHouse system for crypto data analysis using hive, cassandra, trino, kafka, spark, etc.

Language: Python - Size: 518 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

1ambda/query-gateway

Gateway for Query Engines

Language: Java - Size: 77.1 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

tooptoop4/awesome-prestosql

A list of Presto/Trino resources

Size: 188 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 3

delta-io/connectors 📦

This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.

Language: Java - Size: 4.87 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 383 - Forks: 159

nyu-rbda-2023-fall/user-data

ETL for yelp public dataset

Language: Java - Size: 3.67 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

devlive-community/dbm

Full platform database management tool, supports ClickHouse, Presto, Trino, MySQL, PostgreSQL, Apache Druid, ElasticSearch...

Language: TypeScript - Size: 14.8 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 387 - Forks: 49

keyhong/datalake-playground

Playground for DataLake (Hadoop, Hive, Kudu, Trino, Hue, Airflow, DBT)

Language: Dockerfile - Size: 6.15 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

dungdm93/sqlalchemy-trino 📦

Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.

Language: Python - Size: 1.33 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 16

lepetitprinz/wikipedia-pageview-analysis

Wikipedia Pageview Analysis

Language: Python - Size: 2.89 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Quocc1/OpenStack

An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.

Language: Jupyter Notebook - Size: 6.97 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

rchukh/trino-history-server

History Server for Trino

Language: TypeScript - Size: 285 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

vmware/versatile-data-kit

One framework to develop, deploy and operate data workflows with Python and SQL.

Language: Python - Size: 109 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 409 - Forks: 54

ploomber/jupysql Fork of catherinedevlin/ipython-sql

Better SQL in Jupyter. 📊

Language: Python - Size: 12.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 588 - Forks: 70

wseaton/starproxy

HTTP Proxy based solution for real-time interception and prioritization of SQL queries.

Language: Rust - Size: 41 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

IvanWoo/trino-on-kubernetes

Language: Shell - Size: 58.6 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 1

rchukh/trino-querylog

Trino plugin for logging query events into a separate log file.

Language: Java - Size: 40 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 37 - Forks: 35

duhanmin/bigdata-sql-parser Fork of 0xqq/bigdata-sql-parser

数据血缘,支持spark sql,hive sql,pg sql,presto sql,mysql sql,tidb sql, flink sql, datax血缘,spark/flink jar 运行命令的血缘解析;支持with语法

Language: Java - Size: 1.95 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 3

WesleyJw/modern-data-stack

Creating a modern data stack in Kubernetes with open-source products, both on-premises and cloud-agnostic, is an increasingly popular approach. By leveraging Kubernetes for container orchestration, you can deploy and manage data processing, storage, and analysis tools more efficiently.

Language: Python - Size: 28 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

grihabor/trino-query-parser

Provides a parser for trino queries

Language: ANTLR - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

borgettas/trino

Language: Makefile - Size: 12.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0