GitHub / mrugankray / Big-Data-Cluster
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mrugankray%2FBig-Data-Cluster
PURL: pkg:github/mrugankray/Big-Data-Cluster
Stars: 41
Forks: 15
Open issues: 0
License: mit
Language: Shell
Size: 118 KB
Dependencies parsed at: Pending
Created at: over 2 years ago
Updated at: about 1 year ago
Pushed at: over 2 years ago
Last synced at: about 1 year ago
Topics: airflow, cassandra, conda-environment, control-center, flume, hadoop, hdfs, hive, hue, kadmin, kafka, pgadmin4, postgresql, pyspark, python3, schema-registry, spark, sqoop, zeppelin