Topic: "hadoop-cluster"
big-data-europe/docker-hadoop
Apache Hadoop docker image
Language: Shell - Size: 109 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 2,261 - Forks: 1,360

groda/big_data
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Language: Jupyter Notebook - Size: 51.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 78 - Forks: 27

Impetus/jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Language: Java - Size: 31.7 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 70 - Forks: 32

Segence/docker-hadoop
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Language: Shell - Size: 46.9 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 61 - Forks: 40

sergevs/ansible-cloudera-hadoop
ansible playbook to deploy cloudera hadoop components to the cluster
Language: Shell - Size: 6.3 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 52 - Forks: 41

Wittline/apache-spark-docker
Dockerizing an Apache Spark Standalone Cluster
Language: VBA - Size: 63.7 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 43 - Forks: 27

rainmaple/WIFI_BussinessBigDataAnalyseSystem
A system designed to analyse big data collected from Wi-Fi probes
Language: JavaScript - Size: 98.6 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 31 - Forks: 18

hokstack/hok-helm
HokStack - Run Hadoop Stack on Kubernetes
Language: Shell - Size: 3.88 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 22 - Forks: 6

hadoop-sandbox/hadoop-sandbox
A fully-functional Hadoop Yarn cluster as docker-compose deployment.
Language: Shell - Size: 103 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

mikeroyal/Apache-Ignite-Guide
Apache Ignite Guide
Size: 162 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 4

hyeonsangjeon/dataplatform
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
Language: Shell - Size: 549 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 1

waltherg/distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Language: Shell - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

manuparra/MasterDegreeCC_Practice
Workshop for the UGR Professional Master's in Computer Science. Cloud Computing course.
Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 3

lyingbo/hadoop-cluster-docker
Run Hadoop Cluster within Docker Containers
Language: Shell - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 3

pfisterer/apache-knox-docker
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
Language: Dockerfile - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 7

hadoop-sandbox/hadoop-sandbox-images
Docker image builds for Hadoop sandbox.
Language: Dockerfile - Size: 64.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 4

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS
In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.
Language: Java - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

chriskery/hadoop-operator
Kubernetes operator for managing the lifecycle of Apache Hadoop Yarn Tasks on Kubernetes.
Language: Go - Size: 3.06 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

PacktPublishing/Big-Data-Processing-with-Hadoop---A-Complete-Reference-Guide
Design, build, and execute effective big data strategies with advanced Hadoop concepts
Language: Java - Size: 65.4 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 5

MengmSun/hadoop-in-docker
Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.
Language: Shell - Size: 6.13 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

malabz/HAlign-2
a multiple sequence alignment tool
Language: HTML - Size: 1.6 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 4

tugrulhkarabulut/hadoop-movie-rating-prediction
Movie rating prediction application
Language: CSS - Size: 3.46 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

mitre/clusterconf
Manage Hadoop cluster configurations
Language: R - Size: 67.4 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 2

peyaa/bigdata-platform-on-k8s
deploy bigdata platform on kubernetes
Language: Shell - Size: 1.9 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 4

AnalyticsApps/LogAnalyzer
Analyses the customer logs for bigdata components like HDFS, Hive, HBase, Yarn, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, Nifi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas & Zookeeper.
Language: Shell - Size: 1.59 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 1

roboxue/YarnVision
UI for Hadoop Resource Manager
Language: Vue - Size: 54.7 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

vitobellini/bigdata-cluster
BigData Cluster with Docker
Language: Shell - Size: 63.5 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 0

mitre/webhdfs
Interface with WebHDFS Service in a Cluster-Neutral Way
Language: R - Size: 124 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

sloopstash/kickstart-hadoop
The ultimate aim of this Hadoop starter-kit Git repository is to help you deploy and manage Hadoop ecosystem components on AWS cloud using Docker, Kubernetes, and Chef.
Language: Ruby - Size: 150 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 7

conema/spark-terraform
This project creates a Hadoop and Spark cluster on Amazon AWS with Terraform
Language: Shell - Size: 30.3 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 4

vietanh85/hadoop-docker
Apache Hadoop Cluster Docker images
Language: Shell - Size: 103 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

anchal-agrawal/hadoop-ubuntu
Everything to deploy a Hadoop cluster on Ubuntu
Language: Shell - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: about 9 years ago - Stars: 3 - Forks: 0

jazzwang/haduzilla
Automated Installation CD for Hadoop Cluster
Language: Shell - Size: 336 KB - Last synced at: about 2 years ago - Pushed at: over 11 years ago - Stars: 3 - Forks: 1

elaaatif/JPEG-and-JPEG2000-compression-on-Multi-node-cluster-using-hadoop-and-spark
Big Data technologies can be leveraged for efficient, distributed image compression using JPEG2000 (Spark) and JPEG (MapReduce).
Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

wecharyu/hadoop-docker
Docker images for apache hadoop cluster
Language: Dockerfile - Size: 190 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

ictu-research/docker
Apache Hadoop + Apache Solr
Language: Shell - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

mikeroyal/Apache-Hadoop-Guide
Apache Hadoop Guide
Size: 141 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

Crone1/Pig-and-Hive-MovieLens-Analysis
This repository contains analysis work I did on the MovieLens dataset using the big data tools Pig and Hive alongside the Hadoop infrastructure
Language: PigLatin - Size: 22.5 KB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

sdabhi23/hadoop-k8s 📦
Image to run hadoop pseudo distributed cluster
Language: Shell - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

alex-ber/docker-hive Fork of ops-guru/docker-hive
EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5
Language: Shell - Size: 45.9 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

pkeropen/hadoop-docker-lite
Hadoop cluster, lite edition (hadoop & kafka & storm & hbase & phoenix & pig & zookeeper & flume)
Language: Dockerfile - Size: 74.8 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

codito/hadoop-expt
Experiments with Hadoop cluster setups in Docker
Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

huangyueranbbc/hadoop05_pagerank
pagerank hadoop
Language: Java - Size: 39.5 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

jicahoo/chaten
Flume Hive ElasticSearch
Language: Shell - Size: 115 KB - Last synced at: 19 days ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

Rohit9314/my-hadoop
Set up a Hadoop cluster manually and automatically
Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

BigDataRepublic/hdp-installation-notes
Knowledge base for Hortonworks Ambari Installations
Size: 5.92 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 1

gathecageorge/hadoop
Contains docker files to build a hadoop container image
Language: Shell - Size: 51.8 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

aimanamri/raspberry-pi4-hadoop-spark-cluster
This is a self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark and Raspbian OS. In this project, a 3-node cluster is set up using Raspberry Pi 4 boards, HDFS is installed, and Spark processing jobs are run via YARN.
Language: Shell - Size: 5.21 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Ohtar10/cloud-labs
This is a set of docker/ansible/terraform/aws recipes for different kinds of deployments, e.g., Jupyter, Zeppelin, EMR, Big Data Infrastructure, etc.
Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

seunggihong/hadoop-install-guide
📕 Guide to installing Hadoop and Spark on an Oracle virtual machine.
Language: Shell - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

jaynamm/docker-hadoop-cluster
Hadoop Cluster For Docker (-ing)
Language: Dockerfile - Size: 8.79 KB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

akshayavb99/Ansible-Examples
The repository contains all the Playbooks and other files used to work with different applications for Ansible
Language: Python - Size: 309 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 3

aquib-sh/setup-hadoop
A Bash script to set up Apache Hadoop and Apache Hive with Derby database on Debian GNU/Linux
Language: Shell - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AndreiFAD/raspberry_pi_cluster
I built my own mini analysis lab to practice and learn something new: an environment that I can use remotely with any device (with RealVNC) and, of course, an install script to rebuild it quickly anytime. I hope it will be useful for others who want to make their own.
Language: Shell - Size: 132 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

GGolfz/hadoop-script
Shell script for setting up Hadoop 3.3.1 on Ubuntu
Language: Shell - Size: 14.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

leibniz21c/one-click-hadoop Fork of ddps-lab/ubuntu-hadoop
One click hadoop test environment builder.
Language: Dockerfile - Size: 83 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

akaliutau/hadoop-cluster
Batch data processing on the dockerized Hadoop cluster
Language: Shell - Size: 93.8 KB - Last synced at: 4 days ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

sriw-world/ansible_ws
Ansible workspace
Language: HTML - Size: 42 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

fixcer/bigdata
bigdata
Language: Jupyter Notebook - Size: 178 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

humanbeeng/hadoop-auto-install
A small helper script that can save your valuable time during installation of Apache Hadoop.
Language: Shell - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Jomat18/Search-Engine-Hadoop-MapReduce
Search engine using Inverted Index, PageRank, and Hadoop Streaming MapReduce for Python, with a Flask backend and a JavaScript, jQuery, CSS, and HTML frontend, deployed on a cluster in Google Cloud
Language: Python - Size: 80.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 2

gitosamakhan/Multinode-Ambari-Cluster
Deploying a multinode ambari cluster on Linux (CentOS7) (Documentation)
Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

anshul1004/MutualFriends
Implementation of Hadoop and Spark
Language: Java - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Vaibhav-Mehta-19/ansible-python-hadoop-automated
automating the creation of a hadoop cluster using an ansible playbook, python-cgi and an html web page.
Language: Python - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

HemersonRafael/ect Fork of heltonmaia/ECT
This repository contains general information about projects under development in the School of Science and Technology (ECT) at the Federal University of Rio Grande do Norte (UFRN).
Language: C++ - Size: 60.9 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Vaibhav-Mehta-19/hadoop-setup
This file involves the steps to set up a hadoop cluster. This has been verified and tried in Redhat Linux version 7.
Language: Python - Size: 542 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

AsmaZgo/distribution_and_scripts
A repository for some scripts that can help in creating a distributed Big data ecosystem using the platform Grid5000.
Language: Shell - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

younthu/docker-hadoop Fork of big-data-europe/docker-hadoop
Apache Hadoop docker image cluster
Language: Shell - Size: 759 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

elek/flokkr-runtime-nomad 📦
Examples to run Hadoop/Spark cluster with Hashicorp nomad and consul.
Language: Shell - Size: 38.1 KB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

manoharpalanisamy/Distributed-Keras
Research And Development on Distributed Keras with Spark
Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

chrisPiemonte/docker-hadoop-cluster
Hadoop cluster with docker-compose
Language: Shell - Size: 1.66 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

chih7/hadoop_install
hadoop cluster installer
Language: Shell - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

yjham2002/Hadoop_Clustering
📖 Apache Hadoop Based Clustering Tutorial
Language: Java - Size: 77.1 KB - Last synced at: 3 months ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

Will436851/Raspberry-pi-OS-Documentation
Raspberry pi OS installation and system call demonstration
Language: Shell - Size: 1.32 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

MuhamedHekal/Hadoop-HA-Cluster-on-Docker
Hadoop3-HA-Docker is a production-ready, fault-tolerant Hadoop cluster deployed with Docker Compose. It automates the setup of a fully distributed Hadoop ecosystem with high availability (HA) features, designed for reliability, scalability, and real-world big data workloads
Language: Dockerfile - Size: 273 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

xpcosmos/data-lake-prime
This project aims to simulate and configure a Distributed File System using Hadoop HDFS. For this project, 3 machines were created: 1 Master Node and 2 Worker Nodes.
Language: Shell - Size: 815 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

whykay-01/movie-recommender-system
Final project for the big data class at NYU where I developed a movie recommendation system using MovieLens database and compared its performance against the popularity based models and other vanilla metrics
Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Mariam-iftikhar/BigDataProjects
The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.
Size: 10.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

keyhong/journal-ha-hadoop
Hadoop Journal HA Cluster Docker-compose
Language: Dockerfile - Size: 2.24 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ais04134/hyperv-hadoop-spark-cluster
Hadoop Ecosystem - Building a Hadoop cluster environment for large-scale frequent pattern mining
Language: Shell - Size: 2.35 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mchien15/nn-Docker-Hadoop-cluster
Docker Hadoop cluster with ecosystem
Language: Shell - Size: 105 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SalmaHisham/KafKa-Hadoop-Spark-cluster-for-Analytics Fork of nourhansowar/E-commerce-Customer-Behavior-Analysis
This project offers a dual approach to understanding e-commerce customer behavior through: Batch data analysis and Real-time data processing.
Language: HTML - Size: 10.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Heisenberghj7/Hands-on-Hadoop
🐘 In this repository, explore the core concepts of Hadoop, including HDFS and MapReduce, to help you grasp the fundamentals of big data processing and storage. Whether you're a beginner or an experienced data engineer, this resource is designed to enhance your understanding of Hadoop.
Size: 51.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

RonnJacob/PageRank-MapReduce-Spark
Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.
Language: Java - Size: 442 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

HeliaHashemipour/Hadoop-Spark
Third homework of CloudComputing - Fall 2022
Language: Jupyter Notebook - Size: 50.3 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

vineetsampat/Projects1
Language: TypeScript - Size: 2.77 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

rkazak07/Apache-Zeppelin-Helm-Chart
Apache Zeppelin kubernetes Installation
Language: Mustache - Size: 15.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DanMolenhouse/Distributed-Systems-Project5-Hadoop-and-Spark
In this project, we used both Hadoop/MapReduce and Spark to do distributed computing. The first task was to perform a series of operations using Mapper and Reducer Java files run on a Hadoop server. The second task was to perform similar operations, but on Spark instead.
Language: Java - Size: 70.3 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mooselab/hadoop-ansible
Ansible scripts for setting up Multi-Cluster Hadoop
Language: Shell - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

kumarvna/terraform-azurerm-hdinsight
Terraform module to create managed, full-spectrum, open-source analytics service Azure HDInsight. This module creates Apache Hadoop, Apache Spark, Apache HBase, Interactive Query (Apache Hive LLAP) and Apache Kafka clusters.
Language: HCL - Size: 365 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 5

mehroosali/ABCStoresPipeline
Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data to produce Power BI reports. Automated the pipelines using Airflow.
Language: Scala - Size: 464 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aogunwoolu/Ethereum-analysis
ETH analysis using big data for the QMUL Big Data Processing module. Intended to promote analysis of data retrieved via big data processing
Language: Jupyter Notebook - Size: 960 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mr-ravin/Smart-Hadoop-Cluster-SMHACL 📦
This is an automated Hadoop cluster building tool, which implements distributed computing for creating the cluster over the network. It is implemented in Python 2.7.
Language: Python - Size: 1.64 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

avojak/aws-hadoop-cluster
Infrastructure and configuration-as-code for standing up a Hadoop cluster in AWS
Language: Jinja - Size: 12.7 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Nikhil9546/ARTH2020_6_7
Hadoop cluster, Amazon Web Services, Docker services, Apache Web Server and some basic Linux commands are automated using Python programming and shell scripting
Language: Python - Size: 6.84 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

shreyasshivakumara/Reddit-Analysis-Large-Dataset-Scientific-Application
Architected and developed a horizontally scalable data processing solution for the Reddit dataset. Demonstrated scalability tests (weak scalability and strong scalability) with suitable computational analysis.
Language: Jupyter Notebook - Size: 190 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

spineo/hadoop-app
Size: 763 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

aiden-dai/ai-cluster
Start clusters in virtualbox VMs
Size: 198 KB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

adonis0147/hbase-in-docker
HBase cluster running in docker
Language: Shell - Size: 21.5 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

lk5164/hadoop-cluster-setup
Language: Shell - Size: 62.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0
