An open API service providing repository metadata for many open source software ecosystems.

Topic: "hadoop-cluster"

big-data-europe/docker-hadoop

Apache Hadoop docker image

Language: Shell - Size: 109 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 2,261 - Forks: 1,360

groda/big_data

Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.

Language: Jupyter Notebook - Size: 51.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 78 - Forks: 27

Impetus/jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

Language: Java - Size: 31.7 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 70 - Forks: 32

Segence/docker-hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

Language: Shell - Size: 46.9 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 61 - Forks: 40

sergevs/ansible-cloudera-hadoop

ansible playbook to deploy cloudera hadoop components to the cluster

Language: Shell - Size: 6.3 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 52 - Forks: 41

Wittline/apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

Language: VBA - Size: 63.7 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 43 - Forks: 27

rainmaple/WIFI_BussinessBigDataAnalyseSystem

A system designed to analyse big data collected from Wi-Fi probes

Language: JavaScript - Size: 98.6 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 31 - Forks: 18

hokstack/hok-helm

HokStack - Run Hadoop Stack on Kubernetes

Language: Shell - Size: 3.88 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 22 - Forks: 6

hadoop-sandbox/hadoop-sandbox

A fully-functional Hadoop Yarn cluster as docker-compose deployment.

Language: Shell - Size: 103 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 16 - Forks: 5

mikeroyal/Apache-Ignite-Guide

Apache Ignite Guide

Size: 162 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 4

hyeonsangjeon/dataplatform

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

Language: Shell - Size: 549 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 1

waltherg/distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Language: Shell - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

manuparra/MasterDegreeCC_Practice

Workshop for the Professional Master's in Computer Science at UGR. Cloud Computing course.

Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 3

lyingbo/hadoop-cluster-docker

Run Hadoop Cluster within Docker Containers

Language: Shell - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 3

pfisterer/apache-knox-docker

Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker

Language: Dockerfile - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 7

hadoop-sandbox/hadoop-sandbox-images

Docker image builds for Hadoop sandbox.

Language: Dockerfile - Size: 64.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 4

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS

In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.

Language: Java - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

chriskery/hadoop-operator

Kubernetes operator for managing the lifecycle of Apache Hadoop Yarn Tasks on Kubernetes.

Language: Go - Size: 3.06 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

PacktPublishing/Big-Data-Processing-with-Hadoop---A-Complete-Reference-Guide

Design, build, and execute effective big data strategies with advanced Hadoop concepts

Language: Java - Size: 65.4 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 5

MengmSun/hadoop-in-docker

Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.

Language: Shell - Size: 6.13 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

malabz/HAlign-2

a multiple sequence alignment tool

Language: HTML - Size: 1.6 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 4

tugrulhkarabulut/hadoop-movie-rating-prediction

Movie rating prediction application

Language: CSS - Size: 3.46 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

mitre/clusterconf

Manage Hadoop cluster configurations

Language: R - Size: 67.4 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 2

peyaa/bigdata-platform-on-k8s

deploy bigdata platform on kubernetes

Language: Shell - Size: 1.9 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 4

AnalyticsApps/LogAnalyzer

Analyses the customer logs for bigdata components like HDFS, Hive, HBase, Yarn, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, Nifi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas & Zookeeper.

Language: Shell - Size: 1.59 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 1

roboxue/YarnVision

UI for Hadoop Resource Manager

Language: Vue - Size: 54.7 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2

vitobellini/bigdata-cluster

BigData Cluster with Docker

Language: Shell - Size: 63.5 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 0

mitre/webhdfs

Interface with WebHDFS Service in a Cluster-Neutral Way

Language: R - Size: 124 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

sloopstash/kickstart-hadoop

The ultimate aim of this Hadoop starter-kit Git repository is to help you deploy and manage Hadoop ecosystem components on AWS cloud using Docker, Kubernetes, and Chef.

Language: Ruby - Size: 150 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 7

conema/spark-terraform

This project creates a Hadoop and Spark cluster on Amazon AWS with Terraform

Language: Shell - Size: 30.3 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 4

vietanh85/hadoop-docker

Apache Hadoop Cluster Docker images

Language: Shell - Size: 103 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

anchal-agrawal/hadoop-ubuntu

Everything to deploy a Hadoop cluster on Ubuntu

Language: Shell - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: about 9 years ago - Stars: 3 - Forks: 0

jazzwang/haduzilla

Automated Installation CD for Hadoop Cluster

Language: Shell - Size: 336 KB - Last synced at: about 2 years ago - Pushed at: over 11 years ago - Stars: 3 - Forks: 1

elaaatif/JPEG-and-JPEG2000-compression-on-Multi-node-cluster-using-hadoop-and-spark

Big Data technologies can be leveraged for efficient, distributed image compression using JPEG2000 (Spark) and JPEG (MapReduce).

Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

wecharyu/hadoop-docker

Docker images for apache hadoop cluster

Language: Dockerfile - Size: 190 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

ictu-research/docker

Apache Hadoop + Apache Solr

Language: Shell - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

mikeroyal/Apache-Hadoop-Guide

Apache Hadoop Guide

Size: 141 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

Crone1/Pig-and-Hive-MovieLens-Analysis

This repository contains analysis work I did on the MovieLens dataset using the big data tools Pig and Hive alongside the Hadoop infrastructure

Language: PigLatin - Size: 22.5 KB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

sdabhi23/hadoop-k8s 📦

Image to run hadoop pseudo distributed cluster

Language: Shell - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

alex-ber/docker-hive Fork of ops-guru/docker-hive

EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5

Language: Shell - Size: 45.9 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

pkeropen/hadoop-docker-lite

Hadoop cluster, lite edition (hadoop & kafka & storm & hbase & phoenix & pig & zookeeper & flume)

Language: Dockerfile - Size: 74.8 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

codito/hadoop-expt

Experiments with Hadoop cluster setups in Docker

Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

huangyueranbbc/hadoop05_pagerank

pagerank hadoop

Language: Java - Size: 39.5 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

jicahoo/chaten

Flume Hive ElasticSearch

Language: Shell - Size: 115 KB - Last synced at: 19 days ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

Rohit9314/my-hadoop

Setup hadoop cluster manually and automatically

Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

BigDataRepublic/hdp-installation-notes

Knowledge base for Hortonworks Ambari Installations

Size: 5.92 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 1

gathecageorge/hadoop

Contains docker files to build a hadoop container image

Language: Shell - Size: 51.8 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

aimanamri/raspberry-pi4-hadoop-spark-cluster

This is a self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark and Raspbian OS. In this project, a 3-node cluster is set up using Raspberry Pi 4, with HDFS installed and Spark processing jobs run via YARN.

Language: Shell - Size: 5.21 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Ohtar10/cloud-labs

This is a set of docker/ansible/terraform/aws recipes for different kinds of deployments, e.g., Jupyter, Zeppelin, EMR, Big Data Infrastructure, etc.

Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

seunggihong/hadoop-install-guide

📕 Guide to installing Hadoop and Spark on an Oracle virtual machine.

Language: Shell - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

jaynamm/docker-hadoop-cluster

Hadoop Cluster For Docker (-ing)

Language: Dockerfile - Size: 8.79 KB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

akshayavb99/Ansible-Examples

The repository contains all the Playbooks and other files used to work with different applications for Ansible

Language: Python - Size: 309 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 3

aquib-sh/setup-hadoop

A Bash script to set up Apache Hadoop and Apache Hive with Derby database on Debian GNU/Linux

Language: Shell - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AndreiFAD/raspberry_pi_cluster

I built my own mini analysis lab to practice and learn something new. An environment that I can use remotely with any device (with RealVNC) and of course an install script to rebuild it quickly anytime. I hope, it will be useful for others, who want to make their own.

Language: Shell - Size: 132 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

GGolfz/hadoop-script

Shell script for setting up Hadoop 3.3.1 on Ubuntu

Language: Shell - Size: 14.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

leibniz21c/one-click-hadoop Fork of ddps-lab/ubuntu-hadoop

One click hadoop test environment builder.

Language: Dockerfile - Size: 83 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

akaliutau/hadoop-cluster

Batch data processing on the dockerized Hadoop cluster

Language: Shell - Size: 93.8 KB - Last synced at: 4 days ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

sriw-world/ansible_ws

Ansible workspace

Language: HTML - Size: 42 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

fixcer/bigdata

bigdata

Language: Jupyter Notebook - Size: 178 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

humanbeeng/hadoop-auto-install

A small helper script that can save your valuable time during installation of Apache Hadoop.

Language: Shell - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Jomat18/Search-Engine-Hadoop-MapReduce

Search engine using Inverted Index, PageRank, Hadoop Streaming MapReduce for Python, a Flask backend, and a JavaScript, jQuery, CSS and HTML frontend, deployed on a cluster in Google Cloud

Language: Python - Size: 80.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 2

gitosamakhan/Multinode-Ambari-Cluster

Deploying a multinode ambari cluster on Linux (CentOS7) (Documentation)

Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

anshul1004/MutualFriends

Implementation of Hadoop and Spark

Language: Java - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Vaibhav-Mehta-19/ansible-python-hadoop-automated

automating the creation of a hadoop cluster using an ansible playbook, python-cgi and an html web page.

Language: Python - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

HemersonRafael/ect Fork of heltonmaia/ECT

This repository contains general information about projects under development in the School of Science and Technology (ECT) at the Federal University of Rio Grande do Norte (UFRN).

Language: C++ - Size: 60.9 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Vaibhav-Mehta-19/hadoop-setup

This file involves the steps to set up a hadoop cluster. This has been verified and tried in Redhat Linux version 7.

Language: Python - Size: 542 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

AsmaZgo/distribution_and_scripts

A repository for some scripts that can help in creating a distributed Big data ecosystem using the platform Grid5000.

Language: Shell - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

younthu/docker-hadoop Fork of big-data-europe/docker-hadoop

Apache Hadoop docker image cluster

Language: Shell - Size: 759 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

elek/flokkr-runtime-nomad 📦

Examples to run Hadoop/Spark cluster with Hashicorp nomad and consul.

Language: Shell - Size: 38.1 KB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

manoharpalanisamy/Distributed-Keras

Research And Development on Distributed Keras with Spark

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

chrisPiemonte/docker-hadoop-cluster

Hadoop cluster with docker-compose

Language: Shell - Size: 1.66 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

chih7/hadoop_install

hadoop cluster installer

Language: Shell - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

yjham2002/Hadoop_Clustering

📖 Apache Hadoop Based Clustering Tutorial

Language: Java - Size: 77.1 KB - Last synced at: 3 months ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

Will436851/Raspberry-pi-OS-Documentation

Raspberry pi OS installation and system call demonstration

Language: Shell - Size: 1.32 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

MuhamedHekal/Hadoop-HA-Cluster-on-Docker

Hadoop3-HA-Docker is a production-ready, fault-tolerant Hadoop cluster deployed with Docker Compose. It automates the setup of a fully distributed Hadoop ecosystem with high availability (HA) features, designed for reliability, scalability, and real-world big data workloads

Language: Dockerfile - Size: 273 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

xpcosmos/data-lake-prime

This project aims to simulate and configure a Distributed File System using Hadoop HDFS. For this project, 3 machines were created: 1 Master Node and 2 Worker Nodes.

Language: Shell - Size: 815 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

whykay-01/movie-recommender-system

Final project for the big data class at NYU where I developed a movie recommendation system using MovieLens database and compared its performance against the popularity based models and other vanilla metrics

Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Mariam-iftikhar/BigDataProjects

The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.

Size: 10.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

keyhong/journal-ha-hadoop

Hadoop Journal HA Cluster Docker-compose

Language: Dockerfile - Size: 2.24 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ais04134/hyperv-hadoop-spark-cluster

Hadoop Ecosystem - Building a Hadoop cluster environment for large-scale frequent pattern mining

Language: Shell - Size: 2.35 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mchien15/nn-Docker-Hadoop-cluster

Docker Hadoop cluster with ecosystem

Language: Shell - Size: 105 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SalmaHisham/KafKa-Hadoop-Spark-cluster-for-Analytics Fork of nourhansowar/E-commerce-Customer-Behavior-Analysis

This project offers a dual approach to understanding e-commerce customer behavior through: Batch data analysis and Real-time data processing.

Language: HTML - Size: 10.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Heisenberghj7/Hands-on-Hadoop

🐘 In this repository, explore the core concepts of Hadoop, including HDFS and MapReduce, to help you grasp the fundamentals of big data processing and storage. Whether you're a beginner or an experienced data engineer, this resource is designed to enhance your understanding of Hadoop.

Size: 51.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

RonnJacob/PageRank-MapReduce-Spark

Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.

Language: Java - Size: 442 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

HeliaHashemipour/Hadoop-Spark

Third homework of CloudComputing - Fall 2022

Language: Jupyter Notebook - Size: 50.3 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

vineetsampat/Projects1

Language: TypeScript - Size: 2.77 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

rkazak07/Apache-Zeppelin-Helm-Chart

Apache Zeppelin kubernetes Installation

Language: Mustache - Size: 15.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DanMolenhouse/Distributed-Systems-Project5-Hadoop-and-Spark

In this project, we used both Hadoop / MapReduce and Spark to do distributed computing. The first task was to perform a series of operations using a Mapper and Reduce java file that was implemented on a Hadoop server. The second task was to perform similar operations, but on Spark instead.

Language: Java - Size: 70.3 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mooselab/hadoop-ansible

Ansible scripts for setting up Multi-Cluster Hadoop

Language: Shell - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

kumarvna/terraform-azurerm-hdinsight

Terraform module to create managed, full-spectrum, open-source analytics service Azure HDInsight. This module creates Apache Hadoop, Apache Spark, Apache HBase, Interactive Query (Apache Hive LLAP) and Apache Kafka clusters.

Language: HCL - Size: 365 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 5

mehroosali/ABCStoresPipeline

Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data to produce Power BI reports. Automated the pipelines using Airflow.

Language: Scala - Size: 464 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aogunwoolu/Ethereum-analysis

ETH analysis using big data for the QMUL Big Data Processing module. Intended to promote analysis of data retrieved via big data processing

Language: Jupyter Notebook - Size: 960 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mr-ravin/Smart-Hadoop-Cluster-SMHACL 📦

This is an automated Hadoop cluster building tool, which implements distributed computing for creating the cluster over the network. This is implemented in Python 2.7

Language: Python - Size: 1.64 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

avojak/aws-hadoop-cluster

Infrastructure and configuration-as-code for standing up a Hadoop cluster in AWS

Language: Jinja - Size: 12.7 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Nikhil9546/ARTH2020_6_7

A Hadoop cluster, Amazon Web Services, Docker services, Apache Web Server and some basic Linux commands are automated using Python programming and shell scripting

Language: Python - Size: 6.84 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

shreyasshivakumara/Reddit-Analysis-Large-Dataset-Scientific-Application

Architected and developed a horizontally scalable data processing solution for the reddit dataset. Demonstrated the scalability (Weak Scalability and Strong Scalability) tests in suitable computational analysis.

Language: Jupyter Notebook - Size: 190 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

spineo/hadoop-app

Size: 763 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

aiden-dai/ai-cluster

Start clusters in virtualbox VMs

Size: 198 KB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

adonis0147/hbase-in-docker

HBase cluster running in docker

Language: Shell - Size: 21.5 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

lk5164/hadoop-cluster-setup

Language: Shell - Size: 62.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0