An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hadoop-framework

KeerthanaJ-rec/210701118-CS19P16-DA-Lab

Data Analytics Laboratory

Language: R - Size: 23.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

SAKET-SK/Semester6-SPPU-Data-Analysis-Lab

I installed Hadoop on a virtual machine, and all assignments were performed on Ubuntu. Refer to this repo to complete the Hadoop assignments. A stable internet connection is recommended while working through them.

Language: Rebol - Size: 3.24 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 6

linkedin/dynamometer

A tool for scale and performance testing of HDFS with a specific focus on the NameNode.

Language: Java - Size: 297 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 129 - Forks: 36

Rohit9314/my-hadoop

Set up a Hadoop cluster manually and automatically

Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

akshaytambe/Big-Data-Scripts

Python Scripts for working with Big Data Files

Language: Python - Size: 193 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

BigWheel92/PageRank-Algorithm-using-MapReduce

PageRank algorithm implemented in Java using the MapReduce framework

Language: Java - Size: 149 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

James-QiuHaoran/distributed-computing-platform-mapreduce

This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It is extended from a UIUC course project that was awarded best Java implementation, and it is open-sourced for reference.

Language: Java - Size: 454 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 4

Akilankm/Hadoop-Installation

This repo contains the steps for setting up a single-node Hadoop 3.2.1 cluster on Ubuntu 20.04 LTS

Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

satyajeetmaharana/floodprediction

The goal of this project is to identify flood-prone areas and estimate the probability of flooding in counties on a future date, using Spark MLlib.

Language: Scala - Size: 3.46 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

shienlong/parallel

WQD7008 Parallel and Distributed Computing Project

Size: 24.2 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 2

waltherg/distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Language: Shell - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

Cigna/ibis

IBIS is a workflow creation engine that abstracts the Hadoop internals of ingesting RDBMS data.

Language: Python - Size: 749 KB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 51 - Forks: 15

spoddutur/cloud-based-sql-engine-using-spark

Cloud-based SQL engine using Spark, where data is accessible as a JDBC/ODBC data source via Spark ThriftServer.

Language: Java - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 32 - Forks: 13

jayantakumar/Hadoop-In-Action-Introductory-Patent-Dataset-Analysis

A basic introductory example of using Hadoop's MapReduce libraries to load and analyse large datasets, in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents

Language: Java - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

SakhriHoussem/MapReduce-Python

MapReduce Python Example

Language: Python - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 4

suselong/bigData-30-Days

Big data study notes for complete beginners

Language: Java - Size: 15.5 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 3

giovannigarifo/bigdata

Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark

Language: Java - Size: 69.1 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 2

alex-ber/docker-hive (fork of ops-guru/docker-hive)

Single-node Hadoop Docker image for an EMR 5.25.0 cluster, with Amazon Linux, Hadoop 2.8.5, and Hive 2.3.5

Language: Shell - Size: 45.9 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

JayLohokare/distributed-GIS-framework

Distributed Hadoop and Spark based framework for in-memory GIS queries

Language: C++ - Size: 24.6 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

imdeepanshugpt/Hadoop

Hadoop-Cluster

Language: Python - Size: 887 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0