An open API service providing repository metadata for many open source software ecosystems.

Topic: "hadoop-mapreduce"

mahmoudparsian/data-algorithms-book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Language: Java - Size: 397 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1,075 - Forks: 661

bytedance/CloudShuffleService

Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.

Language: Java - Size: 1.23 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 58

touero/ctenopharyngodon-idella

Use the MapReduce's Java interface to distributed crawle the data of Chinese universities and learn basic knowledge of hdfs.

Language: Java - Size: 3.75 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 140 - Forks: 0

groda/big_data

Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.

Language: Jupyter Notebook - Size: 51.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 78 - Forks: 27

vim89/datapipelines-essentials-python

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations

Language: Python - Size: 1.76 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 34

seraogianluca/k-means-mapreduce 📦

K-Means algorithm implementation with Hadoop and Spark for the course of Cloud Computing of the MSc AIDE at the University of Pisa.

Language: Java - Size: 20.5 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 18

maniram-yadav/Big_DataHadoop_Projects

Big data projects implemented by Maniram yadav

Language: PigLatin - Size: 2.79 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 33 - Forks: 33

anjalysam/Hadoop

This contain how to install Hadoop on google colab and how to run map-reduce in Hadoop

Language: Jupyter Notebook - Size: 103 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 25 - Forks: 72

caizkun/mapreduce-examples

A collection of mapreduce problems and solutions

Language: Java - Size: 91.8 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 24 - Forks: 10

jmaister/wordcount 📦

Hadoop MapReduce word counting with Java

Language: Java - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 21 - Forks: 32

benedekh/bigdata-projects

Student projects in Big Data field.

Language: Java - Size: 198 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 19 - Forks: 12

absnaik810/CloudComputing

Projects done in the Cloud Computing course.

Language: Java - Size: 2.53 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 18 - Forks: 8

MoustafaAMahmoud/BigDataInDepth

Data Engineering Course

Language: TeX - Size: 78.9 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 9

QiushiSun/Distributed-Computing-Systems

2021 Spring (Distributed Computing Systems) 分布式系统与编程

Language: Java - Size: 101 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 15 - Forks: 1

arshdeepbahga/cloud-computing-solutions-architect-book-code

Source code for the examples in the book Cloud Computing Solutions Architect: A Hands-On Approach by Arshdeep Bahga and Vijay Madisetti

Language: CSS - Size: 10.9 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 20

SAKET-SK/Semester6-SPPU-Data-Analysis-Lab

I installed Hadoop on Virtual Machine and all Assignments are performed on Ubuntu OS. Refer to this repo for completion of the Hadoop Assignments. It is recommended that you have a stable internet connection while doing these things.

Language: Rebol - Size: 3.24 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 6

lucas91batista/twitter-hashtag-graph

Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Pyhton

Language: JavaScript - Size: 2.61 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 0

James-QiuHaoran/distributed-computing-platform-mapreduce

This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It is extended from a course project at UIUC awarded the best Java version implementation and it's open-sourced for reference.

Language: Java - Size: 454 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 4

Keerthivasan13/CSCI572-Information_Retrieval_And_Web_Search_Engines

Search Engine projects

Language: Java - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 17

hyeonsangjeon/dataplatform

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

Language: Shell - Size: 549 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 1

waltherg/distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Language: Shell - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

pasqualesalza/elephant56

A Genetic Algorithms framework for Hadoop MapReduce.

Language: Java - Size: 123 KB - Last synced at: 25 days ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

suselong/bigData-30-Days

零基础大数据学习笔记

Language: Java - Size: 15.5 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 3

imsanjoykb/PySpark-Bootcamp

My Practice and project on PySpark

Language: Jupyter Notebook - Size: 4.52 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 3

Areesha-Tahir/Hadoop-MapReduce-Sentiment-Analysis-Through-Keywords

A MapReduce program to conduct sentiment analysis of a keyword from a list of comments.

Language: Java - Size: 38.1 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 0

shask9/Matrix-Multiplication-Hadoop

Hadoop MapReduce program to compute multiplication of two sparse matrices

Language: Java - Size: 96.7 KB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 5

MariaDukmak/Hadopy

Easy parallel map-reduce command line tool

Language: Python - Size: 28.3 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 0

sjtu-sail/ops-hadoop 📦

Language: Java - Size: 4.74 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 4

navsing/In-Network-Hadoop-NDN-CacheSimulator

Simulates the data transfer to explore caching potential in network nodes running Hadoop over NDN (Named Data Networking) rather than traditional TCP/IP.

Language: Java - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

drexly/movie140reviewcorpus

네이버 영화 164397건 중 140자 평이 있는 영화별 평점 raw data for spark

Size: 336 MB - Last synced at: 9 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 5

syscrest/oozie-graphite

Monitor your oozie server and your oozie bundles with graphite

Language: Java - Size: 621 KB - Last synced at: about 1 year ago - Pushed at: about 10 years ago - Stars: 7 - Forks: 3

Mathews-Tom/MSc-in-Machine-Learning-and-Artificial-Intelligence

Master of Science in Machine Learning & Artificial Intelligence - Indian Institute Technology Madras & Liverpool John Moores University

Language: Jupyter Notebook - Size: 2.12 GB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 7

pfisterer/apache-hadoop-helm Fork of mgit-at/helm-hadoop-3

Helm chart for Apache Hadoop using multi-arch docker images

Language: Dockerfile - Size: 104 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 6

monisjaved/Data-Processing-With-Hadoop

Text Processing Using Hadoop

Language: Jupyter Notebook - Size: 21 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 2

HxnDev/Hadoop-MapReduce-to-Analyze-Sentiment-of-Keyword

In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.

Language: Java - Size: 1000 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 0

joshi-aditya/Amazon-Reviews-Dataset-Analysis-MapReduce

Amazon Customer Reviews Dataset Analysis using Hadoop MapReduce, Pig. Semester end project for INFO7250 Engineering of Big Data Systems course.

Language: Java - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 2

yalchinAlv/relevance-checker

Sorts the comments on Instagram posts in relevant order

Language: Java - Size: 8.42 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 0

jazzwang/hadoop_labs

MapReduce Java Code Examples to learn Hadoop

Language: Java - Size: 79.1 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 1

aadishgoel/Hadoop-Codes

Neat and Handy Place for all Hadoop codes

Language: Java - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 3

Coursal/Hadoop-Examples

Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.

Language: Java - Size: 340 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

karamolegkos/EverAnalyzer

EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.

Language: HTML - Size: 761 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

LMAPcoder/Hadoop-on-Colab

Installation and configuration of Hadoop on Google Colaboratory

Language: Jupyter Notebook - Size: 620 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 5

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS

In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.

Language: Java - Size: 451 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

kowaalczyk/spark-minimal-algorithms

An python implementation of Minimal Mapreduce Algorithms for Apache Spark

Language: Python - Size: 52.7 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

giovannigarifo/bigdata

Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark

Language: Java - Size: 69.1 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 2

dboston1/Reddit-Sentiment-Analysis

Program that performs textual analysis of Reddit data (approx. 300 GB) preprocessed by another team member. Uses Hadoop's Mapreduce to classify comments as either positive or negative based on certain keywords, negation, etc.

Language: Java - Size: 2.34 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 0

HawxChen/CloudComputing

MapReduce, Spark, Hadoop, PostgreSQL, Cluster Management

Language: Python - Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 0

sihamhafsi/projet-big-data_analyse-des-donnees-youtube

Language: Java - Size: 5.21 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

BGI-flexlab/SOAPgaea

Language: Java - Size: 69.6 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 9

HxnDev/Hadoop-MapReduce-to-Find-Average-Length-of-Comments

In this task, we had to find the average length of comments given in the dataset. It was done using Hadoop MapReduce and Hadoop HDFS.

Language: Java - Size: 675 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

tugrulhkarabulut/hadoop-movie-rating-prediction

Movie rating prediction application

Language: CSS - Size: 3.46 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

sharma-n/global_event_analytics

Big data analytics using Hadoop on GDELT global news dataset.

Language: Java - Size: 2.66 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

divyam-goel/ML-using-MapReduce-and-Spark

Naive Implementation of Machine Learning Algorithms in distributed frameworks MapReduce and Spark

Language: Scilab - Size: 589 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

sayantansatpati/ml

Machine Learning

Language: Jupyter Notebook - Size: 108 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

TonyApuzzo/fuzzyjoin

Fork of http://asterixdb.ics.uci.edu/fuzzyjoin/ Efficient Parallel Set-Similarity Joins Using MapReduce. Rares Vernica, Michael J. Carey, Chen Li SIGMOD 2010

Language: Java - Size: 2.88 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 3

lzmhhh123/Wikipedia-Index

A MapReduce execution of Wikipedia Index. Project of Fudan University Distributed System Course.

Language: Java - Size: 658 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

ruxuebu/Java-based-Movie-Recommender

A Movie Recommendation System implemented in Java base on Item-Item collaborative filtering algorithms

Language: Java - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 2

juagarmar/Cov-Cor-matrix-via-Rhadoop

Covariance and correlation matrix via Rhadoop (rmr2 and HDFS)

Language: R - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 0

Ruggero1912/mapreduce-bloom-filters

This project investigates how to build Bloom Filters using the MapReduce approach in Hadoop and Spark. Different implementations and further anlysis on performances are reported

Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mhamadelitawi/Handoop

Hadoop Map-Reduce implementations of many scientific computations

Language: Java - Size: 2.46 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

thedatasociety/lab-hadoop

Language: PLpgSQL - Size: 4.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 7

HiGal/GUSE

Search Engine based on Hadoop MapReduce

Language: Java - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 2

MarwanMashra/Hadoop-MapReduce

Map/Reduce project with Hadoop

Language: Python - Size: 1.11 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

nikopetr/Hadoop-MapReduce-Calculating-Sales-by-Country

Java program that uses Hadoop Map-Reduce for calculating the number of products and sales by country

Language: Java - Size: 81.1 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

SarahAyaz/YouTube_Data_Analysis

Analysis of YouTube Data using Hadoop Mapreduce framework in Java.

Language: Java - Size: 24.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

sloopstash/kickstart-hadoop

The ultimate aim of this Hadoop starter-kit Git repository is to help you deploy and manage Hadoop ecosystem components on AWS cloud using Docker, Kubernetes, and Chef.

Language: Ruby - Size: 150 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 7

Areesha-Tahir/Hadoop-MapReduce-To-Find-Average-Length-Of-Comments

A MapReduce program to calculate the average length of comments.

Language: Java - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

benjdiasaad/MapReduce_K-means

Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).

Language: Java - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

sreetamparida/Hiraishin

A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.

Language: Python - Size: 194 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

jomarsilio/Bootcamp-IGTI-Analista-de-Dados

Bootcamp ministrado pela IGTI com o objetivo de abordar de forma intensiva conceitos e práticas da análise de dados, habilitando o aluno para atuar profissionalmente na área.

Language: Jupyter Notebook - Size: 127 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

mac40/BDC

Big Data Computing

Language: Python - Size: 13.3 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

yennanliu/spark_emr_dev

Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce) | #DE

Language: Scala - Size: 3.72 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

gurpreetsahni/WordCount

WordCount Project on Hadoop . It Works on MapReduce .In this we first map the data of the file and provide them key number and than reduce will count the words and we will get the output file .

Language: Java - Size: 7.63 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

abhibalani/emr_lambda

Lambda to start EMR and run a map reduce job

Language: Python - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 1

rishabmenon/YouTube-Data-Analysis-Hadoop

This Hadoop project involves analysing the YouTube dataset to solve a few problem statements.

Size: 1.75 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 5

krishnadey30/NewsHeadlines

This repository have codes that extracts meaningful information from News headline data-set.

Language: Python - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 2

lazy-apple/BigData_Long

爬虫+大数据项目

Language: Java - Size: 183 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

choosewhatulike/Chinese-Ngram-LM-Hadoop

A distributed chinese n-gram language model implementation for train and test on large corpus , using Hadoop MapReduce.

Language: Java - Size: 15.6 KB - Last synced at: 30 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

ajerit/parallel-bfs

Parallel implementation of Breadth-First Search algorith in Java MapReduce and PySpark. This implementation finds degrees of separation between Twitter Users

Language: Python - Size: 16.4 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

Shubham-vish/hadoop-B-Tree

Language: Java - Size: 42 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

PatwinchIR/RandomForest-On-MapReduce

A MapReduce Version of Random Forest.

Language: Java - Size: 60.9 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 2

kashifmin/mapreduce-kotlin

An example MapReduce project written in Kotlin using IntelliJ IDE.

Language: Kotlin - Size: 6.84 KB - Last synced at: 25 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

samridhishree/Machine-Learning-for-Large-Datasets

Machine Learning models for large datasets

Language: Gnuplot - Size: 26.6 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

prabhuvashwin/PageRank-Algorithm-Implementation 📦

Implementation of Google's PageRank algorithm using Java, Hadoop, and MapReduce

Language: Java - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

highoncarbs/hadoopwithpy

:elephant: :heavy_plus_sign: :snake: Learning Hadoop with Python

Language: Python - Size: 86.6 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

MandarGogate/Association-Rule-Mining-Hadoop-Python

A case study on mining association rules between different factors related to deaths of people in the United States

Language: Python - Size: 146 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 2

gosaliajigar/SmartSort

CSC-550 Big Data : Smart Sort (Secondary Sort)

Language: Java - Size: 12.3 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 3

juagarmar/linear-regression-via-Rhadoop

Linear regression via Rhadoop (rmr2 and RHDFS)

Language: R - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 0

prabaprakash/Hadoop-2.3

Hadoop 2.3 for Windows x64

Language: CSS - Size: 61.5 MB - Last synced at: 3 months ago - Pushed at: almost 11 years ago - Stars: 3 - Forks: 6

elaaatif/JPEG-and-JPEG2000-compression-on-Multi-node-cluster-using-hadoop-and-spark

Big Data technologies can be leveraged for efficient, distributed image compression using JPEG2000 (Spark) and JPEG (MapReduce).

Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

nikisetti01/Hadoop-MapReduce-LetterFrequency-Analysis

Simple example of Hadoop Application count letter, with an intersting Romance Language Analysis

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 2

burhanahmed1/Big-Data-Analytics

Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.

Language: Jupyter Notebook - Size: 40 KB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

Christianivh/data_repo

Repositorio de datos

Language: Jupyter Notebook - Size: 1.69 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 8

erkanvatan/mapreduce-on-stackoverflow-dataset

Docker Hadoop Cluster MapReduce Example on Stackoverflow Dataset

Language: Java - Size: 522 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

yoyozaemon/BD-Assignment-UE20CS322

A repository containing the source codes for the Big Data Course Assignment and Project (UE20CS322) at PES University.

Language: Python - Size: 5.66 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

kkoless/MapReduce

Hadoop MapReduce Python

Language: Python - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

CycloneBoy/sparklearn

learn bigdata hadoop spark

Language: PLpgSQL - Size: 8.44 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 3

jprakashkce/Olympic_Participants-Analysis

Analysis of Olympic Participants dataset using Hadoop Map Reduce.

Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

UdeshikaDissa/BigData-MapReduce

This BigData study intends to identify the most revenue-generating Taxi zones in New York City for the year 2019. Three MapReduce algorithms were developed and their performance was analyzed on different size of input datasets and different size clusters in EMR.

Language: Java - Size: 1.32 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

TejSankhe/NYC-citi-bike-data-analysis

Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0