Topic: "hadoop-mapreduce"
mahmoudparsian/data-algorithms-book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Language: Java - Size: 397 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1,075 - Forks: 661

bytedance/CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
Language: Java - Size: 1.23 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 58

touero/ctenopharyngodon-idella
Use the MapReduce's Java interface to distributed crawle the data of Chinese universities and learn basic knowledge of hdfs.
Language: Java - Size: 3.75 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 140 - Forks: 0

groda/big_data
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Language: Jupyter Notebook - Size: 51.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 78 - Forks: 27

vim89/datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Language: Python - Size: 1.76 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 34

seraogianluca/k-means-mapreduce 📦
K-Means algorithm implementation with Hadoop and Spark for the course of Cloud Computing of the MSc AIDE at the University of Pisa.
Language: Java - Size: 20.5 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 18

maniram-yadav/Big_DataHadoop_Projects
Big data projects implemented by Maniram yadav
Language: PigLatin - Size: 2.79 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 33 - Forks: 33

anjalysam/Hadoop
This contain how to install Hadoop on google colab and how to run map-reduce in Hadoop
Language: Jupyter Notebook - Size: 103 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 25 - Forks: 72

caizkun/mapreduce-examples
A collection of mapreduce problems and solutions
Language: Java - Size: 91.8 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 24 - Forks: 10

jmaister/wordcount 📦
Hadoop MapReduce word counting with Java
Language: Java - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 21 - Forks: 32

benedekh/bigdata-projects
Student projects in Big Data field.
Language: Java - Size: 198 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 19 - Forks: 12

absnaik810/CloudComputing
Projects done in the Cloud Computing course.
Language: Java - Size: 2.53 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 18 - Forks: 8

MoustafaAMahmoud/BigDataInDepth
Data Engineering Course
Language: TeX - Size: 78.9 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 9

QiushiSun/Distributed-Computing-Systems
2021 Spring (Distributed Computing Systems) 分布式系统与编程
Language: Java - Size: 101 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 15 - Forks: 1

arshdeepbahga/cloud-computing-solutions-architect-book-code
Source code for the examples in the book Cloud Computing Solutions Architect: A Hands-On Approach by Arshdeep Bahga and Vijay Madisetti
Language: CSS - Size: 10.9 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 20

SAKET-SK/Semester6-SPPU-Data-Analysis-Lab
I installed Hadoop on Virtual Machine and all Assignments are performed on Ubuntu OS. Refer to this repo for completion of the Hadoop Assignments. It is recommended that you have a stable internet connection while doing these things.
Language: Rebol - Size: 3.24 MB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 6

lucas91batista/twitter-hashtag-graph
Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Pyhton
Language: JavaScript - Size: 2.61 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 0

James-QiuHaoran/distributed-computing-platform-mapreduce
This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It is extended from a course project at UIUC awarded the best Java version implementation and it's open-sourced for reference.
Language: Java - Size: 454 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 4

Keerthivasan13/CSCI572-Information_Retrieval_And_Web_Search_Engines
Search Engine projects
Language: Java - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 17

hyeonsangjeon/dataplatform
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
Language: Shell - Size: 549 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 11 - Forks: 1

waltherg/distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Language: Shell - Size: 88.9 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

pasqualesalza/elephant56
A Genetic Algorithms framework for Hadoop MapReduce.
Language: Java - Size: 123 KB - Last synced at: 25 days ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

suselong/bigData-30-Days
零基础大数据学习笔记
Language: Java - Size: 15.5 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 3

imsanjoykb/PySpark-Bootcamp
My Practice and project on PySpark
Language: Jupyter Notebook - Size: 4.52 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 3

Areesha-Tahir/Hadoop-MapReduce-Sentiment-Analysis-Through-Keywords
A MapReduce program to conduct sentiment analysis of a keyword from a list of comments.
Language: Java - Size: 38.1 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 0

shask9/Matrix-Multiplication-Hadoop
Hadoop MapReduce program to compute multiplication of two sparse matrices
Language: Java - Size: 96.7 KB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 5

MariaDukmak/Hadopy
Easy parallel map-reduce command line tool
Language: Python - Size: 28.3 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 0

sjtu-sail/ops-hadoop 📦
Language: Java - Size: 4.74 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 4

navsing/In-Network-Hadoop-NDN-CacheSimulator
Simulates the data transfer to explore caching potential in network nodes running Hadoop over NDN (Named Data Networking) rather than traditional TCP/IP.
Language: Java - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

drexly/movie140reviewcorpus
네이버 영화 164397건 중 140자 평이 있는 영화별 평점 raw data for spark
Size: 336 MB - Last synced at: 9 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 5

syscrest/oozie-graphite
Monitor your oozie server and your oozie bundles with graphite
Language: Java - Size: 621 KB - Last synced at: about 1 year ago - Pushed at: about 10 years ago - Stars: 7 - Forks: 3

Mathews-Tom/MSc-in-Machine-Learning-and-Artificial-Intelligence
Master of Science in Machine Learning & Artificial Intelligence - Indian Institute Technology Madras & Liverpool John Moores University
Language: Jupyter Notebook - Size: 2.12 GB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 7

pfisterer/apache-hadoop-helm Fork of mgit-at/helm-hadoop-3
Helm chart for Apache Hadoop using multi-arch docker images
Language: Dockerfile - Size: 104 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 6

monisjaved/Data-Processing-With-Hadoop
Text Processing Using Hadoop
Language: Jupyter Notebook - Size: 21 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 2

HxnDev/Hadoop-MapReduce-to-Analyze-Sentiment-of-Keyword
In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.
Language: Java - Size: 1000 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 0

joshi-aditya/Amazon-Reviews-Dataset-Analysis-MapReduce
Amazon Customer Reviews Dataset Analysis using Hadoop MapReduce, Pig. Semester end project for INFO7250 Engineering of Big Data Systems course.
Language: Java - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 2

yalchinAlv/relevance-checker
Sorts the comments on Instagram posts in relevant order
Language: Java - Size: 8.42 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 0

jazzwang/hadoop_labs
MapReduce Java Code Examples to learn Hadoop
Language: Java - Size: 79.1 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 1

aadishgoel/Hadoop-Codes
Neat and Handy Place for all Hadoop codes
Language: Java - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 3

Coursal/Hadoop-Examples
Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.
Language: Java - Size: 340 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

karamolegkos/EverAnalyzer
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Language: HTML - Size: 761 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

LMAPcoder/Hadoop-on-Colab
Installation and configuration of Hadoop on Google Colaboratory
Language: Jupyter Notebook - Size: 620 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 5

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS
In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.
Language: Java - Size: 451 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

kowaalczyk/spark-minimal-algorithms
An python implementation of Minimal Mapreduce Algorithms for Apache Spark
Language: Python - Size: 52.7 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

giovannigarifo/bigdata
Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark
Language: Java - Size: 69.1 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 2

dboston1/Reddit-Sentiment-Analysis
Program that performs textual analysis of Reddit data (approx. 300 GB) preprocessed by another team member. Uses Hadoop's Mapreduce to classify comments as either positive or negative based on certain keywords, negation, etc.
Language: Java - Size: 2.34 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 0

HawxChen/CloudComputing
MapReduce, Spark, Hadoop, PostgreSQL, Cluster Management
Language: Python - Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 0

sihamhafsi/projet-big-data_analyse-des-donnees-youtube
Language: Java - Size: 5.21 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

BGI-flexlab/SOAPgaea
Language: Java - Size: 69.6 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 9

HxnDev/Hadoop-MapReduce-to-Find-Average-Length-of-Comments
In this task, we had to find the average length of comments given in the dataset. It was done using Hadoop MapReduce and Hadoop HDFS.
Language: Java - Size: 675 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

tugrulhkarabulut/hadoop-movie-rating-prediction
Movie rating prediction application
Language: CSS - Size: 3.46 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

sharma-n/global_event_analytics
Big data analytics using Hadoop on GDELT global news dataset.
Language: Java - Size: 2.66 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

divyam-goel/ML-using-MapReduce-and-Spark
Naive Implementation of Machine Learning Algorithms in distributed frameworks MapReduce and Spark
Language: Scilab - Size: 589 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

sayantansatpati/ml
Machine Learning
Language: Jupyter Notebook - Size: 108 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

TonyApuzzo/fuzzyjoin
Fork of http://asterixdb.ics.uci.edu/fuzzyjoin/ Efficient Parallel Set-Similarity Joins Using MapReduce. Rares Vernica, Michael J. Carey, Chen Li SIGMOD 2010
Language: Java - Size: 2.88 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 3

lzmhhh123/Wikipedia-Index
A MapReduce execution of Wikipedia Index. Project of Fudan University Distributed System Course.
Language: Java - Size: 658 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

ruxuebu/Java-based-Movie-Recommender
A Movie Recommendation System implemented in Java base on Item-Item collaborative filtering algorithms
Language: Java - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 2

juagarmar/Cov-Cor-matrix-via-Rhadoop
Covariance and correlation matrix via Rhadoop (rmr2 and HDFS)
Language: R - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 0

Ruggero1912/mapreduce-bloom-filters
This project investigates how to build Bloom Filters using the MapReduce approach in Hadoop and Spark. Different implementations and further anlysis on performances are reported
Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mhamadelitawi/Handoop
Hadoop Map-Reduce implementations of many scientific computations
Language: Java - Size: 2.46 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

thedatasociety/lab-hadoop
Language: PLpgSQL - Size: 4.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 7

HiGal/GUSE
Search Engine based on Hadoop MapReduce
Language: Java - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 2

MarwanMashra/Hadoop-MapReduce
Map/Reduce project with Hadoop
Language: Python - Size: 1.11 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

nikopetr/Hadoop-MapReduce-Calculating-Sales-by-Country
Java program that uses Hadoop Map-Reduce for calculating the number of products and sales by country
Language: Java - Size: 81.1 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

SarahAyaz/YouTube_Data_Analysis
Analysis of YouTube Data using Hadoop Mapreduce framework in Java.
Language: Java - Size: 24.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

sloopstash/kickstart-hadoop
The ultimate aim of this Hadoop starter-kit Git repository is to help you deploy and manage Hadoop ecosystem components on AWS cloud using Docker, Kubernetes, and Chef.
Language: Ruby - Size: 150 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 7

Areesha-Tahir/Hadoop-MapReduce-To-Find-Average-Length-Of-Comments
A MapReduce program to calculate the average length of comments.
Language: Java - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

benjdiasaad/MapReduce_K-means
Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).
Language: Java - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

sreetamparida/Hiraishin
A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.
Language: Python - Size: 194 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

jomarsilio/Bootcamp-IGTI-Analista-de-Dados
Bootcamp ministrado pela IGTI com o objetivo de abordar de forma intensiva conceitos e práticas da análise de dados, habilitando o aluno para atuar profissionalmente na área.
Language: Jupyter Notebook - Size: 127 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

mac40/BDC
Big Data Computing
Language: Python - Size: 13.3 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

yennanliu/spark_emr_dev
Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce) | #DE
Language: Scala - Size: 3.72 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

gurpreetsahni/WordCount
WordCount Project on Hadoop . It Works on MapReduce .In this we first map the data of the file and provide them key number and than reduce will count the words and we will get the output file .
Language: Java - Size: 7.63 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

abhibalani/emr_lambda
Lambda to start EMR and run a map reduce job
Language: Python - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 1

rishabmenon/YouTube-Data-Analysis-Hadoop
This Hadoop project involves analysing the YouTube dataset to solve a few problem statements.
Size: 1.75 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 5

krishnadey30/NewsHeadlines
This repository have codes that extracts meaningful information from News headline data-set.
Language: Python - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 2

lazy-apple/BigData_Long
爬虫+大数据项目
Language: Java - Size: 183 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

choosewhatulike/Chinese-Ngram-LM-Hadoop
A distributed chinese n-gram language model implementation for train and test on large corpus , using Hadoop MapReduce.
Language: Java - Size: 15.6 KB - Last synced at: 30 days ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

ajerit/parallel-bfs
Parallel implementation of Breadth-First Search algorith in Java MapReduce and PySpark. This implementation finds degrees of separation between Twitter Users
Language: Python - Size: 16.4 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

Shubham-vish/hadoop-B-Tree
Language: Java - Size: 42 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

PatwinchIR/RandomForest-On-MapReduce
A MapReduce Version of Random Forest.
Language: Java - Size: 60.9 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 2

kashifmin/mapreduce-kotlin
An example MapReduce project written in Kotlin using IntelliJ IDE.
Language: Kotlin - Size: 6.84 KB - Last synced at: 25 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

samridhishree/Machine-Learning-for-Large-Datasets
Machine Learning models for large datasets
Language: Gnuplot - Size: 26.6 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

prabhuvashwin/PageRank-Algorithm-Implementation 📦
Implementation of Google's PageRank algorithm using Java, Hadoop, and MapReduce
Language: Java - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

highoncarbs/hadoopwithpy
:elephant: :heavy_plus_sign: :snake: Learning Hadoop with Python
Language: Python - Size: 86.6 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

MandarGogate/Association-Rule-Mining-Hadoop-Python
A case study on mining association rules between different factors related to deaths of people in the United States
Language: Python - Size: 146 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 2

gosaliajigar/SmartSort
CSC-550 Big Data : Smart Sort (Secondary Sort)
Language: Java - Size: 12.3 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 3

juagarmar/linear-regression-via-Rhadoop
Linear regression via Rhadoop (rmr2 and RHDFS)
Language: R - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 0

prabaprakash/Hadoop-2.3
Hadoop 2.3 for Windows x64
Language: CSS - Size: 61.5 MB - Last synced at: 3 months ago - Pushed at: almost 11 years ago - Stars: 3 - Forks: 6

elaaatif/JPEG-and-JPEG2000-compression-on-Multi-node-cluster-using-hadoop-and-spark
Big Data technologies can be leveraged for efficient, distributed image compression using JPEG2000 (Spark) and JPEG (MapReduce).
Size: 14.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

nikisetti01/Hadoop-MapReduce-LetterFrequency-Analysis
Simple example of Hadoop Application count letter, with an intersting Romance Language Analysis
Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 2

burhanahmed1/Big-Data-Analytics
Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.
Language: Jupyter Notebook - Size: 40 KB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

Christianivh/data_repo
Repositorio de datos
Language: Jupyter Notebook - Size: 1.69 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 8

erkanvatan/mapreduce-on-stackoverflow-dataset
Docker Hadoop Cluster MapReduce Example on Stackoverflow Dataset
Language: Java - Size: 522 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

yoyozaemon/BD-Assignment-UE20CS322
A repository containing the source codes for the Big Data Course Assignment and Project (UE20CS322) at PES University.
Language: Python - Size: 5.66 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

kkoless/MapReduce
Hadoop MapReduce Python
Language: Python - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

CycloneBoy/sparklearn
learn bigdata hadoop spark
Language: PLpgSQL - Size: 8.44 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 3

jprakashkce/Olympic_Participants-Analysis
Analysis of Olympic Participants dataset using Hadoop Map Reduce.
Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

UdeshikaDissa/BigData-MapReduce
This BigData study intends to identify the most revenue-generating Taxi zones in New York City for the year 2019. Three MapReduce algorithms were developed and their performance was analyzed on different size of input datasets and different size clusters in EMR.
Language: Java - Size: 1.32 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

TejSankhe/NYC-citi-bike-data-analysis
Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
