An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mapreduce-java

gilberto-009199/bigdata

Workspaces de BigData:

Language: Java - Size: 60.4 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

nielsbasjes/splittablegzip

Splittable Gzip codec for Hadoop

Language: Java - Size: 1.38 MB - Last synced at: 6 days ago - Pushed at: 22 days ago - Stars: 70 - Forks: 9

TechAlhan826/Hadoop-Tasks

Hadoop MapReduce Tasks Java - Big Data Project 🚀

Language: Java - Size: 275 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

KhalilKrugerOS/PaymentMethodCounter

INSAT exercice solution where we count how many transactions use Mastercard using MapReduce Frameword on hadoop

Language: Java - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Dare-marvel/Big-Data-Analytics--BDA--

💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍

Language: Java - Size: 174 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

berksudan/Analysis-on-Big-Data-with-Hadoop

Implementation of Statistical Methods via Hadoop Map-Reduce Library.

Language: Java - Size: 75.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ashishgopalhattimare/Parallel-Concurrent-and-Distributed-Programming-in-Java

Parallel, Concurrent, and Distributed Programming in Java | Coursera

Language: Java - Size: 34.5 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2

agustin-recoba/mosquitos-hpc

Proyecto Hadoop MapReduce de algoritmos de detección de tendencias sobre series temporales, aplicados a datos de ventas de productos relacionados con control de plagas (repelentes e insecticidas).

Language: Java - Size: 1.03 MB - Last synced at: 21 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

raza8899/Recommend_Friends_through_MapReduce

Its a Map Reduce Program which tells you about People you may know on the basis of mutual friends

Language: Java - Size: 2.66 MB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

anpu9/MIT6.824-MapReduce

MapReduce Implementation - Distributed System

Language: Go - Size: 21.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Coursal/Hadoop-Examples

Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.

Language: Java - Size: 340 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

amarkum/crunch-demo

crunch demo project

Language: Java - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

debajyotiguha11/BigDataAssignment_WordCount

Class assignment to understand the MapReduce Programming model in Hadoop.

Language: Java - Size: 5.9 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

emrectn/HadoopTutorial

hadoop

Language: Java - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

changfubai/hadoop-wordcount

Language: Java - Size: 3.64 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

aadishgoel/Hadoop-Codes

Neat and Handy Place for all Hadoop codes

Language: Java - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 3

fzehracetin/big-data-project

Big Data Processing and Analytics course term project.

Language: JavaScript - Size: 8.77 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

fbaldi6/PageRank-Hadoop Fork of edofazza/PageRank-Hadoop

Implementation of the MapReduce PageRank algorithm using the Hadoop framework in Java (developed for Cloud Computing course)

Size: 5.35 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

fbaldi6/PageRank-Spark Fork of edofazza/PageRank-Spark

Implementation of the MapReduce PageRank algorithm using the Spark framework both in Python and in Java (developed for Cloud Computing course)

Size: 4.99 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

10lloydj/NLP-RDF-Inverted-Index

This Map Reduce program should read in a set of RDF/XML documents and output the data in the form: {object}, [(predicate1, position, subject1)...]

Language: Java - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SomeshChevella/Apache-Hadoop-Map-Reduce--Basic-Sentiment-Analysis-on-Yelp-Dataset

In this project we will use Hadoop MapReduce to implement a very basic “Sentiment Analysis” using the review text in the Yelp Academic Dataset as training data.

Language: Java - Size: 7.39 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

HarshitDawar55/MapReduce

Programs for MapReduce written in java with least complexity!

Language: Java - Size: 76.2 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

iulianoroberto/MapReduceBasicApplications

Basic MapReduce applications in Java.

Language: Java - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Sumonta056/Hadoop-Clustering-Docker-Guide

Hadoop-Clustering-Docker-Guide : A Complete Documentation to setting up Hadoop and try clustering.

Language: Java - Size: 51.5 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

HxnDev/Hadoop-MapReduce-to-Analyze-Sentiment-of-Keyword

In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.

Language: Java - Size: 1000 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 0

leightonllc/FTEC4005 📦

FTEC4005 - Financial Informatics/ FTEC4003 - Data Mining for FinTech -- This repository contains codes for the bonus task, as well as the group project.

Language: Java - Size: 161 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

arberkuci/shared-memory-map-reduce

A shared-memory implementation of MapReduce.

Language: Java - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

anshul1004/MutualFriends

Implementation of Hadoop and Spark

Language: Java - Size: 23 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

benjdiasaad/MapReduce_K-means

Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).

Language: Java - Size: 32.2 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

NikolaAndro/Pagerank_Hadoop_MapReduce

Language: Java - Size: 6.02 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kushalhebbar/Big-data-project

Optimizing the storage capability of HDFS and HBase through data size factor with integrated security feature

Size: 48.9 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

SimoBkr/MapReduceJAVA

JAVA SWING APPLICATION MAPREDUCE

Language: Java - Size: 40 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

ucapdak/Olympic-Tweets

Assignment for Big Data Processing: A collection of programs for analysing tweets related to the 2012 Olympics.

Language: Java - Size: 223 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

ucapdak/MapReduce-Spark-Comparison

Assignment for Big Data Processing: Comparison between Spark and MapReduce programs for analysing large data sets.

Language: Java - Size: 651 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

DanMolenhouse/Distributed-Systems-Project5-Hadoop-and-Spark

In this project, we used both Hadoop / MapReduce and Spark to do distributed computing. The first task was to perform a series of operations using a Mapper and Reduce java file that was implemented on a Hadoop server. The second task was to perform similar operations, but on Spark instead.

Language: Java - Size: 70.3 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DA1OOO/Big-Data-Systems-and-Information-Processing

基于Hadoop集群的各类大数据存储、处理。

Language: Java - Size: 107 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

backslash112/crystal-ball-hadoop

A crystal ball to predict events that may happen once a certain event happened with MapReduce.

Language: Java - Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

RonnJacob/PageRank-MapReduce-Spark

Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.

Language: Java - Size: 442 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

ManasaPola/Distributed-Parallel_DB

Distributed and Parallel Database Tasks

Language: Python - Size: 1.46 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

pancr9/Cloud-Computing

The repository consists of Cloud Computing for Data Analysis project and assignments.

Language: Java - Size: 2.91 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

harsh306/Hadoop_Task 📦

Language: Java - Size: 92.8 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

a-poliakov/distributed_computing

Language: Java - Size: 15.2 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Elzawawy/hadoop-word-count

A simple MapReduce and Hadoop application to count words in a document ,implemented in Java to get a flavor for how they work.

Language: Java - Size: 22.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

SarahAyaz/YouTube_Data_Analysis

Analysis of YouTube Data using Hadoop Mapreduce framework in Java.

Language: Java - Size: 24.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

jieren123/Bigdata_Project_Recommender_System

Recommender system based on Item Collaborative Filtering and MapReduce

Language: Java - Size: 389 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 3

shashankg32/big_data_lab_nmit_6th_sem

big data lab nmit 6th sem

Language: Java - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Raveesh1505/BigData-Training

Big data training material

Language: Python - Size: 45.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

hiifong/MapReduce-multi-table-merge

MapReduce multi-table merge MapReduce多表合并

Language: Java - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

warrenlyr/K-Nearest-Neighbors-Implementation-in-Parallel-Programming

K-Nearest Neighbors implementation in parallel programming and cloud computing with MPI, MapReduce, Spark, and MASS.

Language: Java - Size: 33.5 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

charliecai00/Tree-Versus-Income

Examining the Relationship Between Tree Quality and Socioeconomic Status in New York City

Language: Java - Size: 32.5 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

concealedtea/cardinaalit

cardinality Counter for large .data files

Language: Java - Size: 21.5 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

dddddkio/Data-analysis-of-Sogou-query-log

使用hadoop mapreduce对搜狗2008年查询日志进行数据分析

Language: Java - Size: 120 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

markomih/kmeans_mapreduce

K-means MapReduce implementation

Language: Java - Size: 51 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 39 - Forks: 17

huangyueranbbc/RecommendByItemcf

Hadoop mapreduce. 基于ItemCF的协同过滤 物品推荐系统 Collaborative filtering goods recommendation system based on ItemCF

Language: Java - Size: 498 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 20 - Forks: 13

razo7/Nap

Nap: Network-Aware Data Partitions for Efficient Distributed Processing

Language: Mathematica - Size: 186 MB - Last synced at: 19 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Michu-dev/big-data-first-project

First academic big data project to implement analysis using MapReduce and Hive platform

Language: Java - Size: 109 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

RiccardoSagramoni/map-reduce-bloom-filter 📦

University Project for "Cloud Computing" course (MSc Computer Engineering @ University of Pisa). MapReduce applications implemented in Hadoop and Spark.

Language: Java - Size: 8.86 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

GovardhanR26/webserver-log-analysis

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jhlfrfufyfn/hadoop-web-robot

Web robot made with Hadoop MapReduce and Java

Language: Java - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

careycwang/CS5425-MapReduce-Common-Words

CS5425 Assignment 1: Top K Common Words

Language: Java - Size: 60.5 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ling67/Cloud-Computing

Cloud Computing Learning and Project 👩‍🎓‍🤦‍♀️🤷‍♀️

Language: HTML - Size: 933 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

HxnDev/Hadoop-MapReduce-to-Find-Average-Length-of-Comments

In this task, we had to find the average length of comments given in the dataset. It was done using Hadoop MapReduce and Hadoop HDFS.

Language: Java - Size: 675 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

Ishuan/Information-Retrieval

Information retrieval (IR) is concerned with finding material (e.g., documents) of an unstructured nature (usually text) in response to an information need (e.g., a query) from large collections. One approach to identify relevant documents is to compute scores based on the matches between terms in the query and terms in the documents. For example, a document with words such as ball, team, score, championship is likely to be about sports. It is helpful to define a weight for each term in a document that can be meaningful for computing such a score. We describe below popular information retrieval metrics such as term frequency, inverse document frequency, and their product, term frequency-inverse document frequency (TF-IDF), that are used to define weights for terms. ​ Term​ ​Frequency: ​ Term frequency is the number of times a particular word t occurs in a document d. TF(t,​ ​d)​ ​=​ ​No.​ ​of​ ​times​ ​t​ ​appears​ ​in​ ​document​ ​d Since the importance of a word in a document does not necessarily scale linearly with the frequency of its appearance, a common modification is to instead use the logarithm of the raw term frequency. WF(t,d)​ ​=​ ​1​ ​+​ ​log​10​ (TF(t,d))​ ​ ​if​ ​TF(t,d)​ ​>​ ​0,​ ​and​ ​0​ ​otherwise ​ ​ ​ ​ ​ We will use this logarithmically scaled term frequency in what follows. Inverse​ ​Document​ ​Frequency: The inverse document frequency (IDF) is a measure of how common or rare a term is across all documents in the collection. It is the logarithmically scaled fraction of the documents that contain the word, and is obtained by taking the logarithm of the ratio of the total number of documents to the number of documents containing the term. IDF(t)​ ​=​ ​log​10​ ​ ​(Total​ ​#​ ​of​ ​documents​ ​/​ ​#​ ​of​ ​documents​ ​containing​ ​term​ ​t) ​ ​ ​ ​ ​ ​ Under this IDF formula, terms appearing in all documents are assumed to be stopwords and subsequently assigned IDF=0. We will use the smoothed version of this formula as follows: ​ IDF(t)​ ​=​ ​log​10​ ​ ​(1​ ​+​ ​Total​ ​#​ ​of​ ​documents​ ​/​ ​#​ ​of​ ​documents​ ​containing​ ​term​ ​t) ​ ​ ​ ​ ​ Practically, smoothed IDF helps alleviating the out of vocabulary problem (OOV), where it is better to return to the user results rather than nothing even if his query matches every single document in the collection. TF-IDF: Term frequency–inverse document frequency (TF-IDF) is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus of documents. It is often used as a weighting factor in information retrieval and text mining. TF-IDF(t,​ ​d)​ ​=​ ​WF(t,d)​ ​*​ ​IDF(t) ​ ​ ​ ​

Language: Java - Size: 378 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

divinenaman/dbscan-mapreduce

DBSCAN implementation on mapreduce

Language: Java - Size: 247 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

stamatisbreezer/Hadoop-MapReduce

Problem solving on Hadoop using MapReduce

Language: Java - Size: 6.01 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS

In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.

Language: Java - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

ai1138/Analyzing_Brooklyn

For this project we studied 3 data sets revolving around neighborhoods in New York City. We hope to learn what neighborhoods in Brooklyn are good to live in

Language: HiveQL - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

jayantakumar/Hadoop-In-Action-Introductory-Patent-Dataset-Analysis

A basic introductory example of hadoops mapreduce libraries to load and analyse large datasets in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents

Language: Java - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

iamyufan/COMPSCI401-Projects

Personal repo for COMPSCI 401 project 1-3, 22SP@DKU

Language: Jupyter Notebook - Size: 2.17 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

sim-pez/k_means_distributed

K-Means algorithm for distributed systems

Language: Java - Size: 410 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

divyam-goel/ML-using-MapReduce-and-Spark

Naive Implementation of Machine Learning Algorithms in distributed frameworks MapReduce and Spark

Language: Scilab - Size: 589 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

shanmuga-sudan/Big-Data-Systems

This repo contains all the assignments, project work on Engineering Big Data Systems coursework

Language: C# - Size: 299 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

BorroGG/relation

Processing relational data using mapReduce, hive, pig.

Language: Java - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

BorroGG/cross-correlation

Cross Correlation Algorithm.

Language: Java - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

huangyueranbbc/hadoop05_pagerank

pagerank hadoop

Language: Java - Size: 39.5 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

aaaastark/Hadoop-Insallation-Commands-WordCount

Hadoop: Installation, Commands and Word Count Example

Language: Java - Size: 4.7 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 3

smohammadhejazi/twitter-mapreduce-practice

Applying MapReduce in Java on a Twitter dataset using Apache Hadoop

Language: Java - Size: 39.2 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ParthKalkar/intro_to_big_data

Introductory Big Data concepts using Spark framework and different libraries

Language: Java - Size: 4.61 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

tkhan11/Big-Data-Hadoop-Project

Big Data Hadoop framework project for analysis of superstore sales data to find insights.

Language: Java - Size: 5.36 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

Lakhan-Nad/MapReduce

A small hadoop map reduce implemented for Big Data Project

Language: Java - Size: 850 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

ArArgon/UESTC-CloudComputing-Experiment

远离远古 Eclipse, 远离上古软件和阴间插件

Language: Java - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

gorkinovich/SGDI

Sistemas de Gestión de Datos y de la Información (UCM, 2015)

Language: Java - Size: 2.74 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

szaher/spark

Playing with Spark using Java

Language: Java - Size: 424 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

raymondzheranlei-github/movie-recommender-system

a movie recommender system that can predict movies or videos that users may be interested in

Language: Java - Size: 616 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

ankit08015/Engg-Of-Big-Data

Repository for course INFO7250 - Engineering of Big Data

Language: Java - Size: 308 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 3

sayaliwalke30/Hadoop-Mapreduce

Data analysis on Big Data. Used various databases from 1M to 100M including Movie Lens dataset to perform analysis. Covers basics and advance map reduce using Hadoop.

Size: 4.95 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 3

rachelzhaolp/BigData-HW-MapReduce

Solutions of some MapReduce Problem

Language: Java - Size: 133 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

SinghHarshita/MapReduce-Examples

Word co-occurrence and Matrix Multiplication using MapReduce

Language: Java - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

ktothep/MapReduce

Implementation of MapReduce programs other than word count

Language: Java - Size: 64 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

abhishekmsharma/big-data-electricity-consumption-analysis-apache-spark

Developed for analysing and visualizing trends related to electricity and energy consumption

Language: Java - Size: 145 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

Super-Special-Pookies/PhraseExtract

Hadoop MapReduce Assignment: Distributed Phrase Extraction(Unregistered Word Discovery)

Language: Java - Size: 40.4 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

ac547/WordCount-MapReduce

Fully distributed Hadoop on AWS EC2 Cluster, executes WordCount MapReduce operations and analyzes performance as a function of Cluster Size.

Language: Java - Size: 19.5 KB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

jaskier07/Hadoop-lab

Solving simple tasks with Apache Hadoop.

Language: Java - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

DavideAG/BigData

Spark, RDDs and Map Reduce applications related to the BigData @Polito course (2019-2020). A set of personal notes are already provided.

Language: Java - Size: 5.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ayenpure/StockMeUp

This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.

Language: Java - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

anevsky/bigdata-101

Big Data ramp-up

Language: Java - Size: 6.48 MB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 1

lurifn/mapreduce-ep3-client

Um sistema que permite a um programa cliente requisitar, a uma arquitetura Map-Reduce, a criação de um índice invertido de links (semelhante a uma das atividades do PageRank do Google)

Language: Java - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

RolandTaverner/hadoop_tutorial

Language: Java - Size: 2.85 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Arko98/Product_Purchase_Prediction

Prediction of purchase of Bank Product using Map Reduce Naive Bayes

Language: Java - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

ziyaddhuka/Airine-data-analysis

Big Data project on Airline dataset

Language: Java - Size: 309 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0