Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mapreduce-java

Dare-marvel/Big-Data-Analytics--BDA--

💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍

Language: Java - Size: 174 MB - Last synced: about 8 hours ago - Pushed: about 10 hours ago - Stars: 1 - Forks: 1

Coursal/Hadoop-Examples

Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.

Language: Java - Size: 340 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 5 - Forks: 2

nielsbasjes/splittablegzip

Splittable Gzip codec for Hadoop

Language: Java - Size: 1.37 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 68 - Forks: 8

amarkum/crunch-demo

crunch demo project

Language: Java - Size: 7.81 KB - Last synced: 11 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

anpu9/MIT6.824-MapReduce

MapReduce Implementation - Distributed System

Language: Go - Size: 21.1 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 0 - Forks: 0

debajyotiguha11/BigDataAssignment_WordCount

Class assignment to understand the MapReduce Programming model in Hadoop.

Language: Java - Size: 5.9 MB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

emrectn/HadoopTutorial

hadoop

Language: Java - Size: 15.6 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

changfubai/hadoop-wordcount

Language: Java - Size: 3.64 MB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

aadishgoel/Hadoop-Codes

Neat and Handy Place for all Hadoop codes

Language: Java - Size: 25.4 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 3

fzehracetin/big-data-project

Big Data Processing and Analytics course term project.

Language: JavaScript - Size: 8.77 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

fbaldi6/PageRank-Hadoop (fork of edofazza/PageRank-Hadoop)

Implementation of the MapReduce PageRank algorithm using the Hadoop framework in Java (developed for Cloud Computing course)

Size: 5.35 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

fbaldi6/PageRank-Spark (fork of edofazza/PageRank-Spark)

Implementation of the MapReduce PageRank algorithm using the Spark framework both in Python and in Java (developed for Cloud Computing course)

Size: 4.99 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

10lloydj/NLP-RDF-Inverted-Index

This Map Reduce program should read in a set of RDF/XML documents and output the data in the form: {object}, [(predicate1, position, subject1)...]

Language: Java - Size: 11.7 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

SomeshChevella/Apache-Hadoop-Map-Reduce--Basic-Sentiment-Analysis-on-Yelp-Dataset

In this project we will use Hadoop MapReduce to implement a very basic “Sentiment Analysis” using the review text in the Yelp Academic Dataset as training data.

Language: Java - Size: 7.39 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

HarshitDawar55/MapReduce

MapReduce programs written in Java with minimal complexity!

Language: Java - Size: 76.2 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

Sumonta056/Hadoop-Clustering-Docker-Guide

Hadoop-Clustering-Docker-Guide: complete documentation for setting up Hadoop and trying out clustering.

Language: Java - Size: 51.5 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

leightonllc/FTEC4005 📦

FTEC4005 - Financial Informatics/ FTEC4003 - Data Mining for FinTech -- This repository contains codes for the bonus task, as well as the group project.

Language: Java - Size: 161 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

arberkuci/shared-memory-map-reduce

A shared-memory implementation of MapReduce.

Language: Java - Size: 12.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

anshul1004/MutualFriends

Implementation of Hadoop and Spark

Language: Java - Size: 23 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0

iulianoroberto/MapReduceApplications

Basic MapReduce applications in Java.

Language: Java - Size: 16.6 KB - Last synced: 6 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

benjdiasaad/MapReduce_K-means

Implementation of the k-means clustering algorithm using the Hadoop framework, version 3.1.3 (MapReduce).

Language: Java - Size: 32.2 KB - Last synced: 4 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 2

NikolaAndro/Pagerank_Hadoop_MapReduce

Language: Java - Size: 6.02 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

kushalhebbar/Big-data-project

Optimizing the storage capability of HDFS and HBase through data size factor with integrated security feature

Size: 48.9 MB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

SimoBkr/MapReduceJAVA

JAVA SWING APPLICATION MAPREDUCE

Language: Java - Size: 40 KB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

ucapdak/Olympic-Tweets

Assignment for Big Data Processing: A collection of programs for analysing tweets related to the 2012 Olympics.

Language: Java - Size: 223 KB - Last synced: 9 months ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0

ucapdak/MapReduce-Spark-Comparison

Assignment for Big Data Processing: Comparison between Spark and MapReduce programs for analysing large data sets.

Language: Java - Size: 651 KB - Last synced: 9 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

DanMolenhouse/Distributed-Systems-Project5-Hadoop-and-Spark

In this project, we used both Hadoop MapReduce and Spark for distributed computing. The first task was to perform a series of operations using Mapper and Reducer Java files run on a Hadoop server. The second task was to perform similar operations on Spark instead.

Language: Java - Size: 70.3 KB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

DA1OOO/Big-Data-Systems-and-Information-Processing

Various kinds of big data storage and processing on a Hadoop cluster.

Language: Java - Size: 107 MB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

backslash112/crystal-ball-hadoop

A crystal ball, built with MapReduce, that predicts events likely to follow once a certain event has happened.

Language: Java - Size: 18.6 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

RonnJacob/PageRank-MapReduce-Spark

Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.

Language: Java - Size: 442 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1

ManasaPola/Distributed-Parallel_DB

Distributed and Parallel Database Tasks

Language: Python - Size: 1.46 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

pancr9/Cloud-Computing

The repository consists of Cloud Computing for Data Analysis project and assignments.

Language: Java - Size: 2.91 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

harsh306/Hadoop_Task 📦

Language: Java - Size: 92.8 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 1

a-poliakov/distributed_computing

Language: Java - Size: 15.2 MB - Last synced: 10 months ago - Pushed: almost 1 year ago - Stars: 0 - Forks: 0

Elzawawy/hadoop-word-count

A simple MapReduce and Hadoop application to count words in a document, implemented in Java to get a flavor for how they work.

Language: Java - Size: 22.5 KB - Last synced: 10 months ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 2

SarahAyaz/YouTube_Data_Analysis

Analysis of YouTube data using the Hadoop MapReduce framework in Java.

Language: Java - Size: 24.5 MB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 2

jieren123/Bigdata_Project_Recommender_System

Recommender system based on Item Collaborative Filtering and MapReduce

Language: Java - Size: 389 KB - Last synced: 11 months ago - Pushed: over 6 years ago - Stars: 17 - Forks: 3

shashankg32/big_data_lab_nmit_6th_sem

big data lab nmit 6th sem

Language: Java - Size: 11.7 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

Raveesh1505/BigData-Training

Big data training material

Language: Python - Size: 45.9 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

hiifong/MapReduce-multi-table-merge

MapReduce multi-table merge

Language: Java - Size: 6.84 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

warrenlyr/K-Nearest-Neighbors-Implementation-in-Parallel-Programming

K-Nearest Neighbors implementation in parallel programming and cloud computing with MPI, MapReduce, Spark, and MASS.

Language: Java - Size: 33.5 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

charliecai00/Tree-Versus-Income

Examining the Relationship Between Tree Quality and Socioeconomic Status in New York City

Language: Java - Size: 32.5 MB - Last synced: 5 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

concealedtea/cardinaalit

cardinality Counter for large .data files

Language: Java - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

dddddkio/Data-analysis-of-Sogou-query-log

Data analysis of the 2008 Sogou query log using Hadoop MapReduce.

Language: Java - Size: 120 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1

markomih/kmeans_mapreduce

K-means MapReduce implementation

Language: Java - Size: 51 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 39 - Forks: 17

huangyueranbbc/RecommendByItemcf

Hadoop MapReduce. Collaborative filtering item recommendation system based on ItemCF.

Language: Java - Size: 498 KB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 20 - Forks: 13

razo7/Nap

Nap: Network-Aware Data Partitions for Efficient Distributed Processing

Language: Mathematica - Size: 186 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Michu-dev/big-data-first-project

First academic big data project to implement analysis using MapReduce and Hive platform

Language: Java - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

RiccardoSagramoni/map-reduce-bloom-filter 📦

University Project for "Cloud Computing" course (MSc Computer Engineering @ University of Pisa). MapReduce applications implemented in Hadoop and Spark.

Language: Java - Size: 8.86 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

GovardhanR26/webserver-log-analysis

Language: Jupyter Notebook - Size: 1.83 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

jhlfrfufyfn/hadoop-web-robot

Web robot made with Hadoop MapReduce and Java

Language: Java - Size: 3.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

careycwang/CS5425-MapReduce-Common-Words

CS5425 Assignment 1: Top K Common Words

Language: Java - Size: 60.5 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ling67/Cloud-Computing

Cloud Computing Learning and Project 👩‍🎓‍🤦‍♀️🤷‍♀️

Language: HTML - Size: 933 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

HxnDev/Hadoop-MapReduce-to-Find-Average-Length-of-Comments

In this task, we had to find the average length of comments given in the dataset. It was done using Hadoop MapReduce and Hadoop HDFS.

Language: Java - Size: 675 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 4 - Forks: 0

Ishuan/Information-Retrieval

Information retrieval (IR) is concerned with finding material (e.g., documents) of an unstructured nature (usually text) in response to an information need (e.g., a query) from large collections. One approach to identifying relevant documents is to compute scores based on the matches between terms in the query and terms in the documents. For example, a document with words such as ball, team, score, and championship is likely to be about sports. It is helpful to define a weight for each term in a document that can be meaningful for computing such a score. We describe below popular information retrieval metrics, namely term frequency, inverse document frequency, and their product, term frequency-inverse document frequency (TF-IDF), that are used to define weights for terms.

Term Frequency: Term frequency is the number of times a particular word t occurs in a document d.

$$\mathrm{TF}(t, d) = \text{number of times } t \text{ appears in document } d$$

Since the importance of a word in a document does not necessarily scale linearly with the frequency of its appearance, a common modification is to use the logarithm of the raw term frequency instead.

$$\mathrm{WF}(t, d) = \begin{cases} 1 + \log_{10} \mathrm{TF}(t, d) & \text{if } \mathrm{TF}(t, d) > 0 \\ 0 & \text{otherwise} \end{cases}$$

We will use this logarithmically scaled term frequency in what follows.

Inverse Document Frequency: The inverse document frequency (IDF) is a measure of how common or rare a term is across all documents in the collection. It is the logarithmically scaled inverse of the fraction of documents that contain the word, obtained by taking the logarithm of the ratio of the total number of documents to the number of documents containing the term.

$$\mathrm{IDF}(t) = \log_{10}\left(\frac{\text{total number of documents}}{\text{number of documents containing term } t}\right)$$

Under this IDF formula, terms appearing in all documents are assumed to be stopwords and are assigned IDF = 0. We will use the smoothed version of this formula:

$$\mathrm{IDF}(t) = \log_{10}\left(1 + \frac{\text{total number of documents}}{\text{number of documents containing term } t}\right)$$

In practice, smoothed IDF helps alleviate the out-of-vocabulary (OOV) problem: it is better to return results to the user than nothing, even if the query matches every single document in the collection.

TF-IDF: Term frequency–inverse document frequency (TF-IDF) is a numerical statistic intended to reflect how important a word is to a document in a collection or corpus of documents. It is often used as a weighting factor in information retrieval and text mining.

$$\text{TF-IDF}(t, d) = \mathrm{WF}(t, d) \cdot \mathrm{IDF}(t)$$
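The weighting described above can be sketched in a few lines of plain Java. This is a minimal single-machine illustration, not the repository's MapReduce code: the class name `TfIdfSketch`, its method names, and the choice to give a term that appears in no document an IDF of 0 are all assumptions made for this example.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class: names and signatures are illustrative only.
final class TfIdfSketch {

    // Raw term frequency: number of times `term` occurs in the tokenized document.
    static long tf(String term, List<String> doc) {
        return doc.stream().filter(term::equals).count();
    }

    // Log-scaled term frequency: 1 + log10(TF) if TF > 0, otherwise 0.
    static double wf(long tf) {
        return tf > 0 ? 1.0 + Math.log10(tf) : 0.0;
    }

    // Smoothed IDF: log10(1 + totalDocs / docsContainingTerm).
    static double idf(String term, List<List<String>> corpus) {
        long docsWithTerm = corpus.stream().filter(d -> d.contains(term)).count();
        if (docsWithTerm == 0) {
            return 0.0; // assumption: a term found in no document gets weight 0
        }
        return Math.log10(1.0 + (double) corpus.size() / docsWithTerm);
    }

    // TF-IDF(t, d) = WF(t, d) * IDF(t).
    static double tfIdf(String term, List<String> doc, List<List<String>> corpus) {
        return wf(tf(term, doc)) * idf(term, corpus);
    }

    public static void main(String[] args) {
        List<String> d1 = Arrays.asList("ball", "team", "ball");
        List<String> d2 = Arrays.asList("score", "team");
        List<List<String>> corpus = Arrays.asList(d1, d2);
        System.out.println("TF-IDF(ball, d1) = " + tfIdf("ball", d1, corpus));
        System.out.println("TF-IDF(team, d1) = " + tfIdf("team", d1, corpus));
    }
}
```

In a Hadoop job the same arithmetic would typically be split across several map and reduce passes (per-document counts, then per-term document counts, then the product); the sketch only shows the formulas themselves.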

Language: Java - Size: 378 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

divinenaman/dbscan-mapreduce

DBSCAN implementation on mapreduce

Language: Java - Size: 247 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0

HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS

In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.

Language: Java - Size: 451 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 5 - Forks: 0

ai1138/Analyzing_Brooklyn

For this project we studied three data sets about neighborhoods in New York City. We hope to learn which neighborhoods in Brooklyn are good to live in.

Language: HiveQL - Size: 35.2 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 2

jayantakumar/Hadoop-In-Action-Introductory-Patent-Dataset-Analysis

A basic introductory example of using Hadoop's MapReduce libraries to load and analyse large datasets, in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents

Language: Java - Size: 28.3 KB - Last synced: 12 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

iamyufan/COMPSCI401-Projects

Personal repo for COMPSCI 401 project 1-3, 22SP@DKU

Language: Jupyter Notebook - Size: 2.17 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

sim-pez/k_means_distributed

K-Means algorithm for distributed systems

Language: Java - Size: 410 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1

divyam-goel/ML-using-MapReduce-and-Spark

Naive Implementation of Machine Learning Algorithms in distributed frameworks MapReduce and Spark

Language: Scilab - Size: 589 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 0

shanmuga-sudan/Big-Data-Systems

This repo contains all the assignments, project work on Engineering Big Data Systems coursework

Language: C# - Size: 299 MB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

BorroGG/relation

Processing relational data using mapReduce, hive, pig.

Language: Java - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

BorroGG/cross-correlation

Cross Correlation Algorithm.

Language: Java - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

huangyueranbbc/hadoop05_pagerank

pagerank hadoop

Language: Java - Size: 39.5 MB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 0

aaaastark/Hadoop-Insallation-Commands-WordCount

Hadoop: Installation, Commands and Word Count Example

Language: Java - Size: 4.7 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 3

smohammadhejazi/twitter-mapreduce-practice

Applying MapReduce in Java on a Twitter dataset using Apache Hadoop

Language: Java - Size: 39.2 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

ParthKalkar/intro_to_big_data

Introductory Big Data concepts using Spark framework and different libraries

Language: Java - Size: 4.61 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

tkhan11/Big-Data-Hadoop-Project

Big Data Hadoop framework project for analysis of superstore sales data to find insights.

Language: Java - Size: 5.36 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 1

Lakhan-Nad/MapReduce

A small Hadoop MapReduce implementation for a big data project

Language: Java - Size: 850 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

ArArgon/UESTC-CloudComputing-Experiment

Steer clear of ancient Eclipse, archaic software, and nightmarish plugins

Language: Java - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

gorkinovich/SGDI

Data and Information Management Systems (Sistemas de Gestión de Datos y de la Información, UCM, 2015)

Language: Java - Size: 2.74 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

szaher/spark

Playing with Spark using Java

Language: Java - Size: 424 KB - Last synced: about 2 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

HxnDev/Hadoop-MapReduce-to-Analyze-Sentiment-of-Keyword

In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.

Language: Java - Size: 1000 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 5 - Forks: 0

ankit08015/Engg-Of-Big-Data

Repository for course INFO7250 - Engineering of Big Data

Language: Java - Size: 308 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 3

sayaliwalke30/Hadoop-Mapreduce

Data analysis on big data. Used various datasets from 1M to 100M, including the MovieLens dataset, to perform analysis. Covers basic and advanced MapReduce using Hadoop.

Size: 4.95 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 3

rachelzhaolp/BigData-HW-MapReduce

Solutions to some MapReduce problems

Language: Java - Size: 133 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1

SinghHarshita/MapReduce-Examples

Word co-occurrence and Matrix Multiplication using MapReduce

Language: Java - Size: 11.4 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

ktothep/MapReduce

Implementation of MapReduce programs other than word count

Language: Java - Size: 64 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

abhishekmsharma/big-data-electricity-consumption-analysis-apache-spark

Developed for analysing and visualizing trends related to electricity and energy consumption

Language: Java - Size: 145 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1

Super-Special-Pookies/PhraseExtract

Hadoop MapReduce Assignment: Distributed Phrase Extraction (Unregistered Word Discovery)

Language: Java - Size: 40.4 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

jaskier07/Hadoop-lab

Solving simple tasks with Apache Hadoop.

Language: Java - Size: 32.2 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

DavideAG/BigData

Spark, RDDs and Map Reduce applications related to the BigData @Polito course (2019-2020). A set of personal notes are already provided.

Language: Java - Size: 5.7 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ayenpure/StockMeUp

This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.

Language: Java - Size: 17.6 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

anevsky/bigdata-101

Big Data ramp-up

Language: Java - Size: 6.48 MB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 0 - Forks: 1

berksudan/Analysis-on-Big-Data-with-Hadoop

Implementation of Statistical Methods via Hadoop Map-Reduce Library.

Language: Java - Size: 75.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

lurifn/mapreduce-ep3-client

A system that lets a client program request, from a MapReduce architecture, the creation of an inverted index of links (similar to one of the steps in Google's PageRank)

Language: Java - Size: 13.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

RolandTaverner/hadoop_tutorial

Language: Java - Size: 2.85 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Arko98/Product_Purchase_Prediction

Prediction of purchase of Bank Product using Map Reduce Naive Bayes

Language: Java - Size: 35.2 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0

ziyaddhuka/Airine-data-analysis

Big Data project on Airline dataset

Language: Java - Size: 309 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

Tabed23/Map_Reduce_WordCount

Hadoop Map Reduce Word count example

Language: Java - Size: 5.86 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

varshinireddyt/Big-Data-Cloud-computing

Class Projects related to big data, spark, Hadoop, Pig, Hive

Language: Java - Size: 690 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

ashishgopalhattimare/Parallel-Concurrent-and-Distributed-Programming-in-Java

Parallel, Concurrent, and Distributed Programming in Java | Coursera

Language: Java - Size: 34.5 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

reethified/techsquids-code-examples

Language: Java - Size: 80.1 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

foolfun/SomeMapReduceCases

MapReduce examples and introductory big data notes

Language: Java - Size: 53.7 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

soyelherein/BigData

Here we will try to solve very preliminary big data problems using Java, suitable for beginners or college projects

Language: Java - Size: 432 KB - Last synced: 12 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

Ishuan/Page-Rank-Implementation

The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank results for user-submitted queries. The goals of this assignment are to: 1. Understand the PageRank algorithm and how it works in MapReduce. 2. Implement PageRank and execute it on a large corpus of data. 3. Examine the output from running PageRank on Simple English Wikipedia to measure the relative importance of pages in the corpus. To run your program on the full Simple English Wikipedia archive, you will need to run it on the dsba-hadoop cluster to which you have access.

Language: Java - Size: 36.1 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

philomathic-guy/Friend-recommendation-using-movie-data

Language: Java - Size: 756 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0

ShreeshaN/BigDataTutorials

Hadoop MapReduce jobs, Pig Queries

Language: Java - Size: 514 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0