Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: mapreduce-java
Dare-marvel/Big-Data-Analytics--BDA--
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
Language: Java - Size: 174 MB - Last synced: about 8 hours ago - Pushed: about 10 hours ago - Stars: 1 - Forks: 1
Coursal/Hadoop-Examples
Some simple, kinda introductory projects based on Apache Hadoop to be used as guides in order to make the MapReduce model look less weird or boring.
Language: Java - Size: 340 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 5 - Forks: 2
nielsbasjes/splittablegzip
Splittable Gzip codec for Hadoop
Language: Java - Size: 1.37 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 68 - Forks: 8
amarkum/crunch-demo
crunch demo project
Language: Java - Size: 7.81 KB - Last synced: 11 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
anpu9/MIT6.824-MapReduce
MapReduce Implementation - Distributed System
Language: Go - Size: 21.1 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 0 - Forks: 0
debajyotiguha11/BigDataAssignment_WordCount
Class assignment to understand the MapReduce Programming model in Hadoop.
Language: Java - Size: 5.9 MB - Last synced: 30 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
emrectn/HadoopTutorial
hadoop
Language: Java - Size: 15.6 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
changfubai/hadoop-wordcount
Language: Java - Size: 3.64 MB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
aadishgoel/Hadoop-Codes
Neat and Handy Place for all Hadoop codes
Language: Java - Size: 25.4 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 3
fzehracetin/big-data-project
Big Data Processing and Analytics course term project.
Language: JavaScript - Size: 8.77 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
fbaldi6/PageRank-Hadoop Fork of edofazza/PageRank-Hadoop
Implementation of the MapReduce PageRank algorithm using the Hadoop framework in Java (developed for Cloud Computing course)
Size: 5.35 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0
fbaldi6/PageRank-Spark Fork of edofazza/PageRank-Spark
Implementation of the MapReduce PageRank algorithm using the Spark framework both in Python and in Java (developed for Cloud Computing course)
Size: 4.99 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0
10lloydj/NLP-RDF-Inverted-Index
This Map Reduce program should read in a set of RDF/XML documents and output the data in the form: {object}, [(predicate1, position, subject1)...]
Language: Java - Size: 11.7 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
SomeshChevella/Apache-Hadoop-Map-Reduce--Basic-Sentiment-Analysis-on-Yelp-Dataset
In this project we will use Hadoop MapReduce to implement a very basic “Sentiment Analysis” using the review text in the Yelp Academic Dataset as training data.
Language: Java - Size: 7.39 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
HarshitDawar55/MapReduce
Programs for MapReduce written in java with least complexity!
Language: Java - Size: 76.2 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
Sumonta056/Hadoop-Clustering-Docker-Guide
Hadoop-Clustering-Docker-Guide : A Complete Documentation to setting up Hadoop and try clustering.
Language: Java - Size: 51.5 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
leightonllc/FTEC4005 📦
FTEC4005 - Financial Informatics/ FTEC4003 - Data Mining for FinTech -- This repository contains codes for the bonus task, as well as the group project.
Language: Java - Size: 161 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0
arberkuci/shared-memory-map-reduce
A shared-memory implementation of MapReduce.
Language: Java - Size: 12.7 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
anshul1004/MutualFriends
Implementation of Hadoop and Spark
Language: Java - Size: 23 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0
iulianoroberto/MapReduceApplications
Basic MapReduce applications in Java.
Language: Java - Size: 16.6 KB - Last synced: 6 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
benjdiasaad/MapReduce_K-means
Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).
Language: Java - Size: 32.2 KB - Last synced: 4 days ago - Pushed: about 3 years ago - Stars: 3 - Forks: 2
NikolaAndro/Pagerank_Hadoop_MapReduce
Language: Java - Size: 6.02 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
kushalhebbar/Big-data-project
Optimizing the storage capability of HDFS and HBase through data size factor with integrated security feature
Size: 48.9 MB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
SimoBkr/MapReduceJAVA
JAVA SWING APPLICATION MAPREDUCE
Language: Java - Size: 40 KB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
ucapdak/Olympic-Tweets
Assignment for Big Data Processing: A collection of programs for analysing tweets related to the 2012 Olympics.
Language: Java - Size: 223 KB - Last synced: 9 months ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 0
ucapdak/MapReduce-Spark-Comparison
Assignment for Big Data Processing: Comparison between Spark and MapReduce programs for analysing large data sets.
Language: Java - Size: 651 KB - Last synced: 9 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0
DanMolenhouse/Distributed-Systems-Project5-Hadoop-and-Spark
In this project, we used both Hadoop / MapReduce and Spark to do distributed computing. The first task was to perform a series of operations using a Mapper and Reduce java file that was implemented on a Hadoop server. The second task was to perform similar operations, but on Spark instead.
Language: Java - Size: 70.3 KB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
DA1OOO/Big-Data-Systems-and-Information-Processing
基于Hadoop集群的各类大数据存储、处理。
Language: Java - Size: 107 MB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
backslash112/crystal-ball-hadoop
A crystal ball to predict events that may happen once a certain event happened with MapReduce.
Language: Java - Size: 18.6 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0
RonnJacob/PageRank-MapReduce-Spark
Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.
Language: Java - Size: 442 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 1
ManasaPola/Distributed-Parallel_DB
Distributed and Parallel Database Tasks
Language: Python - Size: 1.46 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
pancr9/Cloud-Computing
The repository consists of Cloud Computing for Data Analysis project and assignments.
Language: Java - Size: 2.91 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
harsh306/Hadoop_Task 📦
Language: Java - Size: 92.8 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 1
a-poliakov/distributed_computing
Language: Java - Size: 15.2 MB - Last synced: 10 months ago - Pushed: almost 1 year ago - Stars: 0 - Forks: 0
Elzawawy/hadoop-word-count
A simple MapReduce and Hadoop application to count words in a document ,implemented in Java to get a flavor for how they work.
Language: Java - Size: 22.5 KB - Last synced: 10 months ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 2
SarahAyaz/YouTube_Data_Analysis
Analysis of YouTube Data using Hadoop Mapreduce framework in Java.
Language: Java - Size: 24.5 MB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 2
jieren123/Bigdata_Project_Recommender_System
Recommender system based on Item Collaborative Filtering and MapReduce
Language: Java - Size: 389 KB - Last synced: 11 months ago - Pushed: over 6 years ago - Stars: 17 - Forks: 3
shashankg32/big_data_lab_nmit_6th_sem
big data lab nmit 6th sem
Language: Java - Size: 11.7 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0
Raveesh1505/BigData-Training
Big data training material
Language: Python - Size: 45.9 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
hiifong/MapReduce-multi-table-merge
MapReduce multi-table merge MapReduce多表合并
Language: Java - Size: 6.84 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0
warrenlyr/K-Nearest-Neighbors-Implementation-in-Parallel-Programming
K-Nearest Neighbors implementation in parallel programming and cloud computing with MPI, MapReduce, Spark, and MASS.
Language: Java - Size: 33.5 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
charliecai00/Tree-Versus-Income
Examining the Relationship Between Tree Quality and Socioeconomic Status in New York City
Language: Java - Size: 32.5 MB - Last synced: 5 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
concealedtea/cardinaalit
cardinality Counter for large .data files
Language: Java - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0
dddddkio/Data-analysis-of-Sogou-query-log
使用hadoop mapreduce对搜狗2008年查询日志进行数据分析
Language: Java - Size: 120 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1
markomih/kmeans_mapreduce
K-means MapReduce implementation
Language: Java - Size: 51 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 39 - Forks: 17
huangyueranbbc/RecommendByItemcf
Hadoop mapreduce. 基于ItemCF的协同过滤 物品推荐系统 Collaborative filtering goods recommendation system based on ItemCF
Language: Java - Size: 498 KB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 20 - Forks: 13
razo7/Nap
Nap: Network-Aware Data Partitions for Efficient Distributed Processing
Language: Mathematica - Size: 186 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
Michu-dev/big-data-first-project
First academic big data project to implement analysis using MapReduce and Hive platform
Language: Java - Size: 109 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
RiccardoSagramoni/map-reduce-bloom-filter 📦
University Project for "Cloud Computing" course (MSc Computer Engineering @ University of Pisa). MapReduce applications implemented in Hadoop and Spark.
Language: Java - Size: 8.86 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
GovardhanR26/webserver-log-analysis
Language: Jupyter Notebook - Size: 1.83 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
jhlfrfufyfn/hadoop-web-robot
Web robot made with Hadoop MapReduce and Java
Language: Java - Size: 3.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
careycwang/CS5425-MapReduce-Common-Words
CS5425 Assignment 1: Top K Common Words
Language: Java - Size: 60.5 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
ling67/Cloud-Computing
Cloud Computing Learning and Project 👩🎓🤦♀️🤷♀️
Language: HTML - Size: 933 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
HxnDev/Hadoop-MapReduce-to-Find-Average-Length-of-Comments
In this task, we had to find the average length of comments given in the dataset. It was done using Hadoop MapReduce and Hadoop HDFS.
Language: Java - Size: 675 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 4 - Forks: 0
Ishuan/Information-Retrieval
Information retrieval (IR) is concerned with finding material (e.g., documents) of an unstructured nature (usually text) in response to an information need (e.g., a query) from large collections. One approach to identify relevant documents is to compute scores based on the matches between terms in the query and terms in the documents. For example, a document with words such as ball, team, score, championship is likely to be about sports. It is helpful to define a weight for each term in a document that can be meaningful for computing such a score. We describe below popular information retrieval metrics such as term frequency, inverse document frequency, and their product, term frequency-inverse document frequency (TF-IDF), that are used to define weights for terms. Term Frequency: Term frequency is the number of times a particular word t occurs in a document d. TF(t, d) = No. of times t appears in document d Since the importance of a word in a document does not necessarily scale linearly with the frequency of its appearance, a common modification is to instead use the logarithm of the raw term frequency. WF(t,d) = 1 + log10 (TF(t,d)) if TF(t,d) > 0, and 0 otherwise We will use this logarithmically scaled term frequency in what follows. Inverse Document Frequency: The inverse document frequency (IDF) is a measure of how common or rare a term is across all documents in the collection. It is the logarithmically scaled fraction of the documents that contain the word, and is obtained by taking the logarithm of the ratio of the total number of documents to the number of documents containing the term. IDF(t) = log10 (Total # of documents / # of documents containing term t) Under this IDF formula, terms appearing in all documents are assumed to be stopwords and subsequently assigned IDF=0. We will use the smoothed version of this formula as follows: IDF(t) = log10 (1 + Total # of documents / # of documents containing term t) Practically, smoothed IDF helps alleviating the out of vocabulary problem (OOV), where it is better to return to the user results rather than nothing even if his query matches every single document in the collection. TF-IDF: Term frequency–inverse document frequency (TF-IDF) is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus of documents. It is often used as a weighting factor in information retrieval and text mining. TF-IDF(t, d) = WF(t,d) * IDF(t)
Language: Java - Size: 378 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
divinenaman/dbscan-mapreduce
DBSCAN implementation on mapreduce
Language: Java - Size: 247 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
HxnDev/Finding-Average-Temperature-of-Each-Year-using-Hadoop-HDFS
In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.
Language: Java - Size: 451 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 5 - Forks: 0
ai1138/Analyzing_Brooklyn
For this project we studied 3 data sets revolving around neighborhoods in New York City. We hope to learn what neighborhoods in Brooklyn are good to live in
Language: HiveQL - Size: 35.2 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 2
jayantakumar/Hadoop-In-Action-Introductory-Patent-Dataset-Analysis
A basic introductory example of hadoops mapreduce libraries to load and analyse large datasets in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents
Language: Java - Size: 28.3 KB - Last synced: 12 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
iamyufan/COMPSCI401-Projects
Personal repo for COMPSCI 401 project 1-3, 22SP@DKU
Language: Jupyter Notebook - Size: 2.17 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0
sim-pez/k_means_distributed
K-Means algorithm for distributed systems
Language: Java - Size: 410 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1
divyam-goel/ML-using-MapReduce-and-Spark
Naive Implementation of Machine Learning Algorithms in distributed frameworks MapReduce and Spark
Language: Scilab - Size: 589 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 0
shanmuga-sudan/Big-Data-Systems
This repo contains all the assignments, project work on Engineering Big Data Systems coursework
Language: C# - Size: 299 MB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0
BorroGG/relation
Processing relational data using mapReduce, hive, pig.
Language: Java - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
BorroGG/cross-correlation
Cross Correlation Algorithm.
Language: Java - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
huangyueranbbc/hadoop05_pagerank
pagerank hadoop
Language: Java - Size: 39.5 MB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 0
aaaastark/Hadoop-Insallation-Commands-WordCount
Hadoop: Installation, Commands and Word Count Example
Language: Java - Size: 4.7 MB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 3
smohammadhejazi/twitter-mapreduce-practice
Applying MapReduce in Java on a Twitter dataset using Apache Hadoop
Language: Java - Size: 39.2 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
ParthKalkar/intro_to_big_data
Introductory Big Data concepts using Spark framework and different libraries
Language: Java - Size: 4.61 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1
tkhan11/Big-Data-Hadoop-Project
Big Data Hadoop framework project for analysis of superstore sales data to find insights.
Language: Java - Size: 5.36 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 1
Lakhan-Nad/MapReduce
A small hadoop map reduce implemented for Big Data Project
Language: Java - Size: 850 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
ArArgon/UESTC-CloudComputing-Experiment
远离远古 Eclipse, 远离上古软件和阴间插件
Language: Java - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
gorkinovich/SGDI
Sistemas de Gestión de Datos y de la Información (UCM, 2015)
Language: Java - Size: 2.74 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
szaher/spark
Playing with Spark using Java
Language: Java - Size: 424 KB - Last synced: about 2 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
HxnDev/Hadoop-MapReduce-to-Analyze-Sentiment-of-Keyword
In this task, we had to write a MapReduce program to analyze the sentiment of a keyword from a list of comments. This was done using Hadoop HDFS.
Language: Java - Size: 1000 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 5 - Forks: 0
ankit08015/Engg-Of-Big-Data
Repository for course INFO7250 - Engineering of Big Data
Language: Java - Size: 308 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 3
sayaliwalke30/Hadoop-Mapreduce
Data analysis on Big Data. Used various databases from 1M to 100M including Movie Lens dataset to perform analysis. Covers basics and advance map reduce using Hadoop.
Size: 4.95 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 3
rachelzhaolp/BigData-HW-MapReduce
Solutions of some MapReduce Problem
Language: Java - Size: 133 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1
SinghHarshita/MapReduce-Examples
Word co-occurrence and Matrix Multiplication using MapReduce
Language: Java - Size: 11.4 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0
ktothep/MapReduce
Implementation of MapReduce programs other than word count
Language: Java - Size: 64 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
abhishekmsharma/big-data-electricity-consumption-analysis-apache-spark
Developed for analysing and visualizing trends related to electricity and energy consumption
Language: Java - Size: 145 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 1
Super-Special-Pookies/PhraseExtract
Hadoop MapReduce Assignment: Distributed Phrase Extraction(Unregistered Word Discovery)
Language: Java - Size: 40.4 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
jaskier07/Hadoop-lab
Solving simple tasks with Apache Hadoop.
Language: Java - Size: 32.2 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
DavideAG/BigData
Spark, RDDs and Map Reduce applications related to the BigData @Polito course (2019-2020). A set of personal notes are already provided.
Language: Java - Size: 5.7 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
ayenpure/StockMeUp
This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.
Language: Java - Size: 17.6 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
anevsky/bigdata-101
Big Data ramp-up
Language: Java - Size: 6.48 MB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 0 - Forks: 1
berksudan/Analysis-on-Big-Data-with-Hadoop
Implementation of Statistical Methods via Hadoop Map-Reduce Library.
Language: Java - Size: 75.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0
lurifn/mapreduce-ep3-client
Um sistema que permite a um programa cliente requisitar, a uma arquitetura Map-Reduce, a criação de um índice invertido de links (semelhante a uma das atividades do PageRank do Google)
Language: Java - Size: 13.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1
RolandTaverner/hadoop_tutorial
Language: Java - Size: 2.85 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
Arko98/Product_Purchase_Prediction
Prediction of purchase of Bank Product using Map Reduce Naive Bayes
Language: Java - Size: 35.2 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0
ziyaddhuka/Airine-data-analysis
Big Data project on Airline dataset
Language: Java - Size: 309 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0
Tabed23/Map_Reduce_WordCount
Hadoop Map Reduce Word count example
Language: Java - Size: 5.86 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0
varshinireddyt/Big-Data-Cloud-computing
Class Projects related to big data, spark, Hadoop, Pig, Hive
Language: Java - Size: 690 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
ashishgopalhattimare/Parallel-Concurrent-and-Distributed-Programming-in-Java
Parallel, Concurrent, and Distributed Programming in Java | Coursera
Language: Java - Size: 34.5 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
reethified/techsquids-code-examples
Language: Java - Size: 80.1 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
foolfun/SomeMapReduceCases
mapreduce案例和大数据入门笔记
Language: Java - Size: 53.7 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
soyelherein/BigData
Here we will try to solve very priliminary Bigdata problems using java, which is suitable for beginners or college project
Language: Java - Size: 432 KB - Last synced: 12 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0
Ishuan/Page-Rank-Implementation
The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of this assignment are to: 1. Understand the PageRank algorithm and how it works in MapReduce. 2. Implement PageRank and execute it on a large corpus of data. 3. Examine the output from running PageRank on Simple English Wikipedia to measure the relative importance of pages in the corpus. To run your program on the full Simple English Wikipedia archive, you will need to run it on the dsba-hadoop cluster to which you have access.
Language: Java - Size: 36.1 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0
philomathic-guy/Friend-recommendation-using-movie-data
Language: Java - Size: 756 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 2 - Forks: 0
ShreeshaN/BigDataTutorials
Hadoop MapReduce jobs, Pig Queries
Language: Java - Size: 514 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0