An open API service providing repository metadata for many open source software ecosystems.

Topic: "hadoop-streaming"

monisjaved/Data-Processing-With-Hadoop

Text Processing Using Hadoop

Language: Jupyter Notebook - Size: 21 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 2

LMAPcoder/Hadoop-on-Colab

Installation and configuration of Hadoop on Google Colaboratory

Language: Jupyter Notebook - Size: 620 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 5

adrian83/go-hadoop-streaming

Hadoop Streaming example written in Go

Language: Go - Size: 1.24 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 0

peloyeje/map543-dijkstra-mapreduce-spark

[MAP543] Hadoop Streaming (MapReduce) and Spark implementations of the Dijkstra shortest path algorithm

Language: Jupyter Notebook - Size: 23.4 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 4

Lapis-Hong/ctrcount

python map reduce statics

Language: Python - Size: 3.58 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 0

thedatasociety/lab-hadoop

Language: PLpgSQL - Size: 4.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 7

sreetamparida/Hiraishin

A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.

Language: Python - Size: 194 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

jomarsilio/Bootcamp-IGTI-Analista-de-Dados

Bootcamp ministrado pela IGTI com o objetivo de abordar de forma intensiva conceitos e práticas da análise de dados, habilitando o aluno para atuar profissionalmente na área.

Language: Jupyter Notebook - Size: 127 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

highoncarbs/hadoopwithpy

:elephant: :heavy_plus_sign: :snake: Learning Hadoop with Python

Language: Python - Size: 86.6 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

MandarGogate/Association-Rule-Mining-Hadoop-Python

A case study on mining association rules between different factors related to deaths of people in the United States

Language: Python - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 2

krishnadey30/Intro-to-Hadoop-and-MapReduce

Language: Python - Size: 6.54 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

GINK03/aws-emr-streaming-templates

AWS Elastic Map Reduce Streaming Templates

Language: Python - Size: 9.72 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

KeerthanaJ-rec/210701118-CS19P16-DA-Lab

Data Analytics Laboratory

Language: R - Size: 23.1 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

Jacob12138xieyuan/hadoop-mapreduce-with-python

hadoop mapreduce algorithm with hadoop streaming (Python)

Language: Jupyter Notebook - Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

simple-learning/Hadoop

Hadoop Projects

Language: Java - Size: 28.7 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

BurraAbhishek/Python_Hadoop_MapReduce_MarketBasketAnalysis

Market Basket Analysis using Hadoop MapReduce in Python

Language: Python - Size: 103 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

vbugaevskii/hadoop-streaming-protoseq

A small library example how to work with binary files with Hadoop Streaming.

Language: Java - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

6vedant/TwitterAnalyticsHadoop

Twitter Streaming Analytics Project (Big Data Analysis using Hadoop)

Language: Java - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

tertiarycourses/ApacheHadoop

Exercise files for Apache Hadoop Big Data Training

Size: 63.5 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

yanglgtm/hadoop-skeleton

A hadoop skeleton streaming script

Language: Shell - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

niz-ka/pbd-project

Repository to the needs of Big Data course at university

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

YMaher99/Parallelizing-the-Feedforward-Operation-of-Neural-Networks-in-Hadoop-MapReduce

Leveraging the mapreduce paradigm we propose a solution to parallelize the feedforward operation of neural networks in order to speed it up for sufficiently large NN architectures and for sufficiently large datasets. Tested Using the MNIST dataset results can be found in the results.html and results.ipynb files.

Language: HTML - Size: 2.1 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

shreyasnagare/GCP-Dataproc-Hadoop-MapReduce

A Hadoop MapReduce application to find the maximum temperature in every day of the years 1901 and 1902 from the NCDC weather records.

Language: Python - Size: 11.2 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KhanShaheb34/MapReduce

Learning Hadoop MapReduce Using Python

Language: Python - Size: 54.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Sidl419/hadoop_streaming

Построение рекомендательной системы на основе алгоритма коллаборативной фильтрации и технологии Hadoop Streaming

Language: Python - Size: 1.94 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

AleksaMCode/odabrana-poglavlja-iz-operativnih-sistema

Rjesenje rokova iz predmeta Odabrana poglavlja iz operativnih sistema na Elektrotehničkom fakultetu u Banjoj Luci.

Language: C# - Size: 41 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

aditeyabaral/mapreduce-word2vec

Implementation of Word2Vec for large datasets as a Map-Reduce Job using Hadoop Streaming.

Language: Python - Size: 1.45 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

crazypegasusvv/Mutations 📦

Mutations

Size: 1.56 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

NilufaYeasmin/MapReduce

This repo contains implementations of Mapreduce program in a large text corpus with Apache Hadoop Environment | Nilufa Yeasmin | https://www.linkedin.com/in/nilufayeasmin/

Language: CSS - Size: 3.53 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

antoinewg/ocr-page-rank

PageRank algorithm using Hadoop Streaming

Language: Python - Size: 438 KB - Last synced at: 16 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

antoinewg/ocr-tfidf

TF-IDF with Hadoop Streaming

Language: Python - Size: 64.5 KB - Last synced at: 16 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

bug-data/Big_Data_First_Project

First project for Big Data course held at Roma Tre University

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

ggodreau/huhdewp

Hadoop streaming EMR job

Language: Python - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

amitthere/clustering-algorithms

K-Means, Hierarchical Agglomerative, Density based and Map Reduce K-Means Clustering implemented on 2 Gene Datasets in Python

Language: Python - Size: 81.1 KB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

HarigovindV10/NYC-Subway-Data-Analysis

An analysis of NYC Subway Data using Hadoop Map Reduce

Language: Jupyter Notebook - Size: 529 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

SakhriHoussem/MapReduce-Python

MapReduce Python Example

Language: Python - Size: 20.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 4

ahmedhumza94/udacity-intro-to-hadoop-and-mapreduce

Repository containing python code for MapReduce jobs to answer questions about Udacity forum data.

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

rmcnew/Wiki_Edit_MapReduce

Simple MapReduce code for use with Hadoop for Distributed Systems graduate school class

Language: Java - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

dishaumarwani/Parking_Data_Analysis

Parking Data Analysis in Hadoop MapReduce Framework

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

amcquade/TweetleStreams

Map and Reduce algorithm on a pool of tweets.

Language: C++ - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

imdeepanshugpt/Hadoop

Hadoop-Cluster

Language: Python - Size: 887 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

E7su/rhyhorn

easy examples with Hadoop's Java API & Hadoop Streaming

Language: Java - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

avikdatta/hadoop_streaming_python_script

A repository for hadoop streaming python scripts

Language: Shell - Size: 23.4 KB - Last synced at: 6 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

penguin138/hadoop-tasks

Hadoop tasks repository for Parallel and Distributed Computing course at MIPT 2015

Language: Java - Size: 237 KB - Last synced at: 5 months ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0