An open API service providing repository metadata for many open source software ecosystems.

Topic: "sparkr"

awesome-spark/awesome-spark

A curated list of awesome Apache Spark packages and resources.

Language: Shell - Size: 231 KB - Last synced at: about 2 hours ago - Pushed at: 6 months ago - Stars: 1,793 - Forks: 338

cluster-apps-on-docker/spark-standalone-cluster-on-docker

Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:

Language: Jupyter Notebook - Size: 419 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 381 - Forks: 181

jadianes/spark-r-notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 13 days ago - Pushed at: over 7 years ago - Stars: 121 - Forks: 71

microsoft/A-TALE-OF-THREE-CITIES

Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.

Language: R - Size: 21.8 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 86 - Forks: 34

awesome-spark/learn-by-examples 📦

Real-world Spark pipelines examples

Language: Scala - Size: 1.1 MB - Last synced at: about 2 hours ago - Pushed at: about 7 years ago - Stars: 83 - Forks: 30

tomaztk/Azure-Databricks

Azure Databricks - Advent of 2020 Blogposts

Language: Jupyter Notebook - Size: 44.9 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 49

manuparra/taller_SparkR

Taller SparkR para las Jornadas de Usuarios de R

Language: HTML - Size: 263 KB - Last synced at: 26 days ago - Pushed at: over 8 years ago - Stars: 12 - Forks: 18

manuparra/MasterDatCom_BDCC_Practice

Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R

Size: 43.9 KB - Last synced at: 23 days ago - Pushed at: about 8 years ago - Stars: 11 - Forks: 3

zero323/dlt

Mirror of https://gitlab.com/zero323/dlt

Language: R - Size: 904 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

RSummerSchool/R-for-HPC-and-big-data

Slides and lab material for the talk R for HPC and big data at http://rsummer.data-analysis.at

Size: 3.68 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 4

cosmincatalin/cubist-regression

Fit a Cubist regression model on StackOverflow data and make predictions in a distributed manner with SparkR

Language: R - Size: 27.2 MB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

spark-in-a-box/sparkr-build-sandbox

Docker images for testing SparkR builds

Language: Python - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

jaehyeon-kim/sparkr-demo

SparkR Demo

Language: HTML - Size: 12.7 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

manuparra/taller-bigdata-con-r

Taller Big Data con Apache Spark + R desde Databricks cloud

Size: 15.9 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

duttashi/cheatsheets 📦

A curated list of essential cheatsheets for data analysis, visualization and machine learning using R or Python

Size: 9.27 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

ukdataservice/bdas2017

Course material for the "Encounters with Big Data" course delivered by the UK Data Service at the 2017 Big Data and Analytics Summer School.

Language: R - Size: 26.1 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 7

containalytics/containalytics

"Cloud container data analytics, statistical modeling, and machine learning on distributed databases". "A free opensource alternative to SPSS, SAS, MATLAB, PowerBI, Tableau and Alteryx". Runs on Linux, Windows, MacOS, and in the cloud via containers.

Last synced at: over 2 years ago - Stars: 2 - Forks: 0

zero323/dlt

Delta Lake interface for SparkR https://dlt.zero323.net/

Last synced at: over 2 years ago - Stars: 2 - Forks: 0

Soumyadipta2020/SparkR_test

Sample Codes of Spark using R programming

Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

slothkong/r_on_gcloud

R workloads running at scale on Google Cloud

Language: R - Size: 417 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

kiendang/sparkr-naivebayes-example

Language: R - Size: 230 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 1

Anas399/SPARK_CLUSTER_DOCKER

Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations

Size: 1000 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

konhay/self-service-modeler

Self-service modeling analysis tool based on R language and big data. It integrates SparkR, Rserve, and Mlib machine learning libraries

Language: R - Size: 8.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

d4rthm4ul/R-Cleaning-Exploration-Imputation-Visualization

This repository you are browsing contains intermediate level piece of codes which are useful for cleaning, exploratory analysis, handling of missing data points, outlier detection and different visualization techniques using graphics, ggplot2, tidycharts, ggExtra packages. Also in particular part of the script you can get basic information about SparkR package which is an R package that provides a light-weight frontend to use Apache Spark from R . Do not be shy to fork and make contribute.

Language: R - Size: 116 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

gomezportillo/sparkR-hadoop

Processing massive datasets in Hadoop and SparkR

Language: R - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

MatthiasDE/spark_standalone_docker

Multiple-Node Standalone Spark with R and Python

Language: R - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

TIME-GATE/r-spark-service

用r、spark做的一些统计分析、机器学习实例,待传

Language: R - Size: 53.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

jaehyeon-kim/rocker-extra 📦

Extra docker images from rocker/tidyverse

Language: Shell - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

ashish-kamboj/BigData-Analytics

Data analysis and Model building on large datasets using Hive and Spark

Language: R - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

irfanalidv/Applied_Machine_Learning_Apache_Spark

Apache® Spark™ for Machine Learning and Data Science

Language: Jupyter Notebook - Size: 261 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1