Topic: "sparkr"
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
Language: Shell - Size: 231 KB - Last synced at: about 2 hours ago - Pushed at: 6 months ago - Stars: 1,793 - Forks: 338

cluster-apps-on-docker/spark-standalone-cluster-on-docker
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
Language: Jupyter Notebook - Size: 419 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 381 - Forks: 181

jadianes/spark-r-notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 13 days ago - Pushed at: over 7 years ago - Stars: 121 - Forks: 71

microsoft/A-TALE-OF-THREE-CITIES
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
Language: R - Size: 21.8 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 86 - Forks: 34

awesome-spark/learn-by-examples 📦
Real-world Spark pipelines examples
Language: Scala - Size: 1.1 MB - Last synced at: about 2 hours ago - Pushed at: about 7 years ago - Stars: 83 - Forks: 30

tomaztk/Azure-Databricks
Azure Databricks - Advent of 2020 Blogposts
Language: Jupyter Notebook - Size: 44.9 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 49

manuparra/taller_SparkR
Taller SparkR para las Jornadas de Usuarios de R
Language: HTML - Size: 263 KB - Last synced at: 26 days ago - Pushed at: over 8 years ago - Stars: 12 - Forks: 18

manuparra/MasterDatCom_BDCC_Practice
Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R
Size: 43.9 KB - Last synced at: 23 days ago - Pushed at: about 8 years ago - Stars: 11 - Forks: 3

zero323/dlt
Mirror of https://gitlab.com/zero323/dlt
Language: R - Size: 904 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

RSummerSchool/R-for-HPC-and-big-data
Slides and lab material for the talk R for HPC and big data at http://rsummer.data-analysis.at
Size: 3.68 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 4

cosmincatalin/cubist-regression
Fit a Cubist regression model on StackOverflow data and make predictions in a distributed manner with SparkR
Language: R - Size: 27.2 MB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

spark-in-a-box/sparkr-build-sandbox
Docker images for testing SparkR builds
Language: Python - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

jaehyeon-kim/sparkr-demo
SparkR Demo
Language: HTML - Size: 12.7 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

manuparra/taller-bigdata-con-r
Taller Big Data con Apache Spark + R desde Databricks cloud
Size: 15.9 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

duttashi/cheatsheets 📦
A curated list of essential cheatsheets for data analysis, visualization and machine learning using R or Python
Size: 9.27 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

ukdataservice/bdas2017
Course material for the "Encounters with Big Data" course delivered by the UK Data Service at the 2017 Big Data and Analytics Summer School.
Language: R - Size: 26.1 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 7

containalytics/containalytics
"Cloud container data analytics, statistical modeling, and machine learning on distributed databases". "A free opensource alternative to SPSS, SAS, MATLAB, PowerBI, Tableau and Alteryx". Runs on Linux, Windows, MacOS, and in the cloud via containers.
Last synced at: over 2 years ago - Stars: 2 - Forks: 0

zero323/dlt
Delta Lake interface for SparkR https://dlt.zero323.net/
Last synced at: over 2 years ago - Stars: 2 - Forks: 0
Soumyadipta2020/SparkR_test
Sample Codes of Spark using R programming
Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

slothkong/r_on_gcloud
R workloads running at scale on Google Cloud
Language: R - Size: 417 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

kiendang/sparkr-naivebayes-example
Language: R - Size: 230 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 1

Anas399/SPARK_CLUSTER_DOCKER
Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations
Size: 1000 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

konhay/self-service-modeler
Self-service modeling analysis tool based on R language and big data. It integrates SparkR, Rserve, and Mlib machine learning libraries
Language: R - Size: 8.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

d4rthm4ul/R-Cleaning-Exploration-Imputation-Visualization
This repository you are browsing contains intermediate level piece of codes which are useful for cleaning, exploratory analysis, handling of missing data points, outlier detection and different visualization techniques using graphics, ggplot2, tidycharts, ggExtra packages. Also in particular part of the script you can get basic information about SparkR package which is an R package that provides a light-weight frontend to use Apache Spark from R . Do not be shy to fork and make contribute.
Language: R - Size: 116 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

gomezportillo/sparkR-hadoop
Processing massive datasets in Hadoop and SparkR
Language: R - Size: 1.38 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

MatthiasDE/spark_standalone_docker
Multiple-Node Standalone Spark with R and Python
Language: R - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

TIME-GATE/r-spark-service
用r、spark做的一些统计分析、机器学习实例,待传
Language: R - Size: 53.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

jaehyeon-kim/rocker-extra 📦
Extra docker images from rocker/tidyverse
Language: Shell - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

ashish-kamboj/BigData-Analytics
Data analysis and Model building on large datasets using Hive and Spark
Language: R - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

irfanalidv/Applied_Machine_Learning_Apache_Spark
Apache® Spark™ for Machine Learning and Data Science
Language: Jupyter Notebook - Size: 261 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1
