An open API service providing repository metadata for many open source software ecosystems.

Topic: "spark-ml"

wzhe06/SparkCTR

CTR prediction model based on spark(LR, GBDT, DNN)

Language: Scala - Size: 35 MB - Last synced at: 13 days ago - Pushed at: about 5 years ago - Stars: 912 - Forks: 260

qubole/sparklens

Qubole Sparklens tool for performance tuning Apache Spark

Language: Scala - Size: 175 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 574 - Forks: 141

lifeomic/sparkflow

Easy to use library to bring Tensorflow on Apache Spark

Language: Python - Size: 8.79 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 296 - Forks: 45

locationtech/rasterframes

Geospatial Raster support for Spark DataFrames

Language: Jupyter Notebook - Size: 102 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 251 - Forks: 45

titicaca/spark-iforest

Isolation Forest on Spark

Language: Scala - Size: 74.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 221 - Forks: 91

CognonicLabs/awesome-AI-kubernetes

:snowflake: :whale: Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc

Size: 326 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 129 - Forks: 45

databrickslabs/geoscan

Geospatial clustering at massive scale

Language: Scala - Size: 2.44 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 99 - Forks: 19

titicaca/spark-gbtlr

Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark

Language: Scala - Size: 520 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 89 - Forks: 27

xiaogp/customer_churn_prediction

零售电商客户流失模型,基于tensorflow,xgboost4j-spark,spark-ml实现LR,FM,GBDT,RF,进行模型效果对比,离线/在线部署方式总结

Language: Python - Size: 205 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 58 - Forks: 17

dimajix/spark-training

Repository used for Spark Trainings

Language: Jupyter Notebook - Size: 9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 53 - Forks: 66

typesafehub/fdp-modelserver

An umbrella project for multiple implementations of model serving

Language: Scala - Size: 163 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 46 - Forks: 20

aipredict/ai-models-serialization

AI模型序列化总结

Size: 111 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 45 - Forks: 7

jubins/Spark-And-MLlib-Projects

This repository contains Spark, MLlib, PySpark and Dataframes projects

Language: Jupyter Notebook - Size: 101 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 39 - Forks: 97

feng-li/dlsa

Distributed least squares approximation (dlsa) implemented with Apache Spark

Language: Python - Size: 276 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 46

isarn/isarn-sketches-spark

Routines and data structures for using isarn-sketches idiomatically in Apache Spark

Language: Scala - Size: 1.33 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 29 - Forks: 12

roshankoirala/pySpark_tutorial

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

Language: Jupyter Notebook - Size: 202 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 26

uosdmlab/nsmc-zeppelin-notebook

Movie review dataset Word2Vec & sentiment classification Zeppelin notebook

Size: 107 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 26 - Forks: 6

lynnlangit/Spark-Scala-EKS

Spark Scala docker container sample for AWS testing - EKS & S3

Language: HCL - Size: 5.77 MB - Last synced at: about 24 hours ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 13

pierrenodet/spark-ensemble

Ensemble Learning for Apache Spark 🌲

Language: Scala - Size: 3.61 MB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 23 - Forks: 7

mahmoudparsian/machine-learning-course

Machine Learning Course @ Santa Clara University

Size: 194 MB - Last synced at: 18 days ago - Pushed at: almost 5 years ago - Stars: 23 - Forks: 16

autodeployai/pmml4s-spark

PMML scoring library for Spark as SparkML Transformer

Language: Scala - Size: 50.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 8

s22s/pre-lt-raster-frames 📦

Spark DataFrames for earth observation data

Language: Scala - Size: 16.9 MB - Last synced at: 2 days ago - Pushed at: almost 7 years ago - Stars: 19 - Forks: 5

dvgodoy/YelpDatasetChallenge

Restaurant recommendations and review text-based quality predictions

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 16 days ago - Pushed at: about 8 years ago - Stars: 19 - Forks: 12

FlorentF9/sparkml-som

:sparkles: Spark ML implementation of SOM algorithm (Kohonen self-organizing map)

Language: Scala - Size: 29.3 KB - Last synced at: 26 days ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 6

JohnSnowLabs/spark-nlp-streamlit 📦

Spark NLP for Streamlit

Language: Python - Size: 8.7 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 12

akashsethi24/Machine-Learning

Examples of all Machine Learning Algorithm in Apache Spark

Language: Scala - Size: 3.64 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 10

lp-dataninja/SparkML

Detailed notes and code to learn machine learning with Apache Spark.

Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 17

Rohini2505/Lending-Club-Loan-Analysis

Explanatory Data Analysis and ML model building using Apache Spark and PySpark

Language: HTML - Size: 6.26 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 10 - Forks: 12

QuanLab/node2vec-spark

Implement node2vec algorithm using Spark 2 from: http://snap.stanford.edu/node2vec/

Language: Scala - Size: 196 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 10 - Forks: 5

josemarialuna/ExternalValidity

This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.

Language: Scala - Size: 146 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 1

BlancRay/PUAdapter 📦

A PU-learning tool on spark

Language: Scala - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

datastaxdevs/workshop-introduction-to-machine-learning

Come ready to discover the goals and approaches of machine learning, and how to build effective algorithms and solutions!

Language: Jupyter Notebook - Size: 17.2 MB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 6

dvgodoy/DSR-Spark-AppliedML

DSR Class - Applied Machine Learning with Apache Spark

Language: Jupyter Notebook - Size: 26.2 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 3

DavideNardone/TwitterSentimentAnalysis

A Spark Streaming implementation for Online Twitter Sentiment Analysis.

Language: Python - Size: 1.78 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 3

maziyarpanahi/spark2-template

Intellij template to develop Apache Spark 2.x applications

Language: Scala - Size: 20.5 KB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 41

NashTech-Labs/Sparkathon

A library having Java and Scala examples for Spark 2.x

Language: Java - Size: 113 MB - Last synced at: 20 days ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 9

IBM/sms-spam-filter-using-hortonworks 📦

Build Spam Filter Model on HDP using Watson Studio Local

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 4 months ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 10

lykmapipo/Python-Spark-Log-Analysis

Python scripts to process, and analyze log files using PySpark.

Language: Python - Size: 131 KB - Last synced at: about 14 hours ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

dongkelun/spark-scala

Spark Scala代码,包括个人博客全部代码,以及平时学习测试代码

Language: Scala - Size: 316 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 4

pierrenodet/spark-smile 📦

Integrating SMILE and Spark

Language: Scala - Size: 341 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

alivcor/SMORK

Implementation of SMOTE - Synthetic Minority Over-sampling Technique in SparkML / MLLib

Language: Scala - Size: 165 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

giovannigarifo/bigdata

Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark

Language: Java - Size: 69.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

zhangruipython/ETLPlatform

多数据源,大规模数据提取转换加载

Language: Java - Size: 78.1 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MHassaanButt/Crime-Spark-ML

In this project I stream data and do crime classification using Spark. This dataset contains incidents derived from the SFPD Crime Incident Reporting system. The data ranges from 1/1/2003 to 5/13/2015. I do some data analysis of crime scenes in different areas and with respect to other parameters.

Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

theGuyWithBlackTie/electricChargingStations

Language: Jupyter Notebook - Size: 8.68 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

shutterstock/spark-phrases

phrase detection using Google's Word2phrase

Language: Scala - Size: 4.88 KB - Last synced at: 13 days ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

siddsax/D-Driven-IS

Instance Selection for Big Data

Language: Python - Size: 729 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

desaiankitb/spark-mllib

Apache Spark is one of the most widely used and supported open-source tools for machine learning and big data. In this repo, discover how to work with this powerful platform for machine learning. This repo discusses MLlib—the Spark machine learning library—which provides tools for data scientists and analysts who would rather find solutions to business problems than code, test, and maintain their own machine learning libraries. Repo shows how to use DataFrames to organize data structure, and covers data preparation and the most commonly used types of machine learning algorithms: clustering, classification, regression, and recommendations. You will have experience loading data into Spark, preprocessing data as needed to apply MLlib algorithms, and applying those algorithms to a variety of machine learning problems.

Language: Python - Size: 150 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 5

felidsche/mail-spam-filter

An email spam filter using Apache Spark’s ML library

Language: Python - Size: 212 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 1

mnassrib/pyspark-examples

This tutorial presents some examples in order to give a quick overview of the Spark APIs.

Language: Jupyter Notebook - Size: 8.48 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

mnassrib/categorical-data-python

A simple demo repository to show how to handling categorical data in python

Language: Jupyter Notebook - Size: 2.88 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

sirCamp/spark-pspectrum

P-spectrum embedding and sequence relaxation for NLP in Spark

Language: Scala - Size: 87.9 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

LuisFalva/ophelia

Ophelian On Mars! More than a simple framework.

Language: Python - Size: 2.16 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 5

vasnake/spark.ml.SpatialJoinTransformer

spark.ml.transformer: join two datasets using spatial relations

Language: Scala - Size: 120 KB - Last synced at: 17 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

JBris/docker-spark-sparklyr

Docker setup for Apache Spark and the R sparklyr package

Language: Dockerfile - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

q-maze/sparkHospital

Project to create models to predict ICU patient mortality based on demographic, diagnostic, and other factors utilizing Apache Spark.

Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 29 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

seahrh/bad-renter

Working examples of Spark ML Pipeline and SMOTE algorithm for synthetic data augmentation

Language: Scala - Size: 59.6 KB - Last synced at: 28 days ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

adityajn105/Apache-Spark-Tutorials

Apache spark is a big data analysis framework.

Language: Jupyter Notebook - Size: 789 KB - Last synced at: 19 days ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 5

multivacplatform/multivac-ml

Pre-trained ML models for Apache Spark

Language: Scala - Size: 947 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

Hydrospheredata/fastserving

Spark ML Lib serving library

Language: Scala - Size: 77.1 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

joyoyoyoyoyo/emojipasta-topic-modeling

😅 A topic model of reddit.com/r/EmojiPasta trained with Spark and an LDA model (NSFW) - Trigger Warning: The r/emojipasta subreddit posts controversial content and anything I have crawled is to provide visibility of a topic modeling some of this controversial content. Unfortunately there is also discriminatory speech which must be called out!

Language: Scala - Size: 700 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

zhaoyongchuang/spark-ml

study spark ml

Language: Scala - Size: 208 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

AbdelmajidLh/Spark_ML_Weather

Projet d'apprentissage Scala et Spark : Prédire la pluie de demain avec des données historiques

Language: Scala - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

John-CYHui/PySpark-Code

Code for PySpark Tutorial

Language: Python - Size: 38.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

giabaohb48/ReviewProduct

evaluate product on Shopee by comment

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

aabdel-kader/Apache-Spark

A repository for my practices and projects using pyspark

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

szaher/spark

Playing with Spark using Java

Language: Java - Size: 424 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

angeligareta/machine-learning-spark

Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.

Language: Scala - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

mnassrib/bankmarketing-sparkml-databricks

This tutorial analyses a binary classification example based on Spark ML applied with Python language programming and running a databricks cloud community edition cluster.

Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

anmolmore/Enzyme-Classifier-Using-ML

Classify enzymes with geomic sequence using spark-ML

Language: Jupyter Notebook - Size: 719 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

yukia3e/learning-spark-3

Language: Python - Size: 289 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Dragon1573/Online-Works 📦

泰迪在线实习备份

Language: Scala - Size: 2.24 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

maheshg23/Life-Expectancy-Predictor-BigDataProject

Life Expectancy Predictor - Big Data final course project using Big Data Technologies.

Language: Scala - Size: 376 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

Evegen55/mastering-spark

mastering spark

Language: Java - Size: 1.38 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

hypnosapos/sparknetes

Spark on Kubernetes PoCs

Language: Makefile - Size: 1.12 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

hsuanhauliu/yelp-recommendation-system

Yelp recommendation system using collaborative-filtering algorithms.

Language: Python - Size: 20.2 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

corneliouzbett/Master-Apache-Spark

Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming

Language: Python - Size: 889 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

hasitha087/NPSClassification

This is pyspark based NPS(Net Promote Score) text classification model developed using Naive Bayes Classifier.

Language: Python - Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

multivacplatform/multivac-nlp

Testing and benchmarking some of the existing NLP libraries in Apache Spark

Language: Scala - Size: 12 MB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

AndrewKuzmin/spark-ml-pipelines-with-structured-streaming-examples

Examples of using Apache Spark MLlib Pipelines and Structured Streaming on version 2.4.0

Language: Shell - Size: 1020 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

maikeffi/spark_ml_boston

Loading a CSV from hadoop / hive and perform ml functions

Language: Jupyter Notebook - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

necosta/forecast-x

Tennis data exploration and forecasting

Language: Scala - Size: 40 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Jayasagar/sparkml-regression-models-movie-revenue-predictions

TMDb movie dataset revenue predictions

Language: Jupyter Notebook - Size: 277 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

mkuthan/example-spark-ml

Machine learning, feature engineering examples using Spark ML

Language: Jupyter Notebook - Size: 129 KB - Last synced at: 14 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

EmmaMuhleman1/emmamuhlemantest1.github.io

Language: HTML - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

krobee/Stock-Price-Predictor

A system that can predict stock price based on Twitter data

Language: Scala - Size: 897 KB - Last synced at: 11 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

sueszli/sparkly-svm

distributed training of a SVM with sparkML

Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hazecodeio/spark-sandbox

Language: Scala - Size: 13.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

glimmerphoenix/dataeng_book

Libro Fundamentos de Ingeniería de Datos

Language: TeX - Size: 634 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AdrielC/skynet

Machine Learning (MLeap) Model Serving application for Scala

Language: Scala - Size: 150 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rahult18/NYC-Yellow-Taxi-Trip-Data-Pipeline

This is an end-to-end data pipeline that processes and analyzes NYC Yellow Taxi trip data. It includes data ingestion, cleaning, feature engineering, machine learning model training, and a REST API for fare amount prediction.

Language: Jupyter Notebook - Size: 81.8 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cmpt-732/Amazon_Product_Analysis

Amazon Product analysis

Language: Python - Size: 5.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lediau/bigdata-data-engineering-ai-masters

Language: Python - Size: 122 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Siddharth1989/ProspectiveTopUpCustomerPrediction

Developed a model/Spark ML pipeline stream to identify potential customers that may purchase top up services in the future.

Language: Jupyter Notebook - Size: 6.17 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Amir79Naziri/TwitterSentimentAnalysisWithSpark_Project

A sentiment analyzer using Spark ML library for Twitter Dataset

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pmb-7684/IBM-Data-Engineering-Professional-Certificate

Learning materials, assignments, and helpful resources for professional certification. Expected Completion June 2023

Size: 9.77 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Jayveersinh-Raj/trip_duration_big_data

Taxi trip duration forecasting using Big data and spark ML

Language: Jupyter Notebook - Size: 203 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

chandni-s/BigDataSystems

Big Data Management Systems & Tools

Language: HTML - Size: 781 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

yyqcs/spark-example

spark2.4.x common examples ,using scala

Language: Scala - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

avisionary/reddit-comments-analysis

Big data project to analyze (Subreddit : NoStupidQuestions) comments

Language: HTML - Size: 5.68 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0