An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: sparkml-pipelines

lykmapipo/Python-Spark-Log-Analysis

Python scripts to process and analyze log files using PySpark.

Language: Python - Size: 131 KB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 0

Pirata-Codex/Sentiment-Analysis-SparkML

Uses SparkML to build several machine learning models, simulating big data management at a small scale.

Language: Jupyter Notebook - Size: 172 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

khaz-dev/Airfoil_ML_Pipeline

This project builds a machine learning pipeline using SparkML in a Jupyter Notebook.

Language: Jupyter Notebook - Size: 96.7 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dlleonardo/spark-de-ml-assignments

Spark DE&ML assignments from the "Data Engineering and Machine Learning with Spark" course (offered by IBM Skills Network)

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ph2017001/FuzzyMatch_Spark

Fuzzy-match a query set against a reference set using Spark.

Language: Python - Size: 99.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

NijatZeynalov/Apache-SparkML-Pipelines

In this notebook I'll use the HMP dataset and perform some basic operations using the Apache SparkML Pipeline component. The dataset is a public collection of labelled accelerometer recordings for creating and validating acceleration models of human motion primitives.

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0