An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: sparkify

pratikwatwani/ETL-pipeline-for-Sparkify

An ETL pipeline built with PostgreSQL for the Sparkify database 🗄. It models user activity data to create a database and ETL pipeline 🔀 for a music streaming app 🎼.

Language: Jupyter Notebook - Size: 677 KB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

alessiococchieri/BDA-project-sparkify

This Git repo showcases my analysis of the Sparkify dataset with PySpark on Apache Spark in cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.

Language: Jupyter Notebook - Size: 4.11 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

SimplifyData/Cloud-Data-Warehouse-with-Redshift-AWS

Cloud data warehouse for Sparkify data using Amazon Redshift.

Language: Python - Size: 1.2 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

brunowdev/sparkify

This is the final project for the Data Scientist Nanodegree, where our goal is to predict churn for a fictional streaming service called Sparkify.

Language: HTML - Size: 6.33 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Mcamin/User-Churn-Prediction

Data analysis in Spark to identify customer churn for a fictional music service.

Language: Jupyter Notebook - Size: 254 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0