An open API service providing repository metadata for many open source software ecosystems.

GitHub / TahirZia-1 / EDA-Netflix-Dataset-using-PySpark-on-Docker

This project demonstrates how to perform Exploratory Data Analysis (EDA) on the Netflix dataset using PySpark in a Jupyter Notebook environment. It involves setting up Spark, loading a dataset, performing basic data cleaning, and visualizing the results. All of it is runnning on a container in Docker.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TahirZia-1%2FEDA-Netflix-Dataset-using-PySpark-on-Docker

Stars: 0
Forks: 0
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 1.75 MB
Dependencies parsed at: Pending

Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: 23 days ago

Topics: dataset, docker, docker-image, eda, jupyter-notebook, netflix, pyspark, pyspark-notebook, python

Readme
Loading...