
GitHub / huzaifakhan04 / exploratory-data-analysis-on-amazon-review-data-using-mongodb-and-pyspark

This repository presents the results of an exploratory data analysis (EDA), including visualisations, of the Amazon Review Data (2018) dataset, which contains nearly 233.1 million records and occupies approximately 128 GB of storage. The analysis was carried out with MongoDB and PySpark.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/huzaifakhan04%2Fexploratory-data-analysis-on-amazon-review-data-using-mongodb-and-pyspark
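
The JSON API URL above percent-encodes the slash in the `owner/name` pair as `%2F`. A minimal sketch of how such a URL could be built, assuming the path layout seen in that URL holds for other repositories; `API_BASE` and `repository_api_url` are hypothetical names introduced here for illustration, not part of the service:

```python
from urllib.parse import quote

# Base path taken from the JSON API URL shown above (assumed to be stable).
API_BASE = "http://repos.ecosyste.ms/api/v1"


def repository_api_url(host: str, full_name: str) -> str:
    """Hypothetical helper: build the JSON API URL for a repository."""
    # safe="" makes quote() encode "/" as %2F, matching the URL above.
    encoded_host = quote(host, safe="")
    encoded_name = quote(full_name, safe="")
    return f"{API_BASE}/hosts/{encoded_host}/repositories/{encoded_name}"


url = repository_api_url(
    "GitHub",
    "huzaifakhan04/exploratory-data-analysis-on-amazon-review-data-using-mongodb-and-pyspark",
)
print(url)
```

The resulting string can then be passed to any HTTP client, for example `urllib.request.urlopen`, to retrieve the metadata as JSON.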

Stars: 0
Forks: 0
Open issues: 0

License: BSD-3-Clause
Language: Jupyter Notebook
Size: 875 KB
Dependencies parsed at: Pending

Created at: almost 2 years ago
Updated at: almost 2 years ago
Pushed at: almost 2 years ago
Last synced at: almost 2 years ago

Topics: amazon, amazon-reviews, customer-analysis, customer-reviews, data-analysis, data-science, data-visualisation, eda, exploratory-data-analysis, inferential-statistics, mongodb, nosql, product-analysis, product-reviews, pyspark, statistical-analysis, statistical-inference, statistics
