An open API service providing repository metadata for many open source software ecosystems.

GitHub / yvgupta03 / Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Big data application of Machine Learning concepts for sentiment classification of US Airlines tweets. The focus is on the usage of pyspark libraries (ml-lib) on big data to solve a problem using Machine Learning algorithms and not about the choice of algorithm used in the ML model creation. It also involves data pre-processing using NLP techniques, cross-validation and parameter-grid builder.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yvgupta03%2FBig_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Stars: 2
Forks: 0
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 1.83 MB
Dependencies parsed at: Pending

Created at: almost 3 years ago
Updated at: about 2 years ago
Pushed at: almost 3 years ago
Last synced at: almost 2 years ago

Topics: big-data, databricks-notebooks, ml-pipelines, pyspark-mllib, twitter-sentiment-analysis

    Loading...