An open API service providing repository metadata for many open source software ecosystems.

GitHub / BhagyashriT / DICLAB2-DataAggregationBigDataAnalysisAndVisualization

Collected data about from three sources, one opinion-based social media in twitter, research data in New York Times, and the third is the common crawl data for the same topic or key phrase, and from similar time periods. Processed the three data sets collected individually using classical big data methods like Map Reduce in Google Dataproc Clusters. And then compared the outcomes using popular visualization methods in tableau.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BhagyashriT%2FDICLAB2-DataAggregationBigDataAnalysisAndVisualization

Stars: 2
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 39.8 MB
Dependencies parsed at: Pending

Created at: almost 6 years ago
Updated at: over 3 years ago
Pushed at: over 5 years ago
Last synced at: over 1 year ago

Topics: commoncrawl, crawler, dataproc, google, mapreduce, nytimes-apis, tableau, twitter-api

    Loading...