An open API service providing repository metadata for many open source software ecosystems.

GitHub / NathanP23 / Big-Data-Mining-52002

Midterm and final assignments for the course "Big Data Mining (52002)", offered by the Department of Statistics and Data Science at The Hebrew University of Jerusalem. Focuses on analyzing massive datasets using Python, SQL, cloud computing, and network analysis. Includes project guidelines for scalable data mining techniques and distributed computing.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NathanP23%2FBig-Data-Mining-52002
PURL: pkg:github/NathanP23/Big-Data-Mining-52002
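The two identifiers above follow a visible pattern: the JSON API URL percent-encodes the `/` in `owner/repo` as `%2F`, while the PURL keeps it literal. A minimal Python sketch of how such strings could be built is shown here. The function names (`repo_api_url`, `repo_purl`, `fetch_repo_metadata`) are hypothetical helpers, the URL layout is inferred only from the single example on this page, and the shape of the JSON response is not shown here, so the fetch helper returns whatever the service sends without assuming any field names.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base URL copied from the "JSON API" line on this page.
API_ROOT = "http://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the JSON API URL for a repository (hypothetical helper).

    safe="" makes quote() encode the "/" in "owner/repo" as "%2F",
    matching the URL shown on this page.
    """
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def repo_purl(full_name: str) -> str:
    """Build a GitHub PURL in the form shown on this page (hypothetical helper)."""
    return f"pkg:github/{full_name}"


def fetch_repo_metadata(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the JSON document; its fields are not assumed here."""
    with urlopen(repo_api_url(host, full_name), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    name = "NathanP23/Big-Data-Mining-52002"
    print(repo_api_url("GitHub", name))
    print(repo_purl(name))
```

Running the script prints the two strings without touching the network; `fetch_repo_metadata` is only needed when the live metadata is wanted.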

Stars: 1
Forks: 0
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 21 MB
Dependencies parsed at: Pending

Created at: 8 months ago
Updated at: 3 months ago
Pushed at: 3 months ago
Last synced at: 3 months ago

Topics: bash, big-data, data-engineering, data-mining, data-pipeline, huji, json-processing, nlp, nltk, pandas, python, slurm, spark, streaming-data, tfidf, unix, word-frequency
