GitHub / NathanP23 / Big-Data-Mining-52002
Midterm and Final assignments of the course "Big Data Mining (52002)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science. Focuses on analyzing massive datasets using Python, SQL, cloud computing, and network analysis. Includes project guidelines for scalable data mining techniques and distributed computing.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NathanP23%2FBig-Data-Mining-52002
PURL: pkg:github/NathanP23/Big-Data-Mining-52002
Stars: 1
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 21 MB
Dependencies parsed at: Pending
Created at: 8 months ago
Updated at: 3 months ago
Pushed at: 3 months ago
Last synced at: 3 months ago
Topics: bash, big-data, data-engineering, data-mining, data-pipeline, huji, json-processing, nlp, nltk, pandas, python, slurm, spark, streaming-data, tfidf, unix, word-frequency