Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / aaqib-ahmed-nazir / BDA_Assignment02

This repository aims to build a basic search engine that uses Hadoop's MapReduce framework to index and process large text corpora efficiently. The dataset is a 5.2 GB subset of the English Wikipedia dump. The project focuses on implementing a naive search algorithm to address challenges in information.
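
As a rough illustration of the approach described above, the following is a minimal sketch of MapReduce-style indexing followed by a naive search. It is not taken from the repository: the function names (`map_phase`, `reduce_phase`, `search`), the tokenizer, and the term-frequency scoring are all assumptions, and the phases run in a single process rather than on a Hadoop cluster.

```python
import re
from collections import defaultdict


def map_phase(doc_id, text):
    """Map step (hypothetical): emit one (term, doc_id) pair per word."""
    for term in re.findall(r"[a-z0-9]+", text.lower()):
        yield term, doc_id


def reduce_phase(pairs):
    """Reduce step (hypothetical): build term -> {doc_id: term frequency}."""
    index = defaultdict(lambda: defaultdict(int))
    for term, doc_id in pairs:
        index[term][doc_id] += 1
    return index


def search(index, query):
    """Naive search: rank documents by the summed frequency of query terms."""
    scores = defaultdict(int)
    for term in re.findall(r"[a-z0-9]+", query.lower()):
        for doc_id, tf in index.get(term, {}).items():
            scores[doc_id] += tf
    # Highest score first; ties broken by document id for stable output.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Toy corpus standing in for the Wikipedia subset (made-up sample text).
docs = {
    1: "Hadoop runs MapReduce jobs",
    2: "MapReduce builds a search index",
    3: "Wikipedia articles",
}
pairs = [p for doc_id, text in docs.items() for p in map_phase(doc_id, text)]
index = reduce_phase(pairs)
results = search(index, "mapreduce index")
```

On a real cluster the map and reduce steps would run as separate jobs over partitioned input; here they are plain functions so the data flow is easy to follow.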

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aaqib-ahmed-nazir%2FBDA_Assignment02

Stars: 1
Forks: 0
Open Issues: 0

License: None
Language: Jupyter Notebook
Repo Size: 120 KB
Dependencies: pending

Created: 2 months ago
Updated: 2 months ago
Last pushed: 2 months ago
Last synced: 2 months ago

Topics: apache-hadoop, hadoop, jupiter-notebook, jupyter-notebook, mapreduce, mapreduce-python, python, python3, search-algorithm, search-engine
