An open API service providing repository metadata for many open source software ecosystems.

GitHub / darshank15 / wikipedia-search-engine

Built a complete search engine by creating an Inverted Index on the Wikipedia corpus ( of 2018 with size 72 GB). That gives you top search result related to given query words.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/darshank15%2Fwikipedia-search-engine
PURL: pkg:github/darshank15/wikipedia-search-engine

Stars: 5
Forks: 2
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 51.5 MB
Dependencies parsed at: Pending

Created at: almost 6 years ago
Updated at: over 1 year ago
Pushed at: almost 5 years ago
Last synced at: over 1 year ago

Topics: inverted-index, mergesort, response-time, search, search-engine-algorithm, stemming, stop-words, tokenisation, wikipedia

    Loading...