GitHub / priyendumori / Wiki-Search-Engine
A complete search engine built on top of a 75 GB Wikipedia corpus, with subsecond search latency. Results are Wikipedia pages ordered by TF-IDF relevance to the given search terms. From optimized code to a k-way merge sort algorithm, this project addresses latency, indexing, and big data challenges.
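The description names two techniques, k-way merging of sorted index runs and TF-IDF ranking. The following is a minimal sketch of how they could fit together, not the repository's actual code. The function names `merge_runs` and `rank`, the `(term, doc_id, tf)` tuple layout, and the `(1 + log tf) * log(N / df)` weighting are all assumptions chosen for illustration. In a real external merge sort the runs would be streamed lazily from files on disk rather than held in lists.

```python
import heapq
import math
from collections import defaultdict


def merge_runs(runs):
    """K-way merge of sorted runs into a term -> postings mapping.

    Each run is an iterable of (term, doc_id, tf) tuples that is already
    sorted. heapq.merge consumes the runs lazily, so they could equally be
    generators reading partial index files from disk (hypothetical layout).
    """
    merged = defaultdict(list)
    for term, doc_id, tf in heapq.merge(*runs):
        merged[term].append((doc_id, tf))
    return dict(merged)


def rank(index, query_terms, total_docs):
    """Return doc ids ordered by descending TF-IDF score.

    Assumed weighting: (1 + ln(tf)) * ln(total_docs / df). The weighting
    used by the project itself is not stated in the description.
    """
    scores = defaultdict(float)
    for term in query_terms:
        postings = index.get(term, [])
        if not postings:
            continue  # term not in the index, contributes nothing
        idf = math.log(total_docs / len(postings))
        for doc_id, tf in postings:
            scores[doc_id] += (1 + math.log(tf)) * idf
    # Break score ties by doc id so the ordering is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Two small sorted runs standing in for partial index files.
run_a = [("engine", 1, 2), ("search", 1, 3), ("wiki", 2, 1)]
run_b = [("engine", 3, 1), ("search", 2, 1), ("search", 4, 5)]

index = merge_runs([run_a, run_b])
ranking = rank(index, ["search", "engine"], total_docs=4)
```

With these sample runs, document 1 ranks first because it matches both query terms. Merging sorted runs this way keeps memory use proportional to the number of runs rather than the size of the corpus, which is the usual reason to reach for a k-way merge when indexing a dump that does not fit in RAM.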
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/priyendumori%2FWiki-Search-Engine
PURL: pkg:github/priyendumori/Wiki-Search-Engine
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 9.77 KB
Dependencies parsed at: Pending
Created at: almost 6 years ago
Updated at: almost 6 years ago
Pushed at: almost 6 years ago
Last synced at: over 2 years ago
Topics: external-merge-sort, indexing, ranking-algorithm, search-engine, tf-idf-score, wikipedia-dump