Topic: "nypost"
mukeshkdangi/nypost_searchengine
Crawled and stored metadata of web pages using multithreaded crawler. Used GCP Hadoop cluster to create inverted index. Developed custom page rank algorithm and exposed RESTful APIs with spellchecker and autocomplete features.
Language: TypeScript - Size: 10.4 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

mukeshkdangi/crawler_nypost
Crawling web pages and indexing for solr search
Language: Java - Size: 1.29 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

mukeshkdangi/edgeLink_nypost
Generating Edged between web pages which referenced from one another
Language: Java - Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
