An open API service providing repository metadata for many open source software ecosystems.

GitHub / jhustles / newegg_webscraper_crawler_dataExtraction_automated

An automated web scraper and crawler program for laptops on Newegg.com using mainly Python, BeautifulSoup4, and Splinter. Alerts users when suspects you are a bot to take actions to circumvent their defenses. Data Extraction in ETL.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jhustles%2Fnewegg_webscraper_crawler_dataExtraction_automated

Stars: 2
Forks: 1
Open issues: 0

License: mit
Language: Jupyter Notebook
Size: 16.4 MB
Dependencies parsed at: Pending

Created at: about 5 years ago
Updated at: over 4 years ago
Pushed at: almost 5 years ago
Last synced at: about 2 years ago

Topics: beautifulsoup4, etl, jupyter-notebook, newegg, page-crawler, python3, splinter, webscraper

    Loading...