An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: python-scrapy-framework

demamano/bookscraper

The repository "bookscraper" contains web-scraping and web-crawling projects using the Python Scrapy framework. It scrapes and crawls data from several websites and saves the data in JSON, CSV, and XML formats. For more details, you can check the [README file](https://github.com/demamano/bookscraper/blob/c372021ff5b80db635d4551757c010b7a9f8c028/RE

Language: Python - Size: 2.21 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

WillCaton2350/Python-Scrapy-Web-Crawler

Web crawler and data-preprocessing tool written in Python with Scrapy. The ETL process extracts specific data from a web page using Scrapy and organizes it into a structured format using Scrapy items. The extracted data is then saved in JSON format for further analysis and integration with MySQL Workbench.

Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

matejbasic/PythonScrapyBasicSetup

Basic setup for the Python Scrapy framework with random user agents and IP addresses.

Language: Python - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 57 - Forks: 14
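
For readers unfamiliar with the "random user agents" technique mentioned in the PythonScrapyBasicSetup description, the following is a minimal sketch of how a Scrapy downloader middleware could rotate the User-Agent header per request. It is not taken from that repository: the class name `RandomUserAgentMiddleware`, the setting name `USER_AGENT_LIST`, and the sample user-agent strings are all hypothetical and chosen for illustration only. The sketch relies on Scrapy's documented middleware hooks (`from_crawler` and `process_request`) and does not cover IP or proxy rotation.

```python
import random


class RandomUserAgentMiddleware:
    """Illustrative downloader middleware; not the repository's actual code."""

    # Hypothetical fallback values used when no list is configured.
    DEFAULT_USER_AGENTS = [
        "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0",
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) ExampleBrowser/2.0",
    ]

    def __init__(self, user_agents=None):
        # Copy the list so later changes by the caller do not affect us.
        self.user_agents = list(user_agents or self.DEFAULT_USER_AGENTS)

    @classmethod
    def from_crawler(cls, crawler):
        # USER_AGENT_LIST is an assumed, project-specific setting name,
        # not a built-in Scrapy setting.
        return cls(crawler.settings.getlist("USER_AGENT_LIST"))

    def process_request(self, request, spider):
        # Choose a user agent at random for each outgoing request.
        request.headers["User-Agent"] = random.choice(self.user_agents)
        # Returning None tells Scrapy to continue processing the request.
        return None
```

In a Scrapy project, a middleware like this would typically be enabled through the `DOWNLOADER_MIDDLEWARES` setting, using the dotted path to wherever the class is defined in that project.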