An open API service providing repository metadata for many open source software ecosystems.

GitHub / mahdizakery / py-link-crawler

A Python-based web crawler that uses Playwright to extract links from web pages, starting from a given URL. It collects all links within the same base domain, saves them to a JSON file.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mahdizakery%2Fpy-link-crawler
PURL: pkg:github/mahdizakery/py-link-crawler

Stars: 0
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 1000 Bytes
Dependencies parsed at: Pending

Created at: 7 months ago
Updated at: 7 months ago
Pushed at: 7 months ago
Last synced at: 7 months ago

Topics: crawler, linkscrapper, scrapper, web-scraping

    Loading...