An open API service providing repository metadata for many open source software ecosystems.

GitHub / akumathedyn123 / cpp-url-collector

This C++ program crawls websites, extracts links from their HTML content, and saves them for further analysis. It takes URLs from a text file, downloads the corresponding HTML, parses it, and saves the extracted links to organized files.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/akumathedyn123%2Fcpp-url-collector
PURL: pkg:github/akumathedyn123/cpp-url-collector

Stars: 1
Forks: 0
Open issues: 0

License: mit
Language: C++
Size: 18.6 KB
Dependencies parsed at: Pending

Created at: about 1 year ago
Updated at: 10 months ago
Pushed at: 10 months ago
Last synced at: 10 months ago

Topics: cpp-url-bot, cpp-web-bot, cpp-web-collector, cpp-web-crawler, cpp-web-scrapper, cpp-web-services, url-collect, url-collect-bot, url-collector, url-collector-bot, url-crawl, url-crawler, web-bot, web-scaper, web-scraping

    Loading...