An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: url-crawler

kevincobain2000/email_extractor

Yes it works! God Speed. Email Extractor by Full Url Crawl. Extract emails and web urls from a website with full crawl or option limit, depth of urls to crawl using terminal.

Language: Go - Size: 422 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 81 - Forks: 39

cyclone-github/spider

Spider - web crawler and local file processor to generate wordlist / ngrams

Language: Go - Size: 84 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 15 - Forks: 0

komodoooo/Some-things

Scripts, POCs & bullshit

Language: Ruby - Size: 169 KB - Last synced at: 7 minutes ago - Pushed at: 2 months ago - Stars: 28 - Forks: 11

akumathedyn123/cpp-url-collector

This C++ program crawls websites, extracts links from their HTML content, and saves them for further analysis. It takes URLs from a text file, downloads the corresponding HTML, parses it, and saves the extracted links to organized files.

Language: C++ - Size: 18.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

ElektroStudios/Google-Search-URL-Crawler

Desktop app that crawls urls from Google's search engine results

Language: Visual Basic .NET - Size: 4.06 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 2

v0rl0x/golang-url-crawler

A script to fetch domains and subdomains in a target URL logging both in scope and out of scope URLs.

Language: Go - Size: 17.6 KB - Last synced at: 10 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

limit-zero/url-juicer

🍊🔗 Squeeze some juice from URLs: A URL crawler/extraction library.

Language: JavaScript - Size: 269 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

r3dxpl0it/Damn-Small-URL-Crawler

A Minimal Yet Powerful Crawler for Extracting all The Internal/External/Fuzz-able Links from a website

Language: Python - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 9

DmitryKey/url-crawler

Python crawler with an HTML dashboard. It checks statuses of URLs for specific rules

Language: HTML - Size: 598 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 3

iampukar/url_crawler

A Python library to crawl the details of a URL.

Language: Python - Size: 11.7 KB - Last synced at: 11 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

bottomless-archive-project/url-collector

An application that crawls the Common Crawl corpus for URLs with the specified file extensions.

Language: Java - Size: 175 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mdigger/urlinfo

Rich Content API (URL Content Info)

Language: Go - Size: 23.4 KB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

absingh31/Selective_Crawler

Language: Python - Size: 315 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 4