An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-extraction

laptopklm/WebExtractor

## WebExtractor WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple CLI interface for cybersecurity professionals to gather critical intelligence. 🐙💻

Language: Python - Size: 18.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

lightfeed/extractor

Use LLMs to robustly extract structured data from HTML and markdown

Language: TypeScript - Size: 76.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 39 - Forks: 3

ballisticspace1/WebExtractor

## WebExtractor WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple CLI interface for cybersecurity professionals to gather critical intelligence. 🐙💻

Size: 5.86 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

platonai/PulsarRPA

PulsarRPA: An AI-Enabled, Super-Fast, Thread-Safe Browser Automation Solution! 💖

Language: Kotlin - Size: 30.6 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 887 - Forks: 128

lightfeed/browser-agent

Serverless AI browser agent

Language: TypeScript - Size: 5.67 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

platonai/PulsarRPAPro

Fully automated and hands-free, accurately extracting and understanding web content — powered by machine learning agents.

Language: Kotlin - Size: 24.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 119 - Forks: 27

Victor-Pavageau/AverageMoviesDuration

Language: Python - Size: 85.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

franciscomvargas/DeUrlCruncher

Get google URL results from search query

Language: Batchfile - Size: 5.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

iamxiatian/octopus_spider

基于Scala Akka的分布式主题网络爬虫

Language: Scala - Size: 3.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 2

timkriz/wieramemo_vase Fork of LukaZeleznik/wieramemo_vase

Programming assignments for Web Information Extraction and Retrieval, FRI UL, 2021. PA1: standalone webcrawler of .gov.si web sites, PA2: approaches of the structured web data extraction, PA3: Data processing and indexing and Data retrieval.

Language: HTML - Size: 31.1 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

galinaalperovich/Ms-Thesis-CVUT

Automatic extraction of the information on local event from a webpage with Machine Learning

Language: Jupyter Notebook - Size: 35.1 MB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 2

bharatpurohit97/Webextractor

Extracting links from any website.

Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0