GitHub topics: web-extraction
laptopklm/WebExtractor
WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple command-line interface for cybersecurity professionals to gather critical intelligence. 🐙💻
Language: Python - Size: 18.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

lightfeed/extractor
Use LLMs to robustly extract structured data from HTML and markdown
Language: TypeScript - Size: 76.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 39 - Forks: 3

ballisticspace1/WebExtractor
WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple command-line interface for cybersecurity professionals to gather critical intelligence. 🐙💻
Size: 5.86 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

platonai/PulsarRPA
PulsarRPA: An AI-Enabled, Super-Fast, Thread-Safe Browser Automation Solution! 💖
Language: Kotlin - Size: 30.6 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 887 - Forks: 128

lightfeed/browser-agent
Serverless AI browser agent
Language: TypeScript - Size: 5.67 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

platonai/PulsarRPAPro
Fully automated and hands-free, accurately extracting and understanding web content — powered by machine learning agents.
Language: Kotlin - Size: 24.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 119 - Forks: 27

Victor-Pavageau/AverageMoviesDuration
Language: Python - Size: 85.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

franciscomvargas/DeUrlCruncher
Get Google result URLs for a search query
Language: Batchfile - Size: 5.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

iamxiatian/octopus_spider
A distributed topical web crawler built on Scala and Akka
Language: Scala - Size: 3.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 2

timkriz/wieramemo_vase Fork of LukaZeleznik/wieramemo_vase
Programming assignments for Web Information Extraction and Retrieval, FRI UL, 2021. PA1: a standalone web crawler for .gov.si websites; PA2: approaches to structured web data extraction; PA3: data processing, indexing, and retrieval.
Language: HTML - Size: 31.1 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

galinaalperovich/Ms-Thesis-CVUT
Automatic extraction of information about local events from a web page using machine learning
Language: Jupyter Notebook - Size: 35.1 MB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 2

bharatpurohit97/Webextractor
Extracting links from any website.
Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0
