An open API service providing repository metadata for many open source software ecosystems.

Topic: "web-extraction"

platonai/PulsarRPA

PulsarRPA: An AI-Enabled, Super-Fast, Thread-Safe Browser Automation Solution! 💖

Language: Kotlin - Size: 29.5 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 876 - Forks: 128

platonai/PulsarRPAPro

Fully automated and hands-free, accurately extracting and understanding web content — powered by machine learning agents.

Language: Kotlin - Size: 24.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 119 - Forks: 27

lightfeed/extractor

Use LLMs to robustly extract structured data from HTML and markdown

Language: TypeScript - Size: 181 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 37 - Forks: 3

iamxiatian/octopus_spider

基于Scala Akka的分布式主题网络爬虫

Language: Scala - Size: 3.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 2

galinaalperovich/Ms-Thesis-CVUT

Automatic extraction of the information on local event from a webpage with Machine Learning

Language: Jupyter Notebook - Size: 35.1 MB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 2

laptopklm/WebExtractor

## WebExtractor WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple CLI interface for cybersecurity professionals to gather critical intelligence. 🐙💻

Language: Python - Size: 18.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

franciscomvargas/DeUrlCruncher

Get google URL results from search query

Language: Batchfile - Size: 5.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

timkriz/wieramemo_vase Fork of LukaZeleznik/wieramemo_vase

Programming assignments for Web Information Extraction and Retrieval, FRI UL, 2021. PA1: standalone webcrawler of .gov.si web sites, PA2: approaches of the structured web data extraction, PA3: Data processing and indexing and Data retrieval.

Language: HTML - Size: 31.1 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Victor-Pavageau/AverageMoviesDuration

Language: Python - Size: 85.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

bharatpurohit97/Webextractor

Extracting links from any website.

Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0