An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dom-scraping

sanyabeast/imprint

Imprint is a lightweight, declarative DOM scraping library for extracting structured data from web pages. Define JSON-like schemas to easily map and extract data from complex websites.

Language: JavaScript - Size: 43.9 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

sqonk/phext-datakit

Datakit is a library that assists with data analysis and research. It includes classes for working with tables of data and deriving statistical information, importing those tables from file formats such as CSV, a class wrapper with statistical methods for PHP arrays, as well as memory efficient packed arrays.

Language: PHP - Size: 1.13 MB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2