An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-data

brightdata/brightdata-mcp

A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

Language: JavaScript - Size: 63.5 MB - Last synced at: 11 days ago - Pushed at: 18 days ago - Stars: 547 - Forks: 78

smrfeld/export-safari-reading-list

Export Safari reading list to JSON or CSV

Language: Python - Size: 806 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 3

lume-workflow/patriotismo-no-cinema

Filmes patrióticos possuem maior popularidade em períodos de guerra?

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Florents-Tselai/WarcDB

WarcDB: Web crawl data as SQLite databases.

Language: Python - Size: 51.7 MB - Last synced at: 23 days ago - Pushed at: 11 months ago - Stars: 398 - Forks: 11

cuiyuheng/maxun Fork of getmaxun/maxun

🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes [In Beta]

Size: 3.1 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mkearney/wibble

Web Data Frames

Language: R - Size: 497 KB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 0

asche910/SuperSpider

简书超级爬虫, 包括文章.用户信息.评论等

Language: Java - Size: 4.24 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 1

Develop-Packt/Web-Scraping-with-Jupyter-Notebooks

Analyze and parse HTML responses, programmatically scrape web data, and utilize Pandas DataFrames to store, transform, and merge tables.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2