An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-page-parsing

ersinaksar/Network-of-Websites

A Python command-line script that collects all page links reachable from a starting website, START_URL, up to a maximum depth of DEPTH. The resulting network of website URLs is stored in a Memgraph database. The script can also find the shortest path from START_URL to END_URL within the scraped network stored in Memgraph.

Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0
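
The approach described for this repository (a depth-limited crawl followed by a shortest-path query) can be illustrated with a minimal sketch. This is not the repository's code: it keeps the link network in a plain in-memory dictionary instead of Memgraph, and the names `LinkParser`, `crawl`, `shortest_path`, and the `fetch` callable are hypothetical, chosen only for illustration.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collect the href values of all <a> tags in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def crawl(start_url, depth, fetch):
    """Breadth-first crawl of pages at most `depth` link hops from start_url.

    `fetch` is a hypothetical callable mapping a URL to its HTML text.
    Returns an adjacency dict (url -> list of linked URLs); this dict is an
    assumed stand-in for the graph database.
    """
    graph = {}
    queue = deque([(start_url, 0)])
    while queue:
        url, level = queue.popleft()
        if url in graph:
            continue  # already visited
        parser = LinkParser()
        parser.feed(fetch(url))
        # Resolve relative links against the page they were found on.
        links = [urljoin(url, href) for href in parser.hrefs]
        graph[url] = links
        if level < depth:
            for link in links:
                queue.append((link, level + 1))
    return graph


def shortest_path(graph, start_url, end_url):
    """Return the shortest link path as a list of URLs, or None if unreachable."""
    previous = {start_url: None}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        if url == end_url:
            path = []
            while url is not None:
                path.append(url)
                url = previous[url]
            return path[::-1]
        for link in graph.get(url, []):
            if link not in previous:
                previous[link] = url
                queue.append(link)
    return None
```

Passing `fetch` as a parameter keeps the sketch independent of any particular HTTP library; a real script would likely also handle network errors and restrict which domains are followed.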

Develop-Packt/Reading-Data-from-Different-Sources

Read and handle data from HTML, JSON, and CSV files (among others), and practice web page parsing with BeautifulSoup4

Language: HTML - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1