GitHub topics: web-page-parsing
ersinaksar/Network-of-Websites
Python command-line script that will be able to get all website page links from starting website START_URL with a max depth of DEPTH. The network of website URLs stored in the Memgraph database. The script also able to find the shortest path from START_URLto E ND_URLfrom a scraped network of websites in the Memgraph database.
Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Develop-Packt/Reading-Data-from-Different-Sources
Read and handle data from HTML, JSON, and CSV files (among others), and practice web page parsing with BeautifulSoup4
Language: HTML - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1
