Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / hackersandslackers / jsonld-scraper-tutorial
🌎 🖥 Supercharge your scraper to extract quality page metadata by parsing JSON-LD data via Python's extruct library.
Stars: 12
Forks: 3
Open Issues: 24
License: MIT
Language: Python
Repo Size: 489 KB
Dependencies: 93
Created: almost 4 years ago
Updated: about 1 year ago
Last pushed: 1 day ago
Last synced: 1 day ago
Topics: beautifulsoup, extruct, json-ld, python, scraper, structured-data, tutorial
Funding links: https://www.buymeacoffee.com/hackersslackers
Dependencies
pyproject.toml
pypi
requirements.txt
pypi
- attrs ==20.1.0
- beautifulsoup4 ==4.9.1
- certifi ==2020.6.20
- chardet ==3.0.4
- extruct ==0.10.0
- html-text ==0.5.2
- html5lib ==1.1
- idna ==2.10
- isodate ==0.6.0
- lxml ==4.5.2
- mf2py ==1.1.2
- more-itertools ==8.5.0
- packaging ==20.4
- pluggy ==0.13.1
- py ==1.9.0
- pyparsing ==2.4.7
- pytest ==6.0.1
- rdflib ==5.0.0
- rdflib-jsonld ==0.5.0
- requests ==2.24.0
- six ==1.15.0
- soupsieve ==2.0.1
- urllib3 ==1.25.10
- w3lib ==1.22.0
- wcwidth ==0.2.5
- webencodings ==0.5.1
.github/workflows/python-app.yml
actions
- actions/checkout v2 composite
- actions/setup-python v2 composite
Pipfile.lock
pypi
- attrs ==19.3.0 develop
- iniconfig ==1.0.1 develop
- more-itertools ==8.4.0 develop
- packaging ==20.4 develop
- pluggy ==0.13.1 develop
- py ==1.9.0 develop
- pyparsing ==2.4.7 develop
- pytest ==6.0.1 develop
- six ==1.15.0 develop
- toml ==0.10.1 develop
- beautifulsoup4 ==4.9.1
- certifi ==2020.6.20
- chardet ==3.0.4
- extruct ==0.9.0
- html-text ==0.5.2
- html5lib ==1.1
- idna ==2.10
- isodate ==0.6.0
- lxml ==4.5.2
- mf2py ==1.1.2
- pyparsing ==2.4.7
- rdflib ==4.2.2
- rdflib-jsonld ==0.5.0
- requests ==2.24.0
- six ==1.15.0
- soupsieve ==2.0.1
- urllib3 ==1.25.10
- w3lib ==1.22.0
- webencodings ==0.5.1
poetry.lock
pypi
- atomicwrites 1.4.0 develop
- attrs 19.3.0 develop
- colorama 0.4.3 develop
- iniconfig 1.0.1 develop
- more-itertools 8.4.0 develop
- packaging 20.4 develop
- pluggy 0.13.1 develop
- py 1.9.0 develop
- pytest 6.0.1 develop
- toml 0.10.1 develop
- beautifulsoup4 4.9.1
- certifi 2020.6.20
- chardet 3.0.4
- extruct 0.9.0
- html-text 0.5.2
- html5lib 1.1
- idna 2.10
- isodate 0.6.0
- lxml 4.5.2
- mf2py 1.1.2
- pyparsing 2.4.7
- rdflib 4.2.2
- rdflib-jsonld 0.5.0
- requests 2.24.0
- six 1.15.0
- soupsieve 1.9.6
- urllib3 1.25.10
- w3lib 1.22.0
- webencodings 0.5.1