An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: extracting-texts

MaksimJames/pyhtmltext

pyhtmltext is a useful and flexible tool for extracting text from HTML.

Language: Python - Size: 12.7 KB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

skupriienko/Pyxtract

Python module for extracting text from URLs and PDFs

Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1