Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / alisafaya / txt-from-pdf
Extracting clean text from pdfs using pdfminer.six and pypdf.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alisafaya%2Ftxt-from-pdf
Stars: 1
Forks: 0
Open Issues: 1
License: apache-2.0
Language: Python
Repo Size: 23.4 KB
Dependencies: 3
Created: about 1 month ago
Updated: 7 days ago
Last pushed: 8 days ago
Last synced: 7 days ago
Topics: pdf, pdf-document-processor, text-mining
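The JSON API address listed above can be rebuilt from the host name and the repository's full name. The following is a minimal sketch using only the Python standard library; the helper names `repository_url` and `fetch_repository` are hypothetical, and the shape of the returned JSON is not assumed here.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the JSON API URL shown on this page.
API_BASE = "https://repos.ecosyste.ms/api/v1"


def repository_url(host: str, full_name: str) -> str:
    """Build the repository endpoint (hypothetical helper).

    The slash in "owner/name" is percent-encoded as %2F so that the
    full name stays a single path segment.
    """
    host_part = urllib.parse.quote(host, safe="")
    name_part = urllib.parse.quote(full_name, safe="")
    return f"{API_BASE}/hosts/{host_part}/repositories/{name_part}"


def fetch_repository(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Download and decode the repository record (requires network access)."""
    url = repository_url(host, full_name)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_repository("GitHub", "alisafaya/txt-from-pdf")` should return the record behind the metadata above as a dictionary; inspect its keys before relying on any particular field name.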
Dependencies
pyproject.toml (pypi)
setup.py (pypi)
- pdfminer.six ==20231228
- pypdf *
- unicodedata2 *
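To illustrate how the three dependencies listed above could fit together, here is a rough sketch of text extraction followed by Unicode cleanup. It is not the repository's actual implementation: the function names are hypothetical, and the cleanup rules (NFKC normalization, whitespace collapsing) are assumptions about what "clean text" might mean. The third-party imports are deferred so that the cleanup helper works without them installed.

```python
try:
    # unicodedata2 mirrors the stdlib module with newer Unicode data.
    import unicodedata2 as unicodedata
except ImportError:
    import unicodedata


def clean_text(raw: str) -> str:
    """Assumed cleanup: NFKC-normalize, collapse spaces and blank-line runs."""
    text = unicodedata.normalize("NFKC", raw)
    cleaned = []
    for line in text.splitlines():
        line = " ".join(line.split())  # collapse inner whitespace
        # Keep at most one consecutive blank line.
        if line or (cleaned and cleaned[-1]):
            cleaned.append(line)
    return "\n".join(cleaned).strip()


def extract_with_pdfminer(path: str) -> str:
    """Extract text with pdfminer.six (hypothetical wrapper)."""
    from pdfminer.high_level import extract_text
    return clean_text(extract_text(path))


def extract_with_pypdf(path: str) -> str:
    """Extract text page by page with pypdf (hypothetical wrapper)."""
    from pypdf import PdfReader
    reader = PdfReader(path)
    pages = (page.extract_text() or "" for page in reader.pages)
    return clean_text("\n".join(pages))
```

Either wrapper takes a path to a PDF file. Which library gives cleaner output depends on the document, so comparing both on a sample file is a reasonable first step.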