Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / rmottanet / unchainedtext
UnchainedText: Break free from PDFs! Easily extract raw text to .txt for preprocessing.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rmottanet%2Funchainedtext
Stars: 0
Forks: 0
Open Issues: 0
License: agpl-3.0
Language: Python
Repo Size: 31.3 KB
Dependencies:
14
Created: about 2 months ago
Updated: about 2 months ago
Last pushed: about 2 months ago
Last synced: about 2 months ago
Topics: data-extraction, extractor, pdf-text-extraction, text-extraction, text-extraction-tool, text-processing
Files
Dependencies
- actions/checkout v2 composite
- actions/checkout v2 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- python 3.9-alpine build
- PyMuPDF ==1.24.0
- PyMuPDFb ==1.24.0
- exceptiongroup ==1.2.0
- iniconfig ==2.0.0
- packaging ==24.0
- pluggy ==1.4.0
- pytest ==8.1.1
- python-dotenv ==1.0.1
- tomli ==2.0.1