Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / andrealenzi11 / py-poppleract
Python library and web service, based on the Poppler pdftotext utility and Tesseract OCR, for extracting text from PDF documents.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrealenzi11%2Fpy-poppleract
Stars: 6
Forks: 0
Open Issues: 0
License: gpl-2.0
Language: Python
Repo Size: 195 KB
Dependencies: 9
Created: 6 months ago
Updated: 3 months ago
Last pushed: 6 months ago
Last synced: 4 days ago
Topics: ocr, optical-character-recognition, pdf-reader, pdf-splitting, pdf-to-text, pdf2text, pdftotext, poppler, poppleract, py-poppleract, tesseract, tesseract-ocr, text-extraction
Dependencies
- fastapi ==0.104.1
- pdf2image ==1.16.3
- pdftotext ==2.2.2
- pytesseract ==0.3.10
- python-multipart ==0.0.6
- uvicorn ==0.24.0.post1
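
The JSON API URL listed above percent-encodes the slash between owner and repository name (`andrealenzi11%2Fpy-poppleract`). The following is a minimal sketch of how that URL could be rebuilt and queried with the Python standard library. The helper names `repository_url` and `fetch_repository` are hypothetical, and the URL pattern is inferred only from the single link shown on this page; the shape of the JSON response is not shown here, so the sketch returns it undecoded beyond plain JSON parsing.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Root inferred from the JSON API link on this page (an assumption, not documented here).
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repository_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL.

    safe="" makes quote() encode "/" as "%2F", so "owner/name" stays a
    single path segment.
    """
    encoded_host = quote(host, safe="")
    encoded_name = quote(full_name, safe="")
    return f"{API_ROOT}/hosts/{encoded_host}/repositories/{encoded_name}"


def fetch_repository(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and parse the repository record (needs network access)."""
    with urlopen(repository_url(host, full_name), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_repository("GitHub", "andrealenzi11/py-poppleract")` would request the same URL as the JSON API link; inspecting the keys of the returned dictionary is a reasonable first step, since the field names are not listed on this page.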