Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf-text

modesty/pdf2json

converts binary PDF to JSON and text, for server-side PDF processing and command-line use.

Language: Java - Size: 120 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 1,892 - Forks: 374

MicheleCotrufo/pdf2doi

A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.

Language: Python - Size: 79.8 MB - Last synced: 27 days ago - Pushed: 3 months ago - Stars: 84 - Forks: 12

bradsec/pdftext

PDFText is a web app developed with JavaScript, HTML, and CSS to convert standard PDF documents to text.

Language: CSS - Size: 126 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

PrathameshDhande22/PdfTxtBot

A Telegram bot which extract Text from PDF, also extract the Images of PDF Pages. Made with Python

Language: Python - Size: 12.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

shayanalibhatti/Designing-a-PDF-Audiobook-using-Python

In this code, a simple implementation of PDF to audio converter is shown

Language: Python - Size: 1.36 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 26 - Forks: 19