An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf-text

modesty/pdf2json

converts binary PDF to JSON and text, for server-side PDF processing and command-line use.

Language: Java - Size: 121 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 2,079 - Forks: 381

MicheleCotrufo/pdf2doi

A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.

Language: Python - Size: 79.7 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 113 - Forks: 21

bradsec/pdftext

PDFText is a web app developed with JavaScript, HTML, and CSS to convert standard PDF documents to text.

Language: CSS - Size: 129 KB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

ajxv/pyocr-flask

Pdf OCR text extraction using python

Language: HTML - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

PrathameshDhande22/PdfTxtBot

A Telegram bot which extract Text from PDF, also extract the Images of PDF Pages. Made with Python

Language: Python - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

shayanalibhatti/Designing-a-PDF-Audiobook-using-Python

In this code, a simple implementation of PDF to audio converter is shown

Language: Python - Size: 1.36 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 26 - Forks: 19