An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: extractable

itext/itext-pdfocr-dotnet

pdfOCR is an iText 7 add-on that recognizes and extracts text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.

Language: C# - Size: 115 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 40 - Forks: 15

itext/itext-pdfocr-java

pdfOCR is an iText 7 add-on that recognizes and extracts text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.

Language: Java - Size: 266 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 8