GitHub topics: extractable

Repositories

itext/itext-pdfocr-java

pdfOCR is an iText 7 add-on that recognizes and extracts text from scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.

Language: Java - Size: 266 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 36 - Forks: 9

itext/itext-pdfocr-dotnet

Language: C# - Size: 115 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 40 - Forks: 15

Related Keywords

archival (2), character (2), data (2), diacritic (2), extractable (2), glyphs (2), hindi (2), image (2), iso-compliant (2), ligatures (2), mandarin (2), ocr (2), optical (2), pdf (2), portuguese (2), recognition (2), scan (2), searchable (2), spanish (2), tesseract (2)
