Topic: "extract-text-from-pdf"
NanoNets/ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Language: Jupyter Notebook - Size: 5.52 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 4

euyogi/Projeto-Anceu-CS50
Meu projeto do curso CS50: Um analisador de pdfs que processa as notas dos aprovados pelo Acesso Enem e organiza tudo. Agora em C++
Language: C++ - Size: 43.2 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

sxaxmz/handle_scanned_pdf
A wrapper on top of python-OCR tools such as pytesseract and easyocr, to recognize and extract text embedded in images. Also, convert scanned-PDFs to text searchable PDFs.
Language: Python - Size: 811 KB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0
