GitHub topics: image-text-extraction
FurqanHun/textnomnom-py
Extract text from PDFs, PPTs, & URLs (with OCR support). Converts PPT to PDF & handles files or folders. 🦍
Language: Python - Size: 46.9 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

zong4/PDFAndImageDiffTool
This tool compares the text content of two PDF files or images and generates an HTML file highlighting the differences in a format similar to VSCode's Git Diff view. It supports text extraction from PDFs and images (using Tesseract OCR) and provides a visual side-by-side comparison of the differences. Perfect for document version control, proofread
Language: Python - Size: 117 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
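
The side-by-side HTML comparison described for this tool can be approximated with Python's standard library. The sketch below is not taken from the repository. It assumes the text has already been extracted from the two PDFs or images (for example by an OCR step), and the function name `html_side_by_side` and its parameters are hypothetical.

```python
import difflib


def html_side_by_side(old_text: str, new_text: str,
                      old_label: str = "old", new_label: str = "new") -> str:
    """Return a complete HTML page showing a side-by-side diff of two texts.

    Hypothetical helper: the inputs are plain strings that some earlier
    extraction step (PDF parsing or OCR) is assumed to have produced.
    """
    # HtmlDiff renders a two-column table; wrapcolumn keeps long lines readable.
    differ = difflib.HtmlDiff(wrapcolumn=80)
    return differ.make_file(
        old_text.splitlines(),
        new_text.splitlines(),
        fromdesc=old_label,
        todesc=new_label,
    )
```

Calling `html_side_by_side(text_a, text_b, "v1", "v2")` and writing the returned string to a `.html` file gives a page that opens in any browser. The styling is difflib's default rather than the VSCode-like view the description mentions.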
