An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: image-text-extraction

FurqanHun/textnomnom-py

Extract text from PDFs, PPTs, & URLs (with OCR support). Converts PPT to PDF & handles files or folders. 🦍

Language: Python - Size: 46.9 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

zong4/PDFAndImageDiffTool

This tool compares the text content of two PDF files or images and generates an HTML file highlighting the differences in a format similar to VSCode's Git Diff view. It supports text extraction from PDFs and images (using Tesseract OCR) and provides a visual side-by-side comparison of the differences. Perfect for document version control, proofread

Language: Python - Size: 117 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0