Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ocr-text

MehulGoel1/ocr.text.search

This allows to search text among all the image (screenshot) files in a specified folder and it returns a list of file names in which all, it found the text. It runs ocr always on just the newly added files for lesser time consumption. When any screenshots or images are removed from the folder thier corresponding text file is archived not deleted, and hence they not searched for the text.

Language: Python - Size: 12.7 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 3 - Forks: 2

marijnkoolen/fuzzy-search

Fuzzy search modules for searching lists of words in low quality OCR and HTR text.

Language: HTML - Size: 7.43 MB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 19 - Forks: 1

ruoyuxie/noisy_parallel_data_alignment

Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685

Language: Python - Size: 245 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0