An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: binarized-dataset

PedroBarcha/old-books-dataset

Old book pages (with ground truth), formerly used for OCR studies. The set comes in several versions that differ in resolution and binarization. Noised and denoised sets (produced by several methods) will be uploaded eventually.

Language: HTML - Size: 1.29 GB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 2