GitHub topics: binarized-dataset
PedroBarcha/old-books-dataset
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.
Language: HTML - Size: 1.29 GB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 2
