GitHub topics: ocr-quality
Living-with-machines/lwm_ARTIDIGH_2020_OCR_impact_downstream_NLP_tasks
Repository for code underlying the paper 'Assessing the Impact of OCR Quality on Downstream NLP Tasks'
Language: Jupyter Notebook - Size: 52.1 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 2

erl-ang/interactive-ocr
Implementation of a couple of heuristics that estimate OCR quality without relying on ground-truth data, focusing on historical documents written in English.
Language: Python - Size: 3.28 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1
