An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: fine-tune-arabic-ocr-model

OmarSamirz/Fine-Tune-Tesseract-For-Arabic-Language

This research aims to fine-tune an Arabic OCR model using Tesseract 5.0, enhancing text recognition accuracy through extensive data collection, preprocessing, and image generation. By leveraging advanced training techniques and data augmentation, we achieve significant improvements in word error rates (WER).

Language: Jupyter Notebook - Size: 39.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1