GitHub / MaxineXiong / Scraping-Scanned-PDF-Docs-using-OCR-with-RPA
This repository contains automation solutions that efficiently extracts text from scanned PDF documents with consistent layouts. Utilizing Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.
Stars: 0
Forks: 0
Open issues: 0
License: mit
Language:
Size: 4.96 MB
Dependencies parsed at: Pending
Created at: about 1 year ago
Updated at: about 1 year ago
Pushed at: about 1 year ago
Last synced at: about 1 year ago
Topics: ocr, optical-character-recognition, robotic-process-automation, rpa, scanned-documents, scanned-receipts, screen-scraping, uipath, uipath-classic-design, uipath-modern-design, uipath-studio