Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

This repository contains automation solutions that efficiently extracts text from scanned PDF documents with consistent layouts. Utilizing Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaxineXiong%2FScraping-Scanned-PDF-Docs-using-OCR-with-RPA
PURL: pkg:github/MaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

Stars: 0
Forks: 0
Open issues: 0

License: mit
Language:
Size: 4.96 MB
Dependencies parsed at: Pending

Created at: over 1 year ago
Updated at: over 1 year ago
Pushed at: over 1 year ago
Last synced at: over 1 year ago

Topics: ocr, optical-character-recognition, robotic-process-automation, rpa, scanned-documents, scanned-receipts, screen-scraping, uipath, uipath-classic-design, uipath-modern-design, uipath-studio

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / MaxineXiong / Scraping-Scanned-PDF-Docs-using-OCR-with-RPA