An open API service providing repository metadata for many open source software ecosystems.

GitHub / TimInTech / pdf-text-duplicate-checker

PDF Duplicate Detector & Mover (Text + Image Hashing)

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TimInTech%2Fpdf-text-duplicate-checker
PURL: pkg:github/TimInTech/pdf-text-duplicate-checker

Stars: 0
Forks: 0
Open issues: 0

License: mit
Language: Python
Size: 98.4 MB
Dependencies parsed at: Pending

Created at: 5 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: about 1 month ago

Topics: document-processing, duplicate-detection, ocr-python, opensource-toolchain, pdf, plagiarism-detection, pymupdf, python, text-analysis, text-comparison

    Loading...