GitHub / genieincodebottle / parsemypdf
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/genieincodebottle%2Fparsemypdf
PURL: pkg:github/genieincodebottle/parsemypdf
Stars: 111
Forks: 27
Open issues: 0
License: mit
Language: Python
Size: 3.01 MB
Dependencies parsed at: Pending
Created at: 9 months ago
Updated at: 26 days ago
Pushed at: 26 days ago
Last synced at: 26 days ago
Topics: camelot, claude, docling, gemini-ai, gemini-pro, llama-parse, llama-vision, llama4, markitdown, mistral-ocr, ocr, ocr-python, omniai, openai, pymupdf, pypdf, smoldocling, unstructured-io