An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf2markdown

PaddlePaddle/PaddleX

All-in-One Development Tool based on PaddlePaddle

Language: Python - Size: 626 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5,571 - Forks: 1,039

MarkPDFdown/markpdfdown

A high-quality PDF-to-Markdown tool based on large language model visual recognition.

Language: Python - Size: 9.52 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 859 - Forks: 60

PaddlePaddle/PaddleOCR

Awesome multilingual OCR and document parsing toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded, and IoT devices)

Language: Python - Size: 1.29 GB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 50,611 - Forks: 8,347