An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf2markdown

PaddlePaddle/PaddleOCR

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language: Python - Size: 1.46 GB - Last synced at: about 19 hours ago - Pushed at: 6 days ago - Stars: 53,400 - Forks: 8,578

PaddlePaddle/PaddleX

All-in-One Development Tool based on PaddlePaddle

Language: Python - Size: 677 MB - Last synced at: about 19 hours ago - Pushed at: 6 days ago - Stars: 5,739 - Forks: 1,072

MarkPDFdown/markpdfdown

A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具

Language: Python - Size: 9.53 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 1,565 - Forks: 112