GitHub topics: pdf2markdown
PaddlePaddle/PaddleX
All-in-One Development Tool based on PaddlePaddle
Language: Python - Size: 626 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5,571 - Forks: 1,039

MarkPDFdown/markpdfdown
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
Language: Python - Size: 9.52 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 859 - Forks: 60

PaddlePaddle/PaddleOCR
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language: Python - Size: 1.29 GB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 50,611 - Forks: 8,347
