An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf-extractor-pretrain

opendatalab/MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Language: Python - Size: 124 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 33,699 - Forks: 2,708

aidayang/MinerU-OneClick

MinerU免安装部署一键启动整合包

Size: 49.8 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 7 - Forks: 0

Alapipapi/MinerU Fork of opendatalab/MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Language: Python - Size: 103 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0