mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-27 11:08:32 +07:00
update readme
This commit is contained in:
@@ -21,8 +21,8 @@
|
||||
|
||||
MinerU is a one-stop, open-source data extraction tool, primarily includes the following features:
|
||||
|
||||
- PDF Document Extraction [Magic-PDF](#Magic-PDF)
|
||||
- Webpage & E-book Extraction [Magic-Doc](#Magic-Doc)
|
||||
- [Magic-PDF](#Magic-PDF) PDF Document Extraction
|
||||
- [Magic-Doc](#Magic-Doc) Webpage & E-book Extraction
|
||||
|
||||
# Magic-PDF
|
||||
|
||||
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
|
||||
### Submodule Repositories
|
||||
|
||||
- [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)
|
||||
A Comprehensive Toolkit for High-Quality PDF Content Extraction
|
||||
- A Comprehensive Toolkit for High-Quality PDF Content Extraction
|
||||
- [Miner-PDF-Benchmark](https://github.com/opendatalab/Miner-PDF-Benchmark)
|
||||
An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
|
||||
- An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
|
||||
|
||||
## Getting Started
|
||||
|
||||
|
||||
@@ -21,8 +21,8 @@
|
||||
|
||||
MinerU 是一款一站式开源数据提取工具,主要包含以下功能:
|
||||
|
||||
- PDF文档提取 [Magic-PDF](#Magic-PDF)
|
||||
- 网页与电子书提取 [Magic-Doc](#Magic-Doc)
|
||||
- [Magic-PDF](#Magic-PDF) PDF文档提取
|
||||
- [Magic-Doc](#Magic-Doc) 网页与电子书提取
|
||||
|
||||
# Magic-PDF
|
||||
|
||||
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
|
||||
### 子模块仓库
|
||||
|
||||
- [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)
|
||||
高质量的PDF内容提取工具包
|
||||
- 高质量的PDF内容提取工具包
|
||||
- [Miner-PDF-Benchmark](https://github.com/opendatalab/Miner-PDF-Benchmark)
|
||||
端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
|
||||
- 端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
|
||||
|
||||
|
||||
## 上手指南
|
||||
|
||||
Reference in New Issue
Block a user