Mirror of https://github.com/opendatalab/MinerU.git, synced 2026-03-27 11:08:32 +07:00
Merge pull request #4365 from tommygood/docs/fix-typos-spans-pdf
Docs: correct file naming format to use '_span.pdf'
@@ -29,12 +29,12 @@ The following sections provide detailed descriptions of each file's purpose and
-### Text Spans File (spans.pdf)
+### Text Spans File (span.pdf)

 > [!NOTE]
 > Only applicable to pipeline backend

-**File naming format**: `{original_filename}_spans.pdf`
+**File naming format**: `{original_filename}_span.pdf`

 **Functionality**:

@@ -702,7 +702,7 @@ The above files constitute MinerU's complete output results. Users can choose ap
 - **Debugging and verification** (Use visualization files):
 * layout.pdf
-* spans.pdf
+* span.pdf

 - **Content extraction**: (Use simplified files):
 * *.md

@@ -29,12 +29,12 @@
-### Text Spans File (spans.pdf)
+### Text Spans File (span.pdf)

 > [!NOTE]
 > Only applicable to pipeline backend

-**File naming format**: `{原文件名}_spans.pdf`
+**File naming format**: `{原文件名}_span.pdf`

 **Functionality**:

@@ -817,7 +817,7 @@ The vlm backend's content_list.json file structure is similar to that of the pipeline backend, accompanied by
 - **Debugging and verification** (use visualization files):
 * layout.pdf
-* spans.pdf
+* span.pdf

 - **Content extraction** (use simplified files):
 * *.md
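
The corrected naming format in the hunks above can be illustrated with a minimal sketch. It assumes that `{original_filename}` means the input file name without its extension, which the diff does not state explicitly. The helper names `span_pdf_name` and `fix_stale_reference` are hypothetical and are not part of MinerU.

```python
from pathlib import Path

OLD_SUFFIX = "_spans.pdf"  # name used in the docs before this change
NEW_SUFFIX = "_span.pdf"   # name the docs use after this change


def span_pdf_name(source: str) -> str:
    """Build the documented spans visualization file name for a source file.

    Assumption: {original_filename} is the file stem (name without extension).
    """
    return f"{Path(source).stem}{NEW_SUFFIX}"


def fix_stale_reference(text: str) -> str:
    """Rewrite mentions of the old '_spans.pdf' suffix to '_span.pdf'."""
    return text.replace(OLD_SUFFIX, NEW_SUFFIX)


if __name__ == "__main__":
    print(span_pdf_name("docs/report.pdf"))
    print(fix_stale_reference("Open `report_spans.pdf` to inspect spans."))
```

A script that looks for the visualization file by its documented name would need the `_span.pdf` suffix; the second helper shows the same substitution this commit applies to the documentation text.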