mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-27 02:58:54 +07:00
@@ -52,7 +52,7 @@
|
||||
- The `pipeline` backend now supports 270-degree rotated table parsing, bringing support for table parsing in 0/90/270-degree orientations
|
||||
- `pipeline` added OCR capability support for Thai and Greek, and updated the English OCR model to the latest version. English recognition accuracy improved by 11%, Thai recognition model accuracy is 82.68%, and Greek recognition model accuracy is 89.28% (by PPOCRv5)
|
||||
- Added `bbox` field (mapped to 0-1000 range) in the output `content_list.json`, making it convenient for users to directly obtain position information for each content block
|
||||
|
||||
- Removed the `pipeline_old_linux` installation option, no longer supporting legacy Linux systems such as `CentOS 7`, to provide better support for `uv`'s `sync`/`run` commands
|
||||
|
||||
<details>
|
||||
<summary>History Log</summary>
|
||||
|
||||
@@ -52,6 +52,7 @@
|
||||
- `pipeline`后端增加270度旋转的表格解析能力,现已支持0/90/270度三个方向的表格解析
|
||||
- `pipeline`增加对泰文、希腊文的ocr能力支持,并更新了英文ocr模型至最新,英文识别精度提升11%,泰文识别模型精度 82.68%,希腊文识别模型精度 89.28%(by PPOCRv5)
|
||||
- 在输出的`content_list.json`中增加了`bbox`字段(映射至0-1000范围内),方便用户直接获取每个内容块的位置信息
|
||||
- 移除`pipeline_old_linux`安装可选项,不再支持老版本的Linux系统如`Centos 7`等,以便对`uv`的`sync`/`run`等命令进行更好的支持
|
||||
|
||||
|
||||
<details>
|
||||
|
||||
@@ -46,7 +46,6 @@ test = [
|
||||
"pytest",
|
||||
"pytest-cov",
|
||||
"coverage",
|
||||
"beautifulsoup4",
|
||||
"fuzzywuzzy"
|
||||
]
|
||||
vlm = [
|
||||
|
||||
Reference in New Issue
Block a user