Merge pull request #3427 from opendatalab/dev

Dev
This commit is contained in:
Xiaomeng Zhao
2025-09-05 19:19:15 +08:00
committed by GitHub
3 changed files with 2 additions and 2 deletions

View File

@@ -52,7 +52,7 @@
- The `pipeline` backend now supports 270-degree rotated table parsing, bringing support for table parsing in 0/90/270-degree orientations
- `pipeline` added OCR capability support for Thai and Greek, and updated the English OCR model to the latest version. English recognition accuracy improved by 11%, Thai recognition model accuracy is 82.68%, and Greek recognition model accuracy is 89.28% (by PPOCRv5)
- Added `bbox` field (mapped to 0-1000 range) in the output `content_list.json`, making it convenient for users to directly obtain position information for each content block
- Removed the `pipeline_old_linux` installation option, no longer supporting legacy Linux systems such as `CentOS 7`, to provide better support for `uv`'s `sync`/`run` commands
<details>
<summary>History Log</summary>

View File

@@ -52,6 +52,7 @@
- `pipeline`后端增加270度旋转的表格解析能力现已支持0/90/270度三个方向的表格解析
- `pipeline`增加对泰文、希腊文的ocr能力支持并更新了英文ocr模型至最新英文识别精度提升11%,泰文识别模型精度 82.68%,希腊文识别模型精度 89.28%by PPOCRv5
- 在输出的`content_list.json`中增加了`bbox`字段(映射至0-1000范围内),方便用户直接获取每个内容块的位置信息
- 移除`pipeline_old_linux`安装可选项不再支持老版本的Linux系统如`Centos 7`等,以便对`uv``sync`/`run`等命令进行更好的支持
<details>

View File

@@ -46,7 +46,6 @@ test = [
"pytest",
"pytest-cov",
"coverage",
"beautifulsoup4",
"fuzzywuzzy"
]
vlm = [