Mirror of https://github.com/opendatalab/MinerU.git (synced 2026-03-27 11:08:32 +07:00)
docs(readme): update changelog and highlight usability improvements
- Remove duplicate entries for paddleocr2torch and thread safety
- Add new entry for real-time progress bar implementation
- Update mfr model to unimernet(2503)
- Extend torch version compatibility
- Enhance cuda support for various GPU models
- Improve parsing speed on MPS devices
@@ -49,7 +49,6 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
# Changelog
- 2025/04/03 Release of 1.3.0, in this version we made many optimizations and improvements:
  - Installation and compatibility optimization
    - By using `paddleocr2torch`, completely replaced the use of `paddle` framework and `paddleocr` in the project, solving the conflict issue between `paddle` and `torch`.
    - By removing the use of `layoutlmv3` in layout, resolved compatibility issues caused by `detectron2`.
    - Torch version compatibility extended to 2.2~2.6 (excluding 2.5).
    - CUDA compatibility supports 11.8/12.4/12.6 (CUDA version determined by torch), resolving compatibility issues for some users with 50-series and H-series GPUs.
@@ -62,6 +61,9 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
    - Improved running speed on MPS devices.
  - Parsing effect optimization
    - Updated the mfr model to `unimernet(2503)`, solving the issue of lost line breaks in multi-line formulas.
  - Usability Optimization
    - By using `paddleocr2torch`, completely replaced the use of the `paddle` framework and `paddleocr` in the project, resolving conflicts between `paddle` and `torch`, as well as thread safety issues caused by the `paddle` framework.
    - Added a real-time progress bar during the parsing process to accurately track progress, making the wait less painful.
- 2025/03/03 1.2.1 released, fixed several bugs:
  - Fixed the impact on punctuation marks during full-width to half-width conversion of letters and numbers
  - Fixed caption matching inaccuracies in certain scenarios
@@ -48,7 +48,6 @@
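# Changelog
- 2025/04/03 1.3.0 released. In this version we made many optimizations and improvements:
  - Installation and compatibility optimization
    - By using `paddleocr2torch`, completely replaced the use of the `paddle` framework and `paddleocr` in the project, resolving the conflict between `paddle` and `torch`
    - By removing the use of `layoutlmv3` in layout, resolved the compatibility issues caused by `detectron2`
    - torch version compatibility extended to 2.2~2.6 (excluding 2.5)
    - cuda compatibility supports 11.8/12.4/12.6 (cuda version determined by torch), resolving compatibility issues for some users with 50-series and H-series GPUs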
@@ -61,6 +60,9 @@
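    - Improved running speed on mps devices
  - Parsing effect optimization
    - Updated the mfr model to `unimernet(2503)`, resolving the loss of line breaks in multi-line formulas
  - Usability optimization
    - By using `paddleocr2torch`, completely replaced the use of the `paddle` framework and `paddleocr` in the project, resolving the conflict between `paddle` and `torch` as well as the thread-safety problems caused by the `paddle` framework
    - Added a real-time progress bar to the parsing process, so progress can be tracked precisely and waiting is no longer painful
- 2025/03/03 1.2.1 released, fixed several issues:
  - Fixed the impact on punctuation marks during full-width to half-width conversion of letters and numbers
  - Fixed inaccurate caption matching in certain cases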
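The torch compatibility rule stated in the 1.3.0 entry (2.2 through 2.6, excluding 2.5) can be written as a small check. The sketch below is only an illustration of that rule, not code from MinerU: the helper name `is_supported_torch` and the constants are hypothetical, and it assumes the range applies to every patch release of the listed minor versions.

```python
import re

# Range taken from the 1.3.0 changelog entry: torch 2.2 through 2.6, except 2.5.
# Assumption: all patch releases of a listed minor version are covered.
MIN_SUPPORTED = (2, 2)
MAX_SUPPORTED = (2, 6)
EXCLUDED = {(2, 5)}


def is_supported_torch(version: str) -> bool:
    """Hypothetical helper: True if a torch version string is in the stated range."""
    # Only the leading "major.minor" matters; suffixes such as "+cu124" are ignored.
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    major_minor = (int(match.group(1)), int(match.group(2)))
    return MIN_SUPPORTED <= major_minor <= MAX_SUPPORTED and major_minor not in EXCLUDED
```

In an environment where torch is installed, the check could be called as `is_supported_torch(torch.__version__)`. The function takes a plain string so it can be exercised without importing torch.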