Compare commits

...

7 Commits

Author SHA1 Message Date
Xiaomeng Zhao
55fcb7387f Merge pull request #2315 from opendatalab/release-1.3.7
Release 1.3.7
2025-04-22 19:26:03 +08:00
Xiaomeng Zhao
f2169686e1 Merge pull request #2314 from myhloli/dev
refactor(table): replace ocr_engine with lang in table model prediction
2025-04-22 19:25:00 +08:00
myhloli
9c4e779b91 fix(lang|performance): resolve lang parameter issue and speed up OCR/table parsing
- Fix lang parameter ineffectiveness during table parsing model initialization
- Resolve significant slowdown in OCR and table parsing speed in CPU mode
- Update changelog in README.md and README_zh-CN.md
2025-04-22 19:15:29 +08:00
myhloli
8d9070db10 fix(lang|performance): resolve lang parameter issue and speed up OCR/table parsing
- Fix lang parameter ineffectiveness during table parsing model initialization
- Resolve significant slowdown in OCR and table parsing speed in CPU mode
- Update changelog in README.md and README_zh-CN.md
2025-04-22 19:15:01 +08:00
myhloli
69cdea908d fix(ocr): switch to ch_lite model for Chinese OCR on CPU
- Automatically change to ch_lite model when using CPU for Chinese OCR
- This modification improves performance on CPU devices
2025-04-22 19:12:35 +08:00
myhloli
1d1c7ba9ab refactor(table): replace ocr_engine with lang in table model prediction
- Remove OCR engine instantiation inside the loop
- Pass language directly to the table model instead of OCR engine
- Simplify code structure and improve readability
2025-04-22 18:55:10 +08:00
myhloli
4d5fd0ee55 Update version.py with new version 2025-04-21 06:45:36 +00:00
5 changed files with 13 additions and 9 deletions

View File

@@ -48,6 +48,9 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
</div>
# Changelog
- 2025/04/22 1.3.7 Released
- Fixed the issue where the `lang` parameter was ineffective during table parsing model initialization.
- Fixed the significant slowdown in OCR and table parsing speed in `cpu` mode.
- 2025/04/16 1.3.4 Released
- Slightly improved the speed of OCR detection by removing some unused blocks.
- Fixed page-level sorting errors caused by footnotes in certain cases.

View File

@@ -47,6 +47,9 @@
</div>
# 更新记录
- 2025/04/22 1.3.7 发布
- 修复表格解析模型初始化时lang参数失效的问题
- 修复在`cpu`模式下ocr和表格解析速度大幅下降的问题
- 2025/04/16 1.3.4 发布
- 通过移除一些无用的块小幅提升了ocr-det的速度
- 修复部分情况下由footnote导致的页面内排序错误

View File

@@ -1 +1 @@
__version__ = "1.3.5"
__version__ = "1.3.6"

View File

@@ -161,20 +161,13 @@ class BatchAnalyze:
for table_res_dict in tqdm(table_res_list_all_page, desc="Table Predict"):
_lang = table_res_dict['lang']
atom_model_manager = AtomModelSingleton()
ocr_engine = atom_model_manager.get_atom_model(
atom_model_name='ocr',
ocr_show_log=False,
det_db_box_thresh=0.5,
det_db_unclip_ratio=1.6,
lang=_lang
)
table_model = atom_model_manager.get_atom_model(
atom_model_name='table',
table_model_name='rapid_table',
table_model_path='',
table_max_time=400,
device='cpu',
ocr_engine=ocr_engine,
lang=_lang,
table_sub_model_name='slanet_plus'
)
html_code, table_cell_bboxes, logic_points, elapse = table_model.predict(table_res_dict['table_img'])

View File

@@ -53,6 +53,11 @@ class PytorchPaddleOCR(TextSystem):
args = parser.parse_args(args)
self.lang = kwargs.get('lang', 'ch')
device = get_device()
if device == 'cpu' and self.lang == 'ch':
self.lang = 'ch_lite'
if self.lang in latin_lang:
self.lang = 'latin'
elif self.lang in arabic_lang: