4673 Commits

Author SHA1 Message Date
myhloli
13fef5319c Merge remote-tracking branch 'origin/dev' into dev 2025-12-23 10:19:47 +08:00
myhloli
e5ee3136d4 refactor: enhance backend info descriptions and improve language support in parsing logic 2025-12-23 10:18:21 +08:00
Xiaomeng Zhao
a9d6e709a4 Merge pull request #4231 from myhloli/dev
Dev
2025-12-22 20:06:08 +08:00
myhloli
416181700e refactor: change conditional statements from 'if' to 'elif' for backend checks 2025-12-22 20:03:32 +08:00
myhloli
1e4a763f7b refactor: add support for '(cid:)' in OCR text recognition logic 2025-12-22 20:00:00 +08:00
myhloli
09926d8a57 refactor: update Dockerfile tags for A2 and A3 compatibility 2025-12-22 19:35:59 +08:00
myhloli
54e28075c2 refactor: increase height of Markdown component for improved visibility 2025-12-22 19:23:27 +08:00
myhloli
1fa4e06974 refactor: update Examples component to use label for better i18n support 2025-12-22 18:42:03 +08:00
myhloli
c163d0d7bd refactor: update header.html to include support prompt and change file opening to utf-8 encoding 2025-12-22 17:15:12 +08:00
myhloli
91f82a4436 refactor: implement i18n support for UI labels and tooltips in gradio_app 2025-12-22 16:33:42 +08:00
myhloli
eabdef1824 refactor: improve info tooltip for OCR language checkbox clarity 2025-12-22 15:38:48 +08:00
myhloli
82ba2c7fac refactor: enhance OCR language handling and improve backend info tooltips 2025-12-22 15:16:37 +08:00
myhloli
b5ba2ba4c9 refactor: improve OCR language dropdown label and add info tooltip for clarity 2025-12-22 01:09:46 +08:00
myhloli
9efaf9197f refactor: update default backend option to hybrid-auto-engine in client and fast_api modules 2025-12-22 00:53:43 +08:00
myhloli
996be34534 refactor: enhance batch processing and IOU filtering in hybrid analysis 2025-12-22 00:41:52 +08:00
myhloli
37a43e3318 Merge remote-tracking branch 'origin/dev' into dev 2025-12-19 18:07:22 +08:00
myhloli
68f64aeef1 fix: update mineru-vl-utils version to 0.1.18 in pyproject.toml 2025-12-19 18:06:27 +08:00
Xiaomeng Zhao
b2f0bf5c10 Merge pull request #22 from myhloli/refactor_2.7
Refactor 2.7
2025-12-19 18:03:01 +08:00
Xiaomeng Zhao
edb5f8d26b Update mineru/backend/vlm/vlm_middle_json_mkcontent.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-12-19 18:02:49 +08:00
myhloli
637227fafd refactor: remove unused line_content variable from list block processing 2025-12-19 18:01:05 +08:00
myhloli
4dc2452063 refactor: integrate inline formula handling and update backend options for improved recognition 2025-12-19 17:44:19 +08:00
myhloli
c9eb7d4ca0 refactor: update backend descriptions to include hybrid options for improved clarity 2025-12-19 10:46:26 +08:00
myhloli
c72d7de61f refactor: enhance VLM engine selection logic for cross-platform compatibility 2025-12-19 01:42:49 +08:00
myhloli
04fdc91dc9 refactor: enhance backend options and streamline OCR processing logic 2025-12-18 20:46:53 +08:00
myhloli
ab8472e6a5 fix: use deepcopy for reference_row to prevent mutation during row checks in table_merge.py 2025-12-18 18:08:08 +08:00
myhloli
9f5417e223 refactor: use deepcopy for reference_row in table merging logic to prevent mutation 2025-12-18 18:05:23 +08:00
myhloli
15cf985b73 refactor: enhance hybrid processing by integrating language handling and dynamic parse method selection 2025-12-18 17:09:59 +08:00
myhloli
5c7f872488 refactor: add character utility functions for full-to-half conversion and hyphen detection 2025-12-18 15:59:45 +08:00
myhloli
990b7a4c78 refactor: improve text handling and span merging logic in merge_para_with_text_v2 function 2025-12-18 15:53:37 +08:00
myhloli
6fca3d3df1 refactor: add full-to-half character conversion and hyphen handling in merge_para_with_text function 2025-12-18 11:54:51 +08:00
myhloli
1ab0c0f68c refactor: initialize hybrid_pipeline_model for improved processing flow in hybrid_analyze.py 2025-12-18 11:24:58 +08:00
myhloli
a3d2d9a64d refactor: introduce NotExtractType enum for improved block type management in hybrid processing 2025-12-17 17:06:24 +08:00
myhloli
265e8b9e20 refactor: enhance MagicModel and output processing with additional parameters for inline formula spans extraction 2025-12-17 00:39:57 +08:00
myhloli
77700dd8b7 refactor: update MagicModel initialization and add bounding box calculation method 2025-12-16 19:21:19 +08:00
myhloli
fec1f18487 refactor: add normalization of polygon coordinates to bounding boxes in hybrid_analyze.py 2025-12-16 18:59:35 +08:00
myhloli
a55d494b44 refactor: enhance inline formula and OCR result handling in hybrid_analyze.py and hybrid_model_output_to_middle_json.py 2025-12-16 18:40:56 +08:00
myhloli
25df92c628 refactor: streamline OCR detection and model initialization in hybrid_analyze.py and model_init.py 2025-12-16 17:53:25 +08:00
Xiaomeng Zhao
39f8ee2d53 Merge pull request #21 from myhloli/dev
Dev
2025-12-15 18:36:51 +08:00
Xiaomeng Zhao
d570887c76 Merge pull request #4194 from opendatalab/master
master->dev
2025-12-15 18:32:34 +08:00
myhloli
eed479eb56 Update version.py with new version 2025-12-15 10:22:19 +00:00
Xiaomeng Zhao
4ccb5e04dd Merge pull request #4193 from opendatalab/dev
Dev
mineru-2.6.8-released
2025-12-15 18:20:52 +08:00
Xiaomeng Zhao
6096359daf Merge pull request #4192 from myhloli/dev
fix: improve content check and streamline content list generation in vlm_middle_json_mkcontent.py
2025-12-15 18:19:26 +08:00
myhloli
107e177223 fix: improve content check and streamline content list generation in vlm_middle_json_mkcontent.py 2025-12-15 17:58:11 +08:00
myhloli
deac3ef3b5 refactor: integrate hybrid backend processing and enhance document analysis functions 2025-12-15 16:32:38 +08:00
myhloli
05070f2a66 refactor: update comments in compose.yaml for clarity and guidance on engine parameters 2025-12-15 14:15:12 +08:00
Xiaomeng Zhao
da0cdfb717 Merge pull request #4179 from opendatalab/dev
refactor: update comments in compose.yaml for clarity and guidance on engine parameters
2025-12-12 20:29:39 +08:00
Xiaomeng Zhao
58a1bb781a Merge pull request #4178 from myhloli/dev
refactor: update comments in compose.yaml for clarity and guidance on engine parameters
2025-12-12 20:28:32 +08:00
myhloli
5f86767fae refactor: update comments in compose.yaml for clarity and guidance on engine parameters 2025-12-12 20:21:59 +08:00
Xiaomeng Zhao
8aae656686 Merge pull request #4177 from myhloli/dev
comment out test_vlm_transformers_with_default_config function in test_e2e.py
2025-12-12 18:19:34 +08:00
myhloli
a2a25200bc fix: comment out notify_to_feishu step in cli.yml 2025-12-12 18:18:42 +08:00