| myhloli | 6cecafd99d | refactor: add environment variable check to control pipeline enablement in OCR processing | 2025-12-23 18:11:24 +08:00 |
| myhloli | a58cb06a6d | refactor: update batch ratio documentation for clarity and adjust memory thresholds | 2025-12-23 18:01:25 +08:00 |
| myhloli | 361f949d4a | refactor: update batch ratio documentation for clarity and adjust memory thresholds | 2025-12-23 17:59:58 +08:00 |
| myhloli | 914770d651 | refactor: enhance batch ratio documentation and adjust GPU memory thresholds | 2025-12-23 17:40:06 +08:00 |
| myhloli | 447ffcd32f | refactor: implement dynamic batch ratio based on GPU memory and environment variable | 2025-12-23 17:00:02 +08:00 |
| myhloli | 408d94ed58 | refactor: add VastAI support documentation to index and create new VastAI.md file | 2025-12-23 16:31:12 +08:00 |
| myhloli | e48e1619f9 | refactor: improve language support descriptions in OCR input options | 2025-12-23 11:32:42 +08:00 |
| myhloli | 13fef5319c | Merge remote-tracking branch 'origin/dev' into dev | 2025-12-23 10:19:47 +08:00 |
| myhloli | e5ee3136d4 | refactor: enhance backend info descriptions and improve language support in parsing logic | 2025-12-23 10:18:21 +08:00 |
| myhloli | 416181700e | refactor: change conditional statements from 'if' to 'elif' for backend checks | 2025-12-22 20:03:32 +08:00 |
| myhloli | 1e4a763f7b | refactor: add support for '(cid:)' in OCR text recognition logic | 2025-12-22 20:00:00 +08:00 |
| myhloli | 09926d8a57 | refactor: update Dockerfile tags for A2 and A3 compatibility | 2025-12-22 19:35:59 +08:00 |
| myhloli | 54e28075c2 | refactor: increase height of Markdown component for improved visibility | 2025-12-22 19:23:27 +08:00 |
| myhloli | 1fa4e06974 | refactor: update Examples component to use label for better i18n support | 2025-12-22 18:42:03 +08:00 |
| myhloli | c163d0d7bd | refactor: update header.html to include support prompt and change file opening to utf-8 encoding | 2025-12-22 17:15:12 +08:00 |
| myhloli | 91f82a4436 | refactor: implement i18n support for UI labels and tooltips in gradio_app | 2025-12-22 16:33:42 +08:00 |
| myhloli | eabdef1824 | refactor: improve info tooltip for OCR language checkbox clarity | 2025-12-22 15:38:48 +08:00 |
| myhloli | 82ba2c7fac | refactor: enhance OCR language handling and improve backend info tooltips | 2025-12-22 15:16:37 +08:00 |
| myhloli | b5ba2ba4c9 | refactor: improve OCR language dropdown label and add info tooltip for clarity | 2025-12-22 01:09:46 +08:00 |
| myhloli | 9efaf9197f | refactor: update default backend option to hybrid-auto-engine in client and fast_api modules | 2025-12-22 00:53:43 +08:00 |
| myhloli | 996be34534 | refactor: enhance batch processing and IOU filtering in hybrid analysis | 2025-12-22 00:41:52 +08:00 |
| myhloli | 37a43e3318 | Merge remote-tracking branch 'origin/dev' into dev | 2025-12-19 18:07:22 +08:00 |
| myhloli | 68f64aeef1 | fix: update mineru-vl-utils version to 0.1.18 in pyproject.toml | 2025-12-19 18:06:27 +08:00 |
| Xiaomeng Zhao | b2f0bf5c10 | Merge pull request #22 from myhloli/refactor_2.7<br>Refactor 2.7 | 2025-12-19 18:03:01 +08:00 |
| Xiaomeng Zhao | edb5f8d26b | Update mineru/backend/vlm/vlm_middle_json_mkcontent.py<br>Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> | 2025-12-19 18:02:49 +08:00 |
| myhloli | 637227fafd | refactor: remove unused line_content variable from list block processing | 2025-12-19 18:01:05 +08:00 |
| myhloli | 4dc2452063 | refactor: integrate inline formula handling and update backend options for improved recognition | 2025-12-19 17:44:19 +08:00 |
| myhloli | c9eb7d4ca0 | refactor: update backend descriptions to include hybrid options for improved clarity | 2025-12-19 10:46:26 +08:00 |
| myhloli | c72d7de61f | refactor: enhance VLM engine selection logic for cross-platform compatibility | 2025-12-19 01:42:49 +08:00 |
| myhloli | 04fdc91dc9 | refactor: enhance backend options and streamline OCR processing logic | 2025-12-18 20:46:53 +08:00 |
| myhloli | ab8472e6a5 | fix: use deepcopy for reference_row to prevent mutation during row checks in table_merge.py | 2025-12-18 18:08:08 +08:00 |
| myhloli | 9f5417e223 | refactor: use deepcopy for reference_row in table merging logic to prevent mutation | 2025-12-18 18:05:23 +08:00 |
| myhloli | 15cf985b73 | refactor: enhance hybrid processing by integrating language handling and dynamic parse method selection | 2025-12-18 17:09:59 +08:00 |
| myhloli | 5c7f872488 | refactor: add character utility functions for full-to-half conversion and hyphen detection | 2025-12-18 15:59:45 +08:00 |
| myhloli | 990b7a4c78 | refactor: improve text handling and span merging logic in merge_para_with_text_v2 function | 2025-12-18 15:53:37 +08:00 |
| myhloli | 6fca3d3df1 | refactor: add full-to-half character conversion and hyphen handling in merge_para_with_text function | 2025-12-18 11:54:51 +08:00 |
| myhloli | 1ab0c0f68c | refactor: initialize hybrid_pipeline_model for improved processing flow in hybrid_analyze.py | 2025-12-18 11:24:58 +08:00 |
| myhloli | a3d2d9a64d | refactor: introduce NotExtractType enum for improved block type management in hybrid processing | 2025-12-17 17:06:24 +08:00 |
| myhloli | 265e8b9e20 | refactor: enhance MagicModel and output processing with additional parameters for inline formula spans extraction | 2025-12-17 00:39:57 +08:00 |
| myhloli | 77700dd8b7 | refactor: update MagicModel initialization and add bounding box calculation method | 2025-12-16 19:21:19 +08:00 |
| myhloli | fec1f18487 | refactor: add normalization of polygon coordinates to bounding boxes in hybrid_analyze.py | 2025-12-16 18:59:35 +08:00 |
| myhloli | a55d494b44 | refactor: enhance inline formula and OCR result handling in hybrid_analyze.py and hybrid_model_output_to_middle_json.py | 2025-12-16 18:40:56 +08:00 |
| myhloli | 25df92c628 | refactor: streamline OCR detection and model initialization in hybrid_analyze.py and model_init.py | 2025-12-16 17:53:25 +08:00 |
| Xiaomeng Zhao | 39f8ee2d53 | Merge pull request #21 from myhloli/dev<br>Dev | 2025-12-15 18:36:51 +08:00 |
| Xiaomeng Zhao | d570887c76 | Merge pull request #4194 from opendatalab/master<br>master->dev | 2025-12-15 18:32:34 +08:00 |
| myhloli | eed479eb56 | Update version.py with new version | 2025-12-15 10:22:19 +00:00 |
| Xiaomeng Zhao | 4ccb5e04dd | Merge pull request #4193 from opendatalab/dev<br>Dev<br>mineru-2.6.8-released | 2025-12-15 18:20:52 +08:00 |
| Xiaomeng Zhao | 6096359daf | Merge pull request #4192 from myhloli/dev<br>fix: improve content check and streamline content list generation in vlm_middle_json_mkcontent.py | 2025-12-15 18:19:26 +08:00 |
| myhloli | 107e177223 | fix: improve content check and streamline content list generation in vlm_middle_json_mkcontent.py | 2025-12-15 17:58:11 +08:00 |
| myhloli | deac3ef3b5 | refactor: integrate hybrid backend processing and enhance document analysis functions | 2025-12-15 16:32:38 +08:00 |