Commit Graph

4648 Commits

Author SHA1 Message Date
myhloli
bf61e022d8 refactor: update backend options and enhance documentation for hybrid parsing methods 2025-12-25 19:32:06 +08:00
myhloli
984b303dfa refactor: update default backend to hybrid-auto-engine and enhance documentation for parsing options 2025-12-25 19:17:08 +08:00
myhloli
b2c126ef8a refactor: update comments for clarity in hybrid_model_output_to_middle_json.py 2025-12-25 15:46:24 +08:00
myhloli
136cc2fc3b refactor: remove redundant GPU memory allocation message in pipeline_analyze.py 2025-12-25 14:20:58 +08:00
Xiaomeng Zhao
f4eb59c954 Merge pull request #4244 from myhloli/dev (Dev) 2025-12-25 01:19:54 +08:00
myhloli
726e0de6fe refactor: simplify backend choices in client.py 2025-12-24 17:28:01 +08:00
myhloli
edd1656851 refactor: streamline backend choice handling in update_interface function 2025-12-24 17:21:49 +08:00
myhloli
7f6f7d9d97 refactor: improve Markdown and API handling in gradio_app.py 2025-12-24 17:07:28 +08:00
myhloli
5f516ea7dc refactor: add support for hybrid backend in parse directory structure 2025-12-24 16:38:01 +08:00
myhloli
eeea4f38e3 refactor: update GPU model support information in Docker deployment documentation 2025-12-24 14:56:41 +08:00
myhloli
88822c7918 refactor: remove unused GPU capability checks and simplify batch ratio calculation 2025-12-24 14:48:28 +08:00
myhloli
0e4c9aee00 refactor: enhance batch ratio calculation based on GPU compute capability 2025-12-24 14:00:13 +08:00
Xiaomeng Zhao
76f7f778cd Merge pull request #4241 from myhloli/dev (Dev) 2025-12-24 02:11:49 +08:00
Xiaomeng Zhao
41d5b4843a Merge branch 'opendatalab:dev' into dev 2025-12-24 02:08:48 +08:00
myhloli
7b02d8fbf6 remove unuse file 2025-12-23 19:05:55 +08:00
myhloli
6d7d1c3b0c refactor: expand OCR text conditions for category assignment in analysis scripts 2025-12-23 18:36:13 +08:00
myhloli
e2a06bbb0a refactor: add environment variable check to control pipeline enablement in OCR processing 2025-12-23 18:12:12 +08:00
myhloli
6cecafd99d refactor: add environment variable check to control pipeline enablement in OCR processing 2025-12-23 18:11:24 +08:00
myhloli
a58cb06a6d refactor: update batch ratio documentation for clarity and adjust memory thresholds 2025-12-23 18:01:25 +08:00
myhloli
361f949d4a refactor: update batch ratio documentation for clarity and adjust memory thresholds 2025-12-23 17:59:58 +08:00
myhloli
914770d651 refactor: enhance batch ratio documentation and adjust GPU memory thresholds 2025-12-23 17:40:06 +08:00
myhloli
447ffcd32f refactor: implement dynamic batch ratio based on GPU memory and environment variable 2025-12-23 17:00:02 +08:00
Xiaomeng Zhao
135eaf0c4f Merge pull request #4239 from myhloli/dev (Dev) 2025-12-23 16:32:17 +08:00
myhloli
408d94ed58 refactor: add VastAI support documentation to index and create new VastAI.md file 2025-12-23 16:31:12 +08:00
myhloli
e48e1619f9 refactor: improve language support descriptions in OCR input options 2025-12-23 11:32:42 +08:00
myhloli
13fef5319c Merge remote-tracking branch 'origin/dev' into dev 2025-12-23 10:19:47 +08:00
myhloli
e5ee3136d4 refactor: enhance backend info descriptions and improve language support in parsing logic 2025-12-23 10:18:21 +08:00
Xiaomeng Zhao
a9d6e709a4 Merge pull request #4231 from myhloli/dev (Dev) 2025-12-22 20:06:08 +08:00
myhloli
416181700e refactor: change conditional statements from 'if' to 'elif' for backend checks 2025-12-22 20:03:32 +08:00
myhloli
1e4a763f7b refactor: add support for '(cid:)' in OCR text recognition logic 2025-12-22 20:00:00 +08:00
myhloli
09926d8a57 refactor: update Dockerfile tags for A2 and A3 compatibility 2025-12-22 19:35:59 +08:00
myhloli
54e28075c2 refactor: increase height of Markdown component for improved visibility 2025-12-22 19:23:27 +08:00
myhloli
1fa4e06974 refactor: update Examples component to use label for better i18n support 2025-12-22 18:42:03 +08:00
myhloli
c163d0d7bd refactor: update header.html to include support prompt and change file opening to utf-8 encoding 2025-12-22 17:15:12 +08:00
myhloli
91f82a4436 refactor: implement i18n support for UI labels and tooltips in gradio_app 2025-12-22 16:33:42 +08:00
myhloli
eabdef1824 refactor: improve info tooltip for OCR language checkbox clarity 2025-12-22 15:38:48 +08:00
myhloli
82ba2c7fac refactor: enhance OCR language handling and improve backend info tooltips 2025-12-22 15:16:37 +08:00
myhloli
b5ba2ba4c9 refactor: improve OCR language dropdown label and add info tooltip for clarity 2025-12-22 01:09:46 +08:00
myhloli
9efaf9197f refactor: update default backend option to hybrid-auto-engine in client and fast_api modules 2025-12-22 00:53:43 +08:00
myhloli
996be34534 refactor: enhance batch processing and IOU filtering in hybrid analysis 2025-12-22 00:41:52 +08:00
myhloli
37a43e3318 Merge remote-tracking branch 'origin/dev' into dev 2025-12-19 18:07:22 +08:00
myhloli
68f64aeef1 fix: update mineru-vl-utils version to 0.1.18 in pyproject.toml 2025-12-19 18:06:27 +08:00
Xiaomeng Zhao
b2f0bf5c10 Merge pull request #22 from myhloli/refactor_2.7 (Refactor 2.7) 2025-12-19 18:03:01 +08:00
Xiaomeng Zhao
edb5f8d26b Update mineru/backend/vlm/vlm_middle_json_mkcontent.py (Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>) 2025-12-19 18:02:49 +08:00
myhloli
637227fafd refactor: remove unused line_content variable from list block processing 2025-12-19 18:01:05 +08:00
myhloli
4dc2452063 refactor: integrate inline formula handling and update backend options for improved recognition 2025-12-19 17:44:19 +08:00
myhloli
c9eb7d4ca0 refactor: update backend descriptions to include hybrid options for improved clarity 2025-12-19 10:46:26 +08:00
myhloli
c72d7de61f refactor: enhance VLM engine selection logic for cross-platform compatibility 2025-12-19 01:42:49 +08:00
myhloli
04fdc91dc9 refactor: enhance backend options and streamline OCR processing logic 2025-12-18 20:46:53 +08:00
myhloli
ab8472e6a5 fix: use deepcopy for reference_row to prevent mutation during row checks in table_merge.py 2025-12-18 18:08:08 +08:00