Commit Graph

4638 Commits

Author SHA1 Message Date
myhloli
8acc7dd326 Merge remote-tracking branch 'origin/dev' into dev 2025-12-31 16:57:13 +08:00
myhloli
1cde3fe5ad fix: add additional continuation markers for improved table merging 2025-12-31 16:57:00 +08:00
Xiaomeng Zhao
0a4c87fc22 Merge pull request #4273 from myhloli/dev
fix: update table rows for mineru, mineru-api, and mineru-gradio to reflect correct engine names
2025-12-30 18:52:41 +08:00
myhloli
12d803079f fix: update table rows for mineru, mineru-api, and mineru-gradio to reflect correct engine names 2025-12-30 18:49:52 +08:00
myhloli
8c4b3ef3a2 Update version.py with new version 2025-12-30 10:21:16 +00:00
Xiaomeng Zhao
ed6894c178 Merge pull request #4272 from opendatalab/release-2.7.0
Release 2.7.0
mineru-2.7.0-released
2025-12-30 18:08:29 +08:00
Xiaomeng Zhao
e0b91a4c92 Merge pull request #4271 from myhloli/dev
Dev
2025-12-30 17:58:45 +08:00
myhloli
4195a8b6b9 docs: add reference documentation and update changelog 2025-12-30 17:56:33 +08:00
myhloli
f4b821e509 docs: update usage instructions for MinerU to include GPU and CPU command options 2025-12-30 17:44:37 +08:00
myhloli
c794089abf fix: update installation command for mineru pipeline extension to include quotes 2025-12-30 17:39:52 +08:00
Xiaomeng Zhao
1fd10b9452 Merge pull request #4270 from myhloli/dev
fix: adjust vertical alignment for table headers and content in index.md
2025-12-30 17:32:51 +08:00
myhloli
93ec8fc09c fix: adjust vertical alignment for table headers and content in index.md 2025-12-30 17:29:56 +08:00
Xiaomeng Zhao
1df17918c6 Merge pull request #4268 from myhloli/dev
fix: center-align table headers and content in index.md
2025-12-30 17:26:08 +08:00
myhloli
fd03f1cfef fix: center-align table headers and content in index.md 2025-12-30 17:24:53 +08:00
Xiaomeng Zhao
9d708c5b51 Merge pull request #4267 from myhloli/dev
Dev
2025-12-30 17:03:24 +08:00
Xiaomeng Zhao
146f655c5f Update mineru/backend/vlm/vlm_magic_model.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-12-30 17:02:17 +08:00
myhloli
05b6ed3d8d feat: enhance logging by adding dynamic log level configuration and performance metrics 2025-12-30 16:43:24 +08:00
myhloli
466b85ba3f refactor: remove unused import and update help text for device mode option 2025-12-30 11:12:46 +08:00
myhloli
6f9ef69b34 fix: update Dockerfile to clarify GPU architecture compatibility and specify mineru version 2025-12-30 11:02:22 +08:00
myhloli
55f6731aa6 docs: add usage tip for switching model source in README_zh-CN.md 2025-12-30 10:48:43 +08:00
myhloli
edf422b4f7 fix: update hardware configuration requirement wording in documentation 2025-12-30 10:41:30 +08:00
Xiaomeng Zhao
22c7a84c19 更新 index.md 2025-12-30 03:31:37 +08:00
Xiaomeng Zhao
b02f60d772 Update 'High Config Requirements' to 'High Hardware Requirements' 2025-12-30 02:35:44 +08:00
Xiaomeng Zhao
f9f00cd2ee Merge branch 'opendatalab:dev' into dev 2025-12-30 02:29:35 +08:00
myhloli
f7fc7bd928 refactor: fix formatting in hybrid document analysis function 2025-12-29 19:05:33 +08:00
myhloli
037e5f2460 refactor: enhance hybrid backend logic and improve span processing 2025-12-29 19:03:30 +08:00
myhloli
7750d864ed refactor: streamline OCR processing and enable VLM OCR configuration 2025-12-29 17:09:08 +08:00
myhloli
997aab7c55 fix: enhance caption handling to include images and improve gap detection logic 2025-12-28 22:23:39 +08:00
myhloli
190b4ea472 fix: extend special handling for captions and footnotes to include images 2025-12-28 20:08:17 +08:00
myhloli
0dd4c4c4e4 refactor: improve CJK language handling and hyphen management in text processing 2025-12-27 01:00:50 +08:00
myhloli
e54e0c3001 fix: correct hyphen handling based on next line's span case 2025-12-27 00:25:16 +08:00
myhloli
b1aefabbea feat: add bbox_center_distance function and refactor category tying by index 2025-12-26 23:39:57 +08:00
myhloli
0032421167 docs: update memory and disk space requirements in README files for clarity 2025-12-26 17:25:47 +08:00
myhloli
efc428115c refactor: remove vllm engine references and streamline backend choice handling in Gradio app 2025-12-26 16:50:53 +08:00
myhloli
661aebdb2b docs: update Dockerfile comments for GPU architecture compatibility 2025-12-26 16:38:11 +08:00
myhloli
3c4334a37f docs: update installation instructions for lightweight client with vlm-http-client and hybrid-http-client modes 2025-12-26 15:57:46 +08:00
myhloli
5f751d44fb refactor: enhance hyphen handling in text processing for western contexts 2025-12-26 15:46:29 +08:00
myhloli
7496def7a5 fix: update changelog links in README files for accuracy 2025-12-26 14:17:34 +08:00
myhloli
deba6a991f feat: add changelog section to documentation and create changelog file 2025-12-26 11:56:17 +08:00
myhloli
9a355fca02 refactor: remove unnecessary environment variable from Docker run command in VastAI.md 2025-12-26 10:47:56 +08:00
myhloli
bf61e022d8 refactor: update backend options and enhance documentation for hybrid parsing methods 2025-12-25 19:32:06 +08:00
myhloli
984b303dfa refactor: update default backend to hybrid-auto-engine and enhance documentation for parsing options 2025-12-25 19:17:08 +08:00
myhloli
b2c126ef8a refactor: update comments for clarity in hybrid_model_output_to_middle_json.py 2025-12-25 15:46:24 +08:00
myhloli
136cc2fc3b refactor: remove redundant GPU memory allocation message in pipeline_analyze.py 2025-12-25 14:20:58 +08:00
Xiaomeng Zhao
f4eb59c954 Merge pull request #4244 from myhloli/dev
Dev
2025-12-25 01:19:54 +08:00
myhloli
726e0de6fe refactor: simplify backend choices in client.py 2025-12-24 17:28:01 +08:00
myhloli
edd1656851 refactor: streamline backend choice handling in update_interface function 2025-12-24 17:21:49 +08:00
myhloli
7f6f7d9d97 refactor: improve Markdown and API handling in gradio_app.py 2025-12-24 17:07:28 +08:00
myhloli
5f516ea7dc refactor: add support for hybrid backend in parse directory structure 2025-12-24 16:38:01 +08:00
myhloli
eeea4f38e3 refactor: update GPU model support information in Docker deployment documentation 2025-12-24 14:56:41 +08:00