Commit Graph

4509 Commits

Author SHA1 Message Date
Xiaomeng Zhao
45f8ad1d5c Merge pull request #4305 from opendatalab/release-2.7.1
Release 2.7.1
mineru-2.7.1-released
2026-01-06 14:47:23 +08:00
Xiaomeng Zhao
b69191ba2b Merge pull request #4304 from opendatalab/dev
Dev
2026-01-06 14:46:18 +08:00
Xiaomeng Zhao
0028514ced Merge pull request #4303 from myhloli/dev
Dev
2026-01-06 14:45:35 +08:00
myhloli
8d8daf6851 fix: add qwen-vl-utils dependency to pyproject.toml 2026-01-06 14:44:53 +08:00
myhloli
815280dd23 fix: update pdfminer.six dependency to resolve CVE-2025-64512 and improve EXIF handling 2026-01-06 14:42:48 +08:00
myhloli
7b52f92aea fix: update pdfminer.six dependency to resolve CVE-2025-64512 and improve EXIF handling 2026-01-06 14:41:47 +08:00
Xiaomeng Zhao
33543b76c9 Merge pull request #4301 from myhloli/dev
Dev
2026-01-06 14:10:08 +08:00
myhloli
ea5f8e98dd fix: update pdfminer.six version to 20251230 in pyproject.toml 2026-01-06 11:54:17 +08:00
myhloli
8996e06448 fix: restore hybrid analyze imports in common.py for backend processing 2026-01-06 11:51:31 +08:00
myhloli
bfb304ef1f fix: improve EXIF handling and save PDF logic in pdf_image_tools.py 2026-01-05 00:27:01 +08:00
Xiaomeng Zhao
17e6016b58 Merge pull request #4283 from kingdomad/fix/image-exif-rotation
fix: add EXIF orientation handling for image inputs
2026-01-04 18:31:06 +08:00
Xiaomeng Zhao
ba06cd14ef Update pdf_image_tools.py 2026-01-04 18:29:51 +08:00
Xiaomeng Zhao
0209ada8d0 Merge pull request #4287 from myhloli/dev
Dev
2026-01-04 15:26:16 +08:00
myhloli
e2140222bc docs: update VastAI.md with new version numbers and improved instructions 2026-01-04 15:24:23 +08:00
myhloli
d679d99192 docs: update heading from '快速开始' to '快速入门' for consistency 2026-01-04 15:16:15 +08:00
Xiaomeng Zhao
4bfcc0b808 Merge pull request #4286 from opendatalab/master
master->dev
2026-01-04 15:12:00 +08:00
Xiaomeng Zhao
ead29489ff Merge pull request #4285 from myhloli/dev
docs: update navigation and terminology in documentation for clarity
2026-01-04 15:11:29 +08:00
myhloli
c01e35b4c6 docs: update navigation and terminology in documentation for clarity 2026-01-04 15:10:37 +08:00
Xiaomeng Zhao
a89249069c Merge pull request #4284 from myhloli/dev
Dev
2026-01-04 14:34:15 +08:00
myhloli
2fc395bcff docs: add reference section to mkdocs.yml for improved documentation structure 2026-01-04 14:33:32 +08:00
史提芬达
0ca244ad62 fix: add EXIF orientation handling for image inputs 2026-01-04 13:41:55 +08:00
myhloli
8acc7dd326 Merge remote-tracking branch 'origin/dev' into dev 2025-12-31 16:57:13 +08:00
myhloli
1cde3fe5ad fix: add additional continuation markers for improved table merging 2025-12-31 16:57:00 +08:00
Xiaomeng Zhao
0a4c87fc22 Merge pull request #4273 from myhloli/dev
fix: update table rows for mineru, mineru-api, and mineru-gradio to reflect correct engine names
2025-12-30 18:52:41 +08:00
myhloli
12d803079f fix: update table rows for mineru, mineru-api, and mineru-gradio to reflect correct engine names 2025-12-30 18:49:52 +08:00
myhloli
8c4b3ef3a2 Update version.py with new version 2025-12-30 10:21:16 +00:00
Xiaomeng Zhao
ed6894c178 Merge pull request #4272 from opendatalab/release-2.7.0
Release 2.7.0
mineru-2.7.0-released
2025-12-30 18:08:29 +08:00
Xiaomeng Zhao
e0b91a4c92 Merge pull request #4271 from myhloli/dev
Dev
2025-12-30 17:58:45 +08:00
myhloli
4195a8b6b9 docs: add reference documentation and update changelog 2025-12-30 17:56:33 +08:00
myhloli
f4b821e509 docs: update usage instructions for MinerU to include GPU and CPU command options 2025-12-30 17:44:37 +08:00
myhloli
c794089abf fix: update installation command for mineru pipeline extension to include quotes 2025-12-30 17:39:52 +08:00
Xiaomeng Zhao
1fd10b9452 Merge pull request #4270 from myhloli/dev
fix: adjust vertical alignment for table headers and content in index.md
2025-12-30 17:32:51 +08:00
myhloli
93ec8fc09c fix: adjust vertical alignment for table headers and content in index.md 2025-12-30 17:29:56 +08:00
Xiaomeng Zhao
1df17918c6 Merge pull request #4268 from myhloli/dev
fix: center-align table headers and content in index.md
2025-12-30 17:26:08 +08:00
myhloli
fd03f1cfef fix: center-align table headers and content in index.md 2025-12-30 17:24:53 +08:00
Xiaomeng Zhao
9d708c5b51 Merge pull request #4267 from myhloli/dev
Dev
2025-12-30 17:03:24 +08:00
Xiaomeng Zhao
146f655c5f Update mineru/backend/vlm/vlm_magic_model.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-12-30 17:02:17 +08:00
myhloli
05b6ed3d8d feat: enhance logging by adding dynamic log level configuration and performance metrics 2025-12-30 16:43:24 +08:00
myhloli
466b85ba3f refactor: remove unused import and update help text for device mode option 2025-12-30 11:12:46 +08:00
myhloli
6f9ef69b34 fix: update Dockerfile to clarify GPU architecture compatibility and specify mineru version 2025-12-30 11:02:22 +08:00
myhloli
55f6731aa6 docs: add usage tip for switching model source in README_zh-CN.md 2025-12-30 10:48:43 +08:00
myhloli
edf422b4f7 fix: update hardware configuration requirement wording in documentation 2025-12-30 10:41:30 +08:00
Xiaomeng Zhao
22c7a84c19 Update index.md 2025-12-30 03:31:37 +08:00
Xiaomeng Zhao
b02f60d772 Update 'High Config Requirements' to 'High Hardware Requirements' 2025-12-30 02:35:44 +08:00
Xiaomeng Zhao
f9f00cd2ee Merge branch 'opendatalab:dev' into dev 2025-12-30 02:29:35 +08:00
myhloli
f7fc7bd928 refactor: fix formatting in hybrid document analysis function 2025-12-29 19:05:33 +08:00
myhloli
037e5f2460 refactor: enhance hybrid backend logic and improve span processing 2025-12-29 19:03:30 +08:00
myhloli
7750d864ed refactor: streamline OCR processing and enable VLM OCR configuration 2025-12-29 17:09:08 +08:00
myhloli
997aab7c55 fix: enhance caption handling to include images and improve gap detection logic 2025-12-28 22:23:39 +08:00
myhloli
190b4ea472 fix: extend special handling for captions and footnotes to include images 2025-12-28 20:08:17 +08:00