Commit Graph

2627 Commits

Author SHA1 Message Date
Xiaomeng Zhao
ccf2ea04cb Merge pull request #2156 from opendatalab/dev
Dev
2025-04-08 18:16:07 +08:00
Xiaomeng Zhao
564991512c Merge branch 'release-1.3.1' into dev 2025-04-08 18:16:01 +08:00
Xiaomeng Zhao
a1595f1912 Merge pull request #2155 from myhloli/dev
docs: update version number in README files
2025-04-08 18:15:17 +08:00
myhloli
bc0ff1acb0 docs: update version number in README files
- Correct version number from 1.3.2 to 1.3.1 in both README.md and README_zh-CN.md
- Update changelog entries for the latest release
2025-04-08 18:14:29 +08:00
Xiaomeng Zhao
b3ac3ac148 Merge branch 'master' into release-1.3.2 2025-04-08 18:11:16 +08:00
Xiaomeng Zhao
2c7094ff3d Merge pull request #2153 from opendatalab/dev
Dev
2025-04-08 18:10:16 +08:00
Xiaomeng Zhao
0ed231cb8b Merge pull request #2152 from myhloli/dev
docs(README): update version number and changelog in README files
2025-04-08 18:09:53 +08:00
myhloli
bd4728aaeb docs(README): update version number and changelog in README files
- Update version number from 1.3.1 to 1.3.2
2025-04-08 18:09:05 +08:00
Xiaomeng Zhao
2813e59905 Merge pull request #2151 from myhloli/dev
refactor(ocr): improve OCR score precision to three decimal places
2025-04-08 18:06:31 +08:00
myhloli
ea730ae2e9 refactor(ocr): improve OCR score precision to three decimal places
- Update OCR score formatting in batch_analyze.py and pdf_parse_union_core_v2.py
- Change score rounding method to preserve three decimal places
- Enhance accuracy representation without significantly altering the score value
2025-04-08 18:02:03 +08:00
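The rounding change described in ea730ae2e9 could look roughly like the sketch below. The function name `format_ocr_score` is hypothetical and is not taken from batch_analyze.py or pdf_parse_union_core_v2.py; the actual code may format the score differently.

```python
def format_ocr_score(score: float) -> float:
    """Round an OCR confidence score to three decimal places.

    Hypothetical helper: it only illustrates the idea of keeping three
    decimals so that close scores remain distinguishable.
    """
    return round(float(score), 3)


if __name__ == "__main__":
    print(format_ocr_score(0.123456))
```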
myhloli
0ab29cdbee docs(README): update version number in release notes
- Update version from 1.3.1 to 1.3.2 in both English and Chinese README files
- Keep other content unchanged
2025-04-08 17:37:39 +08:00
Xiaomeng Zhao
44665d3966 Update python-package.yml 2025-04-08 17:35:39 +08:00
myhloli
79feb926b7 Update version.py with new version 2025-04-08 09:23:09 +00:00
Xiaomeng Zhao
a2cde43b57 Merge pull request #2146 from opendatalab/release-1.3.1
Release 1.3.1
2025-04-08 17:20:21 +08:00
Xiaomeng Zhao
b8856ca96a Merge pull request #2148 from opendatalab/dev
Dev
2025-04-08 17:03:27 +08:00
Xiaomeng Zhao
098cf1df60 Merge pull request #2147 from myhloli/dev
docs: update badges and project URLs- Update PyPI version badge to us…
2025-04-08 17:02:43 +08:00
myhloli
90f0e7370a docs: update badges and project URLs
- Update PyPI version badge to use shields.io
- Add project URLs in setup.py for better discoverability
- Make consistent changes across README.md and README_zh-CN.md
2025-04-08 17:01:41 +08:00
Xiaomeng Zhao
714504864e Update python-package.yml 2025-04-08 16:49:56 +08:00
Xiaomeng Zhao
87fd4c2806 Update bug_report.yml 2025-04-08 16:49:02 +08:00
Xiaomeng Zhao
3251c73250 Merge pull request #2145 from opendatalab/dev
fix(table): add model path for slanet-plus to resolve RapidTableError
2025-04-08 16:47:45 +08:00
Xiaomeng Zhao
697da27cf7 Merge pull request #2144 from myhloli/dev
fix(table): add model path for slanet-plus to resolve RapidTableError
2025-04-08 16:47:09 +08:00
myhloli
e327e9bad5 fix(table): add model path for slanet-plus to resolve RapidTableError
- Import os and pathlib modules to handle file paths
- Define the path to the slanet-plus model
- Update RapidTableInput initialization to include the model path
2025-04-08 16:46:01 +08:00
Xiaomeng Zhao
99d5c022c4 Merge pull request #2142 from myhloli/dev
update 1.3.1
2025-04-08 16:13:28 +08:00
myhloli
7b61b418a3 ci: update Python version support and installation process
- Add support for Python 3.11, 3.12, and 3.13
- Replace requirements.txt based installation with editable install
2025-04-08 16:10:07 +08:00
myhloli
4fd8d626c4 docs(install): update Python version requirements and simplify torch installation
- Update Python version requirements to >=3.10
- Simplify torch installation command
- Remove numpy version restriction
- Update CUDA compatibility information
- Adjust environment creation commands across multiple documentation files
2025-04-08 16:06:02 +08:00
myhloli
cf6fa12767 build(setup): remove rapid_table dependency
- Remove rapid_table from install_requires in setup.py
2025-04-08 14:24:15 +08:00
myhloli
de4bc5a32d ci: update issue template options for Python version and dependency version
- Add "3.13" option for Python version
- Remove "3.9" option for Python version
- Update dependency version options:
  - Remove "0.8.x", "0.9.x", "0.10.x"
  - Add "1.1.x", "1.2.x", "1.3.x"
2025-04-08 14:22:06 +08:00
myhloli
9b5d2796f8 build(deps): update dependencies and add support for old Linux systems
- Update transformers to exclude version 4.51.0 due to compatibility issues
- Rapid table version range expanded to >=1.0.5,<2.0.0
- Add separate 'full_old_linux' extras_require for better support of older Linux systems
- Update matplotlib version requirements for different platforms
- Remove platform-specific paddlepaddle versions,
2025-04-08 14:18:49 +08:00
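As a rough illustration of the constraints named in 9b5d2796f8, the fragment below writes them as PEP 440 requirement strings in the style of setup.py. Only the specifiers quoted in the commit messages come from this log; which list each requirement belongs to, the `full` extra, and the reuse of the `rapid_table==1.0.3` pin from 0f0591cf8f for `full_old_linux` are assumptions.

```python
# Hypothetical setup.py fragment; the grouping of requirements is assumed.
install_requires = [
    # "!=" excludes a single release while leaving all others allowed.
    "transformers!=4.51.0",
]

extras_require = {
    # Range quoted in the commit message: at least 1.0.5, below 2.0.0.
    "full": ["rapid_table>=1.0.5,<2.0.0"],
    # Assumption: older Linux systems keep the exact pin from 0f0591cf8f.
    "full_old_linux": ["rapid_table==1.0.3"],
}
```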
myhloli
0f0591cf8f build(old_linux): add rapid_table dependency for PDF conversion
- Add rapid_table==1.0.3 to old_linux specific dependencies
- This version is compatible with Linux systems from 2019 and earlier
- Newer versions of rapid_table depend on onnxruntime, which is not supported on older Linux systems
2025-04-08 11:58:38 +08:00
Xiaomeng Zhao
cf6ffc6b1e Merge pull request #2128 from myhloli/dev
fix(model): improve VRAM detection and handling
2025-04-07 18:18:09 +08:00
myhloli
d32a63cada fix(model): improve VRAM detection and handling
- Refactor VRAM detection logic for better readability and efficiency
- Add fallback mechanism for unknown VRAM sizes
- Improve device checking in get_vram function
2025-04-07 18:15:37 +08:00
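A minimal sketch of VRAM detection with a fallback for unknown sizes, in the spirit of d32a63cada. This is not the project's `get_vram` implementation: the names `get_vram_gb` and `batch_ratio_for` and the thresholds are hypothetical, and the sketch assumes PyTorch may or may not be installed.

```python
from typing import Optional


def get_vram_gb(device: str) -> Optional[int]:
    """Return total VRAM in GiB for a CUDA device, or None if unknown."""
    # Only CUDA devices are queried here; anything else is "unknown".
    if not device.startswith("cuda"):
        return None
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    props = torch.cuda.get_device_properties(torch.device(device))
    return round(props.total_memory / (1024 ** 3))


def batch_ratio_for(vram_gb: Optional[int]) -> int:
    """Pick a batch multiplier; thresholds are illustrative only."""
    if vram_gb is None:
        # Fallback: be conservative when the VRAM size cannot be determined.
        return 1
    if vram_gb >= 16:
        return 4
    if vram_gb >= 8:
        return 2
    return 1


if __name__ == "__main__":
    print(batch_ratio_for(get_vram_gb("cpu")))
```

Returning `None` for an unknown size keeps "could not detect" distinct from "detected a small amount", so the caller decides how conservative the fallback should be.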
Xiaomeng Zhao
dfb3cbfb17 Merge pull request #2126 from icecraft/fix/image_ds_add_lang
fix: image dataset add lang field
2025-04-07 16:57:49 +08:00
icecraft
e36a083dc3 fix: image dataset add lang field 2025-04-07 15:40:06 +08:00
Xiaomeng Zhao
980f5c8cd7 Merge pull request #2125 from opendatalab/dev
docs: update torchvision version in CUDA installation guide
2025-04-07 15:26:13 +08:00
Xiaomeng Zhao
f442adfc95 Merge pull request #2124 from myhloli/dev
docs: update torchvision version in CUDA installation guide
2025-04-07 14:54:30 +08:00
myhloli
d4cda0a8c2 docs: update torchvision version in CUDA installation guide
- Update torchvision version from 0.21.1 to 0.21.0 in Windows CUDA acceleration guides
- Update both English and Chinese versions of the documentation
2025-04-07 14:53:25 +08:00
Xiaomeng Zhao
60fdf851a4 Merge pull request #2115 from myhloli/dev
build: remove accelerate dependency
2025-04-06 22:25:01 +08:00
myhloli
a10b9aec74 build: remove accelerate dependency
- Remove accelerate package from requirements.txt
- This change ensures only necessary external dependencies are introduced
2025-04-06 22:24:23 +08:00
Xiaomeng Zhao
e3261b0eea Merge pull request #2114 from myhloli/dev
build(deps): add accelerate package and update requirements https://github.com/opendatalab/MinerU/issues/2112
2025-04-06 22:17:20 +08:00
myhloli
09632dddc1 build(deps): add accelerate package and update requirements
- Add accelerate package to support model training acceleration
- Update requirements.txt to include new dependency
2025-04-06 22:16:01 +08:00
Xiaomeng Zhao
c5329a0722 Merge pull request #2093 from opendatalab/master
master -> dev
2025-04-03 23:33:35 +08:00
myhloli
d629ce04bb Update version.py with new version 2025-04-03 15:27:29 +00:00
Xiaomeng Zhao
3963b96583 Merge pull request #2091 from opendatalab/release-1.3.0
Release 1.3.0
magic_pdf-1.3.0-released
2025-04-03 23:24:27 +08:00
Xiaomeng Zhao
1cd50125ed Merge pull request #2090 from opendatalab/dev
docs(readme): update release notes for version 1.3.0
2025-04-03 23:23:55 +08:00
Xiaomeng Zhao
1a1b8fdb2a Merge pull request #2089 from myhloli/dev
docs(readme): update release notes for version 1.3.0
2025-04-03 23:23:22 +08:00
myhloli
4067f6fdf4 docs(readme): update changelog and highlight usability improvements
- Remove duplicate entries for paddleocr2torch and thread safety
- Add new entry for real-time progress bar implementation
- Update mfr model to unimernet(2503)
- Extend torch version compatibility
- Enhance cuda support for various GPU models
- Improve parsing speed on MPS devices
2025-04-03 23:22:35 +08:00
myhloli
5c2e25acd4 docs(readme): update release notes for version 1.3.0
- Update release notes in both English and Chinese README files
- Highlight major optimizations and improvements in version 1.3.0
- Clarify compatibility changes for torch, CUDA, and Python versions
- Emphasize performance improvements and parsing speed enhancements
- Mention specific bug fixes and parsing effect optimizations
2025-04-03 23:17:53 +08:00
Xiaomeng Zhao
41d96cd89a Merge pull request #2065 from opendatalab/release-1.3.0
Release 1.3.0
2025-04-03 23:06:41 +08:00
Xiaomeng Zhao
dd96663c6b Merge pull request #2088 from opendatalab/dev
fix: support non-pdf file in batch mode
2025-04-03 18:24:04 +08:00
Xiaomeng Zhao
bb40b9b6f7 Merge pull request #2087 from icecraft/fix/convert_image_with_pymupdf
fix: convert image with pymupdf
2025-04-03 18:17:10 +08:00