Commit Graph

2828 Commits

Author SHA1 Message Date
Xiaomeng Zhao
e3a4295527 Merge pull request #2513 from myhloli/dev
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
2025-05-24 15:55:39 +08:00
myhloli
73f0530d16 feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements 2025-05-24 15:47:31 +08:00
Xiaomeng Zhao
e92b5b698e Merge pull request #2512 from myhloli/dev
fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion
2025-05-24 13:46:17 +08:00
myhloli
1e01ffcf78 fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion 2025-05-24 13:39:34 +08:00
Xiaomeng Zhao
04b81dc1ab Merge pull request #2511 from myhloli/dev
Merge pull request #10 from myhloli/img2text
2025-05-24 12:01:47 +08:00
Xiaomeng Zhao
90585b67a9 Merge pull request #2510 from myhloli/img2text
feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function
2025-05-24 12:00:34 +08:00
Xiaomeng Zhao
4949dd0c18 Merge pull request #10 from myhloli/img2text
feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function
2025-05-24 11:59:52 +08:00
myhloli
a2b848136b feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function 2025-05-24 11:58:02 +08:00
Xiaomeng Zhao
04a712f940 Merge pull request #2506 from myhloli/dev
feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers
2025-05-23 18:09:27 +08:00
myhloli
27cad566fa feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers 2025-05-23 18:06:21 +08:00
Xiaomeng Zhao
ea3003f6ef Merge pull request #2505 from myhloli/dev
feat(ocr): add PPHGNetV2_B4 backbone and update OCR models
2025-05-23 17:34:56 +08:00
myhloli
93ad41edce feat(ocr): add PPHGNetV2_B4 backbone and update OCR models
- Add PPHGNetV2_B4 backbone to the list of supported backbones
- Introduce new OCR model configuration for PP-OCRv5 with PPHGNetV2_B4
- Update existing model configurations to use the new backbone
- Modify RNN neck to support input with H > 1
- Adjust batch size for inference
2025-05-23 17:06:52 +08:00
Xiaomeng Zhao
8f8b8c4c1f Merge pull request #2501 from myhloli/dev
feat(ocr): add PP-OCRv5 models and update configurations
2025-05-22 17:40:43 +08:00
myhloli
048f6af406 feat(ocr): add PP-OCRv5 models and update configurations
- Add new PP-OCRv5 detection and recognition models
- Update arch_config.yaml with new model architectures
- Modify models_config.yml to include PP-OCRv5 models for ch_lite configuration
- Change dictionary file for ch_lite to ppocrv5_dict.txt
2025-05-22 17:29:47 +08:00
Xiaomeng Zhao
b122b86e8a Merge pull request #2487 from myhloli/dev
fix(ocr_mkcontent): improve image handling and footnote integration in markdown output
2025-05-19 15:47:48 +08:00
myhloli
002333a8d7 fix(ocr_mkcontent): improve image handling and footnote integration in markdown output 2025-05-19 15:45:26 +08:00
Xiaomeng Zhao
e3f22e84ab Merge pull request #2468 from opendatalab/master
master->dev
2025-05-14 10:46:50 +08:00
myhloli
40851b1c61 Update version.py with new version 2025-05-14 02:34:34 +00:00
Xiaomeng Zhao
ea619281ef Merge pull request #2467 from opendatalab/release-1.3.11
Release 1.3.11
magic_pdf-1.3.11-released
2025-05-14 10:33:00 +08:00
Xiaomeng Zhao
212cfcf24a Merge pull request #2466 from opendatalab/dev
docs(changelog): remove pdfminer.six version pinning from release notes
2025-05-14 10:32:31 +08:00
Xiaomeng Zhao
cda85d6262 Merge pull request #2465 from myhloli/dev
docs(changelog): remove pdfminer.six version pinning from release notes
2025-05-14 10:31:55 +08:00
myhloli
51ceb48014 docs(changelog): remove pdfminer.six version pinning from release notes 2025-05-14 10:30:55 +08:00
Xiaomeng Zhao
0b8c614280 Merge pull request #2464 from opendatalab/release-1.3.11
Release 1.3.11
2025-05-14 10:22:18 +08:00
Xiaomeng Zhao
c1b387abe6 Merge pull request #2451 from myhloli/dev
fix(modeling): escape backslashes in LaTeX command descriptions
2025-05-10 00:37:50 +08:00
myhloli
1ab54ac2e3 fix(modeling): escape backslashes in LaTeX command descriptions 2025-05-10 00:34:11 +08:00
myhloli
78a0208425 docs(installation): remove numpy version restriction from PyTorch installation instructions 2025-05-10 00:28:55 +08:00
Xiaomeng Zhao
cd785f6af8 Merge pull request #2450 from myhloli/dev
fix(requirements): update pdfminer.six version and restrict torch version upper limit
2025-05-09 23:58:42 +08:00
myhloli
a8f752f753 fix(requirements): update pdfminer.six version and restrict torch version upper limit 2025-05-09 23:57:22 +08:00
Xiaomeng Zhao
65f332ffae Merge pull request #2449 from myhloli/dev
fix(setup): update python_requires to support Python 3.10 to 3.13
2025-05-09 23:45:16 +08:00
myhloli
c4b04ae642 Merge remote-tracking branch 'origin/dev' into dev 2025-05-09 23:38:50 +08:00
myhloli
3858d918dd fix(setup): update python_requires to support Python 3.10 to 3.13 2025-05-09 23:38:37 +08:00
Xiaomeng Zhao
70696165c7 Merge pull request #2446 from myhloli/dev
fix(Dockerfile): update modelscope installation command to use mirror
2025-05-09 18:23:08 +08:00
myhloli
b799d302c2 Merge remote-tracking branch 'origin/dev' into dev 2025-05-09 17:35:01 +08:00
myhloli
9351d64a41 fix(Dockerfile): update modelscope installation command to use mirror 2025-05-09 17:33:47 +08:00
Xiaomeng Zhao
3230793b55 Merge pull request #2440 from myhloli/dev
docs(installation): update Python version and CUDA installation instructions
2025-05-09 11:10:09 +08:00
myhloli
9f0d45bb58 docs(installation): update Python version and CUDA installation instructions 2025-05-09 10:48:14 +08:00
Xiaomeng Zhao
6c9645aa0c Merge pull request #2437 from myhloli/dev
docs(README): reorder installation commands for clarity
2025-05-08 18:56:34 +08:00
myhloli
96fb646a86 Merge remote-tracking branch 'origin/dev' into dev 2025-05-08 18:55:49 +08:00
myhloli
71a429a32e docs(README): reorder installation commands for clarity 2025-05-08 18:54:39 +08:00
Xiaomeng Zhao
201e338b3a Merge pull request #2429 from myhloli/dev
feat(modeling): add regex patterns for LaTeX symbol replacements
2025-05-08 11:27:57 +08:00
myhloli
2a28f604c6 feat(modeling): add regex patterns for LaTeX symbol replacements 2025-05-08 11:26:42 +08:00
Xiaomeng Zhao
38d0a622d9 Merge pull request #2423 from myhloli/dev
feat(modeling): add 'protect' command to removal patterns
2025-05-06 18:22:18 +08:00
myhloli
a8ca183094 feat(modeling): add 'protect' command to removal patterns 2025-05-06 18:21:03 +08:00
Xiaomeng Zhao
11bf98d0aa Merge pull request #2411 from CharlesKeeling65/patch-1
Update app.py: Fix parameter parsing in /file_parse endpoint
2025-04-30 17:51:08 +08:00
github-actions[bot]
50700646e4 @CharlesKeeling65 has signed the CLA in opendatalab/MinerU#2411 2025-04-30 09:25:44 +00:00
Wang Yubo
862891e294 Update app.py: Fix parameter parsing in /file_parse endpoint
I have updated the `/file_parse` endpoint in `app.py` to correctly handle boolean and string parameters when they are sent via `multipart/form-data` requests (commonly used for file uploads). Previously, these parameters were not being properly parsed because FastAPI expects them to be passed as query or JSON body parameters by default.

### Changes Made:
- Added `Form(...)` to all non-file parameters (`parse_method`, `is_json_md_dump`, `output_dir`, and return flags like `return_layout`, etc.).
- This ensures that FastAPI correctly reads these fields from form-data, allowing clients to send both files and structured configuration options in the same request.

### Why This Change Was Needed:
- When using `requests.post(..., data=data, files=files)`, the `data` dictionary is sent as form-encoded data.
- Without explicitly declaring these fields with `Form(...)`, FastAPI does not bind them correctly, leading to default values always being used (e.g., `False` for boolean flags).
- This change allows the API to accurately reflect the client's intent and enables features like `return_layout`, `return_images`, etc., to work as expected.

This update improves compatibility with HTTP clients that rely on standard form-based file upload mechanisms while preserving the existing behavior of the API.
2025-04-30 17:15:54 +08:00
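The reasoning in commit 862891e294 rests on where a `multipart/form-data` request carries its non-file parameters: they travel as parts of the request body next to the file, not in the query string or a JSON body. The following is a minimal, standard-library-only sketch of that wire format, not the project's code. The helper names `build_multipart` and `parse_form_fields`, the value `"ocr"`, and the file name `a.pdf` are hypothetical; only the field names `parse_method` and `return_layout` come from the commit message.

```python
import uuid
from email.parser import BytesParser
from email.policy import default


def build_multipart(fields, files):
    """Encode text fields and files as a multipart/form-data body (hypothetical helper)."""
    boundary = uuid.uuid4().hex
    lines = []
    # Each non-file parameter becomes its own body part.
    for name, value in fields.items():
        lines.append(f"--{boundary}".encode())
        lines.append(f'Content-Disposition: form-data; name="{name}"'.encode())
        lines.append(b"")
        lines.append(str(value).encode())
    # File parts additionally carry a filename and a content type.
    for name, (filename, content) in files.items():
        lines.append(f"--{boundary}".encode())
        lines.append(
            f'Content-Disposition: form-data; name="{name}"; filename="{filename}"'.encode()
        )
        lines.append(b"Content-Type: application/octet-stream")
        lines.append(b"")
        lines.append(content)
    lines.append(f"--{boundary}--".encode())
    lines.append(b"")
    content_type = f"multipart/form-data; boundary={boundary}"
    return content_type, b"\r\n".join(lines)


def parse_form_fields(content_type, body):
    """Return the non-file fields found in a multipart/form-data body (hypothetical helper)."""
    header = f"Content-Type: {content_type}\r\nMIME-Version: 1.0\r\n\r\n".encode()
    message = BytesParser(policy=default).parsebytes(header + body)
    fields = {}
    for part in message.iter_parts():
        # Parts without a filename are plain form fields; their values are strings.
        if part.get_filename() is None:
            name = part.get_param("name", header="content-disposition")
            fields[name] = part.get_content().strip()
    return fields


content_type, body = build_multipart(
    {"parse_method": "ocr", "return_layout": "true"},  # assumed example values
    {"file": ("a.pdf", b"%PDF-1.4")},  # hypothetical upload
)
print(parse_form_fields(content_type, body))
```

Because the flags arrive as body parts, a server that looks for them only in the query string never sees them and falls back to its defaults. That is the behavior the commit message describes, and declaring the parameters with `Form(...)` is the fix it reports.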
Xiaomeng Zhao
f0b66d3aab Merge pull request #2410 from myhloli/dev
feat(model): add logging for batch image processing
2025-04-30 17:09:49 +08:00
myhloli
b29b73af21 feat(model): add logging for batch image processing
- Add logger info for each batch processed
- Include batch number and page count in log message
2025-04-30 17:08:20 +08:00
Xiaomeng Zhao
5e8656c74f Merge pull request #2406 from opendatalab/master
update version
2025-04-29 16:09:37 +08:00
myhloli
2aaf2310f2 Update version.py with new version 2025-04-29 08:06:04 +00:00