Xiaomeng Zhao
e3a4295527
Merge pull request #2513 from myhloli/dev
...
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
2025-05-24 15:55:39 +08:00
myhloli
73f0530d16
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
2025-05-24 15:47:31 +08:00
Xiaomeng Zhao
e92b5b698e
Merge pull request #2512 from myhloli/dev
...
fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion
2025-05-24 13:46:17 +08:00
myhloli
1e01ffcf78
fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion
2025-05-24 13:39:34 +08:00
Xiaomeng Zhao
04b81dc1ab
Merge pull request #2511 from myhloli/dev
...
Merge pull request #10 from myhloli/img2text
2025-05-24 12:01:47 +08:00
Xiaomeng Zhao
90585b67a9
Merge pull request #2510 from myhloli/img2text
...
feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function
2025-05-24 12:00:34 +08:00
Xiaomeng Zhao
4949dd0c18
Merge pull request #10 from myhloli/img2text
...
feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function
2025-05-24 11:59:52 +08:00
myhloli
a2b848136b
feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function
2025-05-24 11:58:02 +08:00
Xiaomeng Zhao
04a712f940
Merge pull request #2506 from myhloli/dev
...
feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers
2025-05-23 18:09:27 +08:00
myhloli
27cad566fa
feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers
2025-05-23 18:06:21 +08:00
Xiaomeng Zhao
ea3003f6ef
Merge pull request #2505 from myhloli/dev
...
feat(ocr): add PPHGNetV2_B4 backbone and update OCR models
2025-05-23 17:34:56 +08:00
myhloli
93ad41edce
feat(ocr): add PPHGNetV2_B4 backbone and update OCR models
...
- Add PPHGNetV2_B4 backbone to the list of supported backbones
- Introduce new OCR model configuration for PP-OCRv5 with PPHGNetV2_B4
- Update existing model configurations to use the new backbone
- Modify RNN neck to support input with H > 1
- Adjust batch size for inference
2025-05-23 17:06:52 +08:00
Xiaomeng Zhao
8f8b8c4c1f
Merge pull request #2501 from myhloli/dev
...
feat(ocr): add PP-OCRv5 models and update configurations
2025-05-22 17:40:43 +08:00
myhloli
048f6af406
feat(ocr): add PP-OCRv5 models and update configurations
...
- Add new PP-OCRv5 detection and recognition models
- Update arch_config.yaml with new model architectures
- Modify models_config.yml to include PP-OCRv5 models for ch_lite configuration- Change dictionary file for ch_lite to ppocrv5_dict.txt
2025-05-22 17:29:47 +08:00
Xiaomeng Zhao
b122b86e8a
Merge pull request #2487 from myhloli/dev
...
fix(ocr_mkcontent): improve image handling and footnote integration in markdown output
2025-05-19 15:47:48 +08:00
myhloli
002333a8d7
fix(ocr_mkcontent): improve image handling and footnote integration in markdown output
2025-05-19 15:45:26 +08:00
Xiaomeng Zhao
e3f22e84ab
Merge pull request #2468 from opendatalab/master
...
master->dev
2025-05-14 10:46:50 +08:00
myhloli
40851b1c61
Update version.py with new version
2025-05-14 02:34:34 +00:00
Xiaomeng Zhao
ea619281ef
Merge pull request #2467 from opendatalab/release-1.3.11
...
Release 1.3.11
magic_pdf-1.3.11-released
2025-05-14 10:33:00 +08:00
Xiaomeng Zhao
212cfcf24a
Merge pull request #2466 from opendatalab/dev
...
docs(changelog): remove pdfminer.six version pinning from release notes
2025-05-14 10:32:31 +08:00
Xiaomeng Zhao
cda85d6262
Merge pull request #2465 from myhloli/dev
...
docs(changelog): remove pdfminer.six version pinning from release notes
2025-05-14 10:31:55 +08:00
myhloli
51ceb48014
docs(changelog): remove pdfminer.six version pinning from release notes
2025-05-14 10:30:55 +08:00
Xiaomeng Zhao
0b8c614280
Merge pull request #2464 from opendatalab/release-1.3.11
...
Release 1.3.11
2025-05-14 10:22:18 +08:00
Xiaomeng Zhao
c1b387abe6
Merge pull request #2451 from myhloli/dev
...
fix(modeling): escape backslashes in LaTeX command descriptions
2025-05-10 00:37:50 +08:00
myhloli
1ab54ac2e3
fix(modeling): escape backslashes in LaTeX command descriptions
2025-05-10 00:34:11 +08:00
myhloli
78a0208425
docs(installation): remove numpy version restriction from PyTorch installation instructions
2025-05-10 00:28:55 +08:00
Xiaomeng Zhao
cd785f6af8
Merge pull request #2450 from myhloli/dev
...
fix(requirements): update pdfminer.six version and restrict torch version upper limit
2025-05-09 23:58:42 +08:00
myhloli
a8f752f753
fix(requirements): update pdfminer.six version and restrict torch version upper limit
2025-05-09 23:57:22 +08:00
Xiaomeng Zhao
65f332ffae
Merge pull request #2449 from myhloli/dev
...
fix(setup): update python_requires to support Python 3.10 to 3.13
2025-05-09 23:45:16 +08:00
myhloli
c4b04ae642
Merge remote-tracking branch 'origin/dev' into dev
2025-05-09 23:38:50 +08:00
myhloli
3858d918dd
fix(setup): update python_requires to support Python 3.10 to 3.13
2025-05-09 23:38:37 +08:00
Xiaomeng Zhao
70696165c7
Merge pull request #2446 from myhloli/dev
...
fix(Dockerfile): update modelscope installation command to use mirror
2025-05-09 18:23:08 +08:00
myhloli
b799d302c2
Merge remote-tracking branch 'origin/dev' into dev
2025-05-09 17:35:01 +08:00
myhloli
9351d64a41
fix(Dockerfile): update modelscope installation command to use mirror
2025-05-09 17:33:47 +08:00
Xiaomeng Zhao
3230793b55
Merge pull request #2440 from myhloli/dev
...
docs(installation): update Python version and CUDA installation instructions
2025-05-09 11:10:09 +08:00
myhloli
9f0d45bb58
docs(installation): update Python version and CUDA installation instructions
2025-05-09 10:48:14 +08:00
Xiaomeng Zhao
6c9645aa0c
Merge pull request #2437 from myhloli/dev
...
docs(README): reorder installation commands for clarity
2025-05-08 18:56:34 +08:00
myhloli
96fb646a86
Merge remote-tracking branch 'origin/dev' into dev
2025-05-08 18:55:49 +08:00
myhloli
71a429a32e
docs(README): reorder installation commands for clarity
2025-05-08 18:54:39 +08:00
Xiaomeng Zhao
201e338b3a
Merge pull request #2429 from myhloli/dev
...
feat(modeling): add regex patterns for LaTeX symbol replacements
2025-05-08 11:27:57 +08:00
myhloli
2a28f604c6
feat(modeling): add regex patterns for LaTeX symbol replacements
2025-05-08 11:26:42 +08:00
Xiaomeng Zhao
38d0a622d9
Merge pull request #2423 from myhloli/dev
...
feat(modeling): add 'protect' command to removal patterns
2025-05-06 18:22:18 +08:00
myhloli
a8ca183094
feat(modeling): add 'protect' command to removal patterns
2025-05-06 18:21:03 +08:00
Xiaomeng Zhao
11bf98d0aa
Merge pull request #2411 from CharlesKeeling65/patch-1
...
Update app.py: Fix parameter parsing in /file_parse endpoint
2025-04-30 17:51:08 +08:00
github-actions[bot]
50700646e4
@CharlesKeeling65 has signed the CLA in opendatalab/MinerU#2411
2025-04-30 09:25:44 +00:00
Wang Yubo
862891e294
Update app.py: Fix parameter parsing in /file_parse endpoint
...
I have updated the `/file_parse` endpoint in `app.py` to correctly handle boolean and string parameters when they are sent via `multipart/form-data` requests (commonly used for file uploads). Previously, these parameters were not being properly parsed because FastAPI expects them to be passed as query or JSON body parameters by default.
### Changes Made:
- Added `Form(...)` to all non-file parameters (`parse_method`, `is_json_md_dump`, `output_dir`, and return flags like `return_layout`, etc.).
- This ensures that FastAPI correctly reads these fields from form-data, allowing clients to send both files and structured configuration options in the same request.
### Why This Change Was Needed:
- When using `requests.post(..., data=data, files=files)`, the `data` dictionary is sent as form-encoded data.
- Without explicitly declaring these fields with `Form(...)`, FastAPI does not bind them correctly, leading to default values always being used (e.g., `False` for boolean flags).
- This change allows the API to accurately reflect the client's intent and enables features like `return_layout`, `return_images`, etc., to work as expected.
This update improves compatibility with HTTP clients that rely on standard form-based file upload mechanisms while preserving the existing behavior of the API.
2025-04-30 17:15:54 +08:00
Xiaomeng Zhao
f0b66d3aab
Merge pull request #2410 from myhloli/dev
...
feat(model): add logging for batch image processing
2025-04-30 17:09:49 +08:00
myhloli
b29b73af21
feat(model): add logging for batch image processing
...
- Add logger info for each batch processed
- Include batch number and page count in log message
2025-04-30 17:08:20 +08:00
Xiaomeng Zhao
5e8656c74f
Merge pull request #2406 from opendatalab/master
...
update version
2025-04-29 16:09:37 +08:00
myhloli
2aaf2310f2
Update version.py with new version
2025-04-29 08:06:04 +00:00