MinerU

mirror of https://github.com/opendatalab/MinerU.git synced 2026-03-27 11:08:32 +07:00

Author	SHA1	Message	Date
Xiaomeng Zhao	e3a4295527	Merge pull request #2513 from myhloli/dev feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements	2025-05-24 15:55:39 +08:00
myhloli	73f0530d16	feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements	2025-05-24 15:47:31 +08:00
Xiaomeng Zhao	e92b5b698e	Merge pull request #2512 from myhloli/dev fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion	2025-05-24 13:46:17 +08:00
myhloli	1e01ffcf78	fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion	2025-05-24 13:39:34 +08:00
Xiaomeng Zhao	04b81dc1ab	Merge pull request #2511 from myhloli/dev Merge pull request #10 from myhloli/img2text	2025-05-24 12:01:47 +08:00
Xiaomeng Zhao	90585b67a9	Merge pull request #2510 from myhloli/img2text feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function	2025-05-24 12:00:34 +08:00
Xiaomeng Zhao	4949dd0c18	Merge pull request #10 from myhloli/img2text feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function	2025-05-24 11:59:52 +08:00
myhloli	a2b848136b	feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function	2025-05-24 11:58:02 +08:00
Xiaomeng Zhao	04a712f940	Merge pull request #2506 from myhloli/dev feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers	2025-05-23 18:09:27 +08:00
myhloli	27cad566fa	feat(ocr): implement PPHGNetV2 architecture with multiple stages and layers	2025-05-23 18:06:21 +08:00
Xiaomeng Zhao	ea3003f6ef	Merge pull request #2505 from myhloli/dev feat(ocr): add PPHGNetV2_B4 backbone and update OCR models	2025-05-23 17:34:56 +08:00
myhloli	93ad41edce	feat(ocr): add PPHGNetV2_B4 backbone and update OCR models - Add PPHGNetV2_B4 backbone to the list of supported backbones - Introduce new OCR model configuration for PP-OCRv5 with PPHGNetV2_B4 - Update existing model configurations to use the new backbone - Modify RNN neck to support input with H > 1 - Adjust batch size for inference	2025-05-23 17:06:52 +08:00
Xiaomeng Zhao	8f8b8c4c1f	Merge pull request #2501 from myhloli/dev feat(ocr): add PP-OCRv5 models and update configurations	2025-05-22 17:40:43 +08:00
myhloli	048f6af406	feat(ocr): add PP-OCRv5 models and update configurations - Add new PP-OCRv5 detection and recognition models - Update arch_config.yaml with new model architectures - Modify models_config.yml to include PP-OCRv5 models for ch_lite configuration- Change dictionary file for ch_lite to ppocrv5_dict.txt	2025-05-22 17:29:47 +08:00
Xiaomeng Zhao	b122b86e8a	Merge pull request #2487 from myhloli/dev fix(ocr_mkcontent): improve image handling and footnote integration in markdown output	2025-05-19 15:47:48 +08:00
myhloli	002333a8d7	fix(ocr_mkcontent): improve image handling and footnote integration in markdown output	2025-05-19 15:45:26 +08:00
Xiaomeng Zhao	e3f22e84ab	Merge pull request #2468 from opendatalab/master master->dev	2025-05-14 10:46:50 +08:00
myhloli	40851b1c61	Update version.py with new version	2025-05-14 02:34:34 +00:00
Xiaomeng Zhao	ea619281ef	Merge pull request #2467 from opendatalab/release-1.3.11 Release 1.3.11 magic_pdf-1.3.11-released	2025-05-14 10:33:00 +08:00
Xiaomeng Zhao	212cfcf24a	Merge pull request #2466 from opendatalab/dev docs(changelog): remove pdfminer.six version pinning from release notes	2025-05-14 10:32:31 +08:00
Xiaomeng Zhao	cda85d6262	Merge pull request #2465 from myhloli/dev docs(changelog): remove pdfminer.six version pinning from release notes	2025-05-14 10:31:55 +08:00
myhloli	51ceb48014	docs(changelog): remove pdfminer.six version pinning from release notes	2025-05-14 10:30:55 +08:00
Xiaomeng Zhao	0b8c614280	Merge pull request #2464 from opendatalab/release-1.3.11 Release 1.3.11	2025-05-14 10:22:18 +08:00
Xiaomeng Zhao	c1b387abe6	Merge pull request #2451 from myhloli/dev fix(modeling): escape backslashes in LaTeX command descriptions	2025-05-10 00:37:50 +08:00
myhloli	1ab54ac2e3	fix(modeling): escape backslashes in LaTeX command descriptions	2025-05-10 00:34:11 +08:00
myhloli	78a0208425	docs(installation): remove numpy version restriction from PyTorch installation instructions	2025-05-10 00:28:55 +08:00
Xiaomeng Zhao	cd785f6af8	Merge pull request #2450 from myhloli/dev fix(requirements): update pdfminer.six version and restrict torch version upper limit	2025-05-09 23:58:42 +08:00
myhloli	a8f752f753	fix(requirements): update pdfminer.six version and restrict torch version upper limit	2025-05-09 23:57:22 +08:00
Xiaomeng Zhao	65f332ffae	Merge pull request #2449 from myhloli/dev fix(setup): update python_requires to support Python 3.10 to 3.13	2025-05-09 23:45:16 +08:00
myhloli	c4b04ae642	Merge remote-tracking branch 'origin/dev' into dev	2025-05-09 23:38:50 +08:00
myhloli	3858d918dd	fix(setup): update python_requires to support Python 3.10 to 3.13	2025-05-09 23:38:37 +08:00
Xiaomeng Zhao	70696165c7	Merge pull request #2446 from myhloli/dev fix(Dockerfile): update modelscope installation command to use mirror	2025-05-09 18:23:08 +08:00
myhloli	b799d302c2	Merge remote-tracking branch 'origin/dev' into dev	2025-05-09 17:35:01 +08:00
myhloli	9351d64a41	fix(Dockerfile): update modelscope installation command to use mirror	2025-05-09 17:33:47 +08:00
Xiaomeng Zhao	3230793b55	Merge pull request #2440 from myhloli/dev docs(installation): update Python version and CUDA installation instructions	2025-05-09 11:10:09 +08:00
myhloli	9f0d45bb58	docs(installation): update Python version and CUDA installation instructions	2025-05-09 10:48:14 +08:00
Xiaomeng Zhao	6c9645aa0c	Merge pull request #2437 from myhloli/dev docs(README): reorder installation commands for clarity	2025-05-08 18:56:34 +08:00
myhloli	96fb646a86	Merge remote-tracking branch 'origin/dev' into dev	2025-05-08 18:55:49 +08:00
myhloli	71a429a32e	docs(README): reorder installation commands for clarity	2025-05-08 18:54:39 +08:00
Xiaomeng Zhao	201e338b3a	Merge pull request #2429 from myhloli/dev feat(modeling): add regex patterns for LaTeX symbol replacements	2025-05-08 11:27:57 +08:00
myhloli	2a28f604c6	feat(modeling): add regex patterns for LaTeX symbol replacements	2025-05-08 11:26:42 +08:00
Xiaomeng Zhao	38d0a622d9	Merge pull request #2423 from myhloli/dev feat(modeling): add 'protect' command to removal patterns	2025-05-06 18:22:18 +08:00
myhloli	a8ca183094	feat(modeling): add 'protect' command to removal patterns	2025-05-06 18:21:03 +08:00
Xiaomeng Zhao	11bf98d0aa	Merge pull request #2411 from CharlesKeeling65/patch-1 Update app.py: Fix parameter parsing in /file_parse endpoint	2025-04-30 17:51:08 +08:00
github-actions[bot]	50700646e4	@CharlesKeeling65 has signed the CLA in opendatalab/MinerU#2411	2025-04-30 09:25:44 +00:00
Wang Yubo	862891e294	Update app.py: Fix parameter parsing in /file_parse endpoint I have updated the `/file_parse` endpoint in `app.py` to correctly handle boolean and string parameters when they are sent via `multipart/form-data` requests (commonly used for file uploads). Previously, these parameters were not being properly parsed because FastAPI expects them to be passed as query or JSON body parameters by default. ### Changes Made: - Added `Form(...)` to all non-file parameters (`parse_method`, `is_json_md_dump`, `output_dir`, and return flags like `return_layout`, etc.). - This ensures that FastAPI correctly reads these fields from form-data, allowing clients to send both files and structured configuration options in the same request. ### Why This Change Was Needed: - When using `requests.post(..., data=data, files=files)`, the `data` dictionary is sent as form-encoded data. - Without explicitly declaring these fields with `Form(...)`, FastAPI does not bind them correctly, leading to default values always being used (e.g., `False` for boolean flags). - This change allows the API to accurately reflect the client's intent and enables features like `return_layout`, `return_images`, etc., to work as expected. This update improves compatibility with HTTP clients that rely on standard form-based file upload mechanisms while preserving the existing behavior of the API.	2025-04-30 17:15:54 +08:00
Xiaomeng Zhao	f0b66d3aab	Merge pull request #2410 from myhloli/dev feat(model): add logging for batch image processing	2025-04-30 17:09:49 +08:00
myhloli	b29b73af21	feat(model): add logging for batch image processing - Add logger info for each batch processed - Include batch number and page count in log message	2025-04-30 17:08:20 +08:00
Xiaomeng Zhao	5e8656c74f	Merge pull request #2406 from opendatalab/master update version	2025-04-29 16:09:37 +08:00
myhloli	2aaf2310f2	Update version.py with new version	2025-04-29 08:06:04 +00:00

1 2 3 4 5 ...

2828 Commits