Merge pull request #3075 from opendatalab/dev

Dev
Xiaomeng Zhao
2025-07-16 18:03:57 +08:00
committed by GitHub
3 changed files with 27 additions and 20 deletions


@@ -44,6 +44,14 @@
# Changelog
- 2025/07/16 Version 2.1.1 Released
  - Bug fixes
    - Fixed a text block content loss issue that could occur in certain `pipeline` scenarios #3005
    - Fixed an issue where `sglang-client` required unnecessary packages such as `torch` #2968
    - Updated the `dockerfile` to fix incomplete text parsing caused by missing fonts on Linux #2915
  - Usability improvements
    - Updated `compose.yaml` so that the `sglang-server`, `mineru-api`, and `mineru-gradio` services can be started directly
    - Launched a brand-new [online documentation site](https://opendatalab.github.io/MinerU/) and simplified the README for a better documentation experience
- 2025/07/05 Version 2.1.0 Released
  - This is the first major update of MinerU 2. It includes a large number of new features and improvements, covering significant performance optimizations, user experience enhancements, and bug fixes. The details are as follows:
    - **Performance Optimizations:**
@@ -517,6 +525,11 @@ You can get the [Docker Deployment Instructions](https://opendatalab.github.io/M
### Using MinerU
The simplest command line invocation is:
```bash
mineru -p <input_path> -o <output_path>
```
You can use MinerU for PDF parsing through various methods such as command line, API, and WebUI. For detailed instructions, please refer to the [Usage Guide](https://opendatalab.github.io/MinerU/usage/).
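To apply the invocation above to a whole folder, one option is a small wrapper that calls `mineru -p <input_path> -o <output_path>` once per PDF. The following is a minimal sketch, not part of MinerU itself: the function name `batch_parse` and the `MINERU_CMD` variable are hypothetical names introduced for this example, and only the `-p` and `-o` options shown above are used.

```bash
#!/usr/bin/env bash
# Sketch: run `mineru -p <input_path> -o <output_path>` once per PDF in a directory.
# batch_parse and MINERU_CMD are hypothetical names, not part of MinerU.
set -euo pipefail

# Override MINERU_CMD (for example with `echo`) to preview the calls without running them.
MINERU_CMD="${MINERU_CMD:-mineru}"

batch_parse() {
  local input_dir="$1" output_dir="$2" pdf
  for pdf in "$input_dir"/*.pdf; do
    # If the glob matched nothing, the literal pattern remains; skip it.
    [ -e "$pdf" ] || continue
    "$MINERU_CMD" -p "$pdf" -o "$output_dir"
  done
}
```

After sourcing the script, `batch_parse ./pdfs ./output` would process every `*.pdf` directly inside `./pdfs`. Setting `MINERU_CMD=echo` first prints the arguments of each call instead of running it, which is a cheap way to check the paths.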
# TODO


@@ -43,6 +43,14 @@
</div>
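# Changelog
- 2025/07/16 Version 2.1.1 released
  - Bug fixes
    - Fixed a text block content loss issue that could occur with `pipeline` in some cases #3005
    - Fixed an issue where `sglang-client` required installing unnecessary packages such as `torch` #2968
    - Updated the `dockerfile` to fix incomplete text parsing caused by missing fonts on Linux #2915
  - Usability updates
    - Updated `compose.yaml` so that users can directly start the `sglang-server`, `mineru-api`, and `mineru-gradio` services
    - Launched a brand-new [online documentation site](https://opendatalab.github.io/MinerU/zh/), simplifying the README and providing a better documentation experience
- 2025/07/05 Version 2.1.0 released
  - This is the first major update of MinerU 2. It contains a large number of new features and improvements, including many performance optimizations, user experience improvements, and bug fixes. The details are as follows:
    - Performance optimizations: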
@@ -503,6 +511,12 @@ MinerU provides a convenient Docker deployment method, which helps to quickly set up the environment and
---
### Using MinerU
The simplest command line invocation is:
```bash
mineru -p <input_path> -o <output_path>
```
You can use MinerU for PDF parsing in several ways, including the command line, API, and WebUI. For details, see the [Usage Guide](https://opendatalab.github.io/MinerU/zh/usage/).
# TODO


@@ -1,20 +0,0 @@
pytest
Levenshtein
nltk
rapidfuzz
statistics
openxlab  # install opendatalab
pandas
numpy
matplotlib
seaborn
scipy
scikit-learn
tqdm
htmltabletomd
pypandoc
pyopenssl==24.0.0
struct-eqtable==0.1.0
pytest-cov
beautifulsoup4
coverage