mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-27 11:08:32 +07:00
13
README.md
@@ -44,6 +44,14 @@

# Changelog

- 2025/07/16 2.1.1 Released
  - Bug fixes
    - Fixed a text block content loss issue that could occur in certain `pipeline` scenarios #3005
    - Fixed an issue where `sglang-client` required unnecessary packages such as `torch` #2968
    - Updated `dockerfile` to fix incomplete text content parsing caused by missing fonts on Linux #2915
  - Usability improvements
    - Updated `compose.yaml` to allow direct startup of the `sglang-server`, `mineru-api`, and `mineru-gradio` services
    - Launched a new [online documentation site](https://opendatalab.github.io/MinerU/) and simplified the README for a better documentation experience
- 2025/07/05 Version 2.1.0 Released
  - This is the first major update of MinerU 2. It includes a large number of new features and improvements, covering significant performance optimizations, user experience enhancements, and bug fixes. The detailed changes are as follows:
    - **Performance Optimizations:**
@@ -517,6 +525,11 @@ You can get the [Docker Deployment Instructions](https://opendatalab.github.io/M

### Using MinerU

The simplest command line invocation is:

```bash
mineru -p <input_path> -o <output_path>
```

You can use MinerU for PDF parsing in several ways, including the command line, API, and WebUI. For detailed instructions, see the [Usage Guide](https://opendatalab.github.io/MinerU/usage/).
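
For scripted use, the command line call above can be wrapped in a few lines of Python. This is a minimal sketch rather than part of MinerU: the helper names `build_mineru_command` and `run_mineru` and the example paths are hypothetical, and the only flags used are the `-p` and `-o` options shown above.

```python
import shutil
import subprocess


def build_mineru_command(input_path, output_path):
    """Return the argv list for the basic `mineru -p ... -o ...` call."""
    return ["mineru", "-p", str(input_path), "-o", str(output_path)]


def run_mineru(input_path, output_path):
    """Run the command if `mineru` is on PATH and return its exit code."""
    if shutil.which("mineru") is None:
        raise FileNotFoundError("the `mineru` executable was not found on PATH")
    completed = subprocess.run(build_mineru_command(input_path, output_path))
    return completed.returncode


if __name__ == "__main__":
    # Hypothetical paths, used only for illustration.
    print(build_mineru_command("demo.pdf", "output"))
```

Passing the arguments as a list avoids shell quoting problems when a path contains spaces.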

# TODO

@@ -43,6 +43,14 @@
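</div>

# Changelog
- 2025/07/16 2.1.1 Released
  - Bug fixes
    - Fixed a text block content loss issue that could occur in `pipeline` in some cases #3005
    - Fixed an issue where `sglang-client` required installing unnecessary packages such as `torch` #2968
    - Updated `dockerfile` to fix incomplete parsed text content caused by missing fonts on Linux #2915
  - Usability updates
    - Updated `compose.yaml` so that users can directly start the `sglang-server`, `mineru-api`, and `mineru-gradio` services
    - Launched a new [online documentation site](https://opendatalab.github.io/MinerU/zh/) and simplified the README for a better documentation experience
- 2025/07/05 2.1.0 Released
  - This is the first major update of MinerU 2. It includes a large number of new features and improvements, with many performance optimizations, experience improvements, and bug fixes. The detailed changes are as follows:
    - Performance optimizations: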
@@ -503,6 +511,12 @@ MinerU provides a convenient Docker deployment method, which helps to quickly set up the environment and
---

### Using MinerU

The simplest command line invocation is:

```bash
mineru -p <input_path> -o <output_path>
```

You can use MinerU for PDF parsing in several ways, including the command line, API, and WebUI. For detailed instructions, see the [Usage Guide](https://opendatalab.github.io/MinerU/zh/usage/).

# TODO

@@ -1,20 +0,0 @@
pytest
Levenshtein
nltk
rapidfuzz
statistics
openxlab # install opendatalab
pandas
numpy
matplotlib
seaborn
scipy
scikit-learn
tqdm
htmltabletomd
pypandoc
pyopenssl==24.0.0
struct-eqtable==0.1.0
pytest-cov
beautifulsoup4
coverage