mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-27 11:08:32 +07:00
add minerU video
This commit is contained in:
@@ -21,7 +21,6 @@
|
||||
|
||||
<!-- hot link -->
|
||||
<p align="center">
|
||||
<a href="https://github.com/opendatalab/MinerU">MinerU: 端到端的PDF解析工具(基于PDF-Extract-Kit)支持PDF转Markdown</a>🚀🚀🚀<br>
|
||||
<a href="https://github.com/opendatalab/PDF-Extract-Kit">PDF-Extract-Kit: 高质量PDF解析工具箱</a>🔥🔥🔥
|
||||
</p>
|
||||
|
||||
@@ -82,7 +81,9 @@
|
||||
# MinerU
|
||||
## 项目简介
|
||||
MinerU是一款将PDF转化为机器可读格式的工具(如markdown、json),可以很方便地抽取为任意格式。
|
||||
MinerU诞生于[书生-浦语](https://github.com/InternLM/InternLM)的预训练过程中,我们将会集中精力解决科技文献中的符号转化问题,以此在大模型时代为科技发展做出一点贡献。
|
||||
MinerU诞生于[书生-浦语](https://github.com/InternLM/InternLM)的预训练过程中,我们将会集中精力解决科技文献中的符号转化问题,希望在大模型时代为科技发展做出贡献。
|
||||
|
||||
https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c
|
||||
|
||||
## 主要功能
|
||||
|
||||
@@ -266,6 +267,7 @@ The project currently leverages PyMuPDF to deliver advanced functionalities; how
|
||||
|
||||
# Acknowledgments
|
||||
|
||||
- [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) 🔥🔥🔥
|
||||
- [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
|
||||
- [PyMuPDF](https://github.com/pymupdf/PyMuPDF)
|
||||
- [fast-langdetect](https://github.com/LlmKira/fast-langdetect)
|
||||
|
||||
Reference in New Issue
Block a user