mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-27 11:08:32 +07:00
docs: update download model instructions for Chinese users
This commit is contained in:
@@ -104,13 +104,13 @@ pip install detectron2 --extra-index-url https://myhloli.github.io/wheels/
|
||||
|
||||
#### 2. Downloading model weights files
|
||||
|
||||
For detailed references, please see below[how_to_download_models](docs/how_to_download_models.md)
|
||||
For detailed references, please see below[how_to_download_models](docs/how_to_download_models_en.md)
|
||||
|
||||
After downloading the model weights, move the 'models' directory to a directory on a larger disk space, preferably an SSD.
|
||||
|
||||
|
||||
#### 3. Copy the Configuration File and Make Configurations
|
||||
You can get the magic-pdf.template.json file in the repository root directory.
|
||||
You can get the [magic-pdf.template.json](magic-pdf.template.json) file in the repository root directory.
|
||||
```bash
|
||||
cp magic-pdf.template.json ~/magic-pdf.json
|
||||
```
|
||||
|
||||
@@ -98,11 +98,11 @@ pip install detectron2 --extra-index-url https://myhloli.github.io/wheels/
|
||||
|
||||
#### 2. 下载模型权重文件
|
||||
|
||||
详细参考[如何下载模型文件](docs/how_to_download_models.md)
|
||||
下载后请将models目录拷贝到空间较大的ssd磁盘目录
|
||||
详细参考[如何下载模型文件](docs/how_to_download_models_zh_cn.md)
|
||||
下载后请将models目录移动到空间较大的ssd磁盘目录
|
||||
|
||||
#### 3. 拷贝配置文件并进行配置
|
||||
在仓库根目录可以获得magic-pdf.template.json文件
|
||||
在仓库根目录可以获得[magic-pdf.template.json](magic-pdf.template.json)文件
|
||||
```bash
|
||||
cp magic-pdf.template.json ~/magic-pdf.json
|
||||
```
|
||||
|
||||
0
docs/how_to_download_models.md → docs/how_to_download_models_en.md
Executable file → Normal file
0
docs/how_to_download_models.md → docs/how_to_download_models_en.md
Executable file → Normal file
38
docs/how_to_download_models_zh_cn.md
Normal file
38
docs/how_to_download_models_zh_cn.md
Normal file
@@ -0,0 +1,38 @@
|
||||
#### 安装 Git LFS
|
||||
开始之前,请确保您的系统上已安装 Git 大文件存储 (Git LFS)。使用以下命令进行安装
|
||||
|
||||
```bash
|
||||
git lfs install
|
||||
```
|
||||
|
||||
#### 从 Hugging Face 下载模型
|
||||
请使用以下命令从 Hugging Face 下载 PDF-Extract-Kit 模型:
|
||||
|
||||
```bash
|
||||
git lfs clone https://huggingface.co/wanderkid/PDF-Extract-Kit
|
||||
```
|
||||
|
||||
确保在克隆过程中启用了 Git LFS,以便正确下载所有大文件。
|
||||
|
||||
|
||||
将 'models' 目录移动到具有较大磁盘空间的目录中,最好是在固态硬盘(SSD)上。
|
||||
|
||||
|
||||
模型文件夹的结构如下,包含了不同组件的配置文件和权重文件:
|
||||
```
|
||||
./
|
||||
├── Layout
|
||||
│ ├── config.json
|
||||
│ └── model_final.pth
|
||||
├── MFD
|
||||
│ └── weights.pt
|
||||
├── MFR
|
||||
│ └── UniMERNet
|
||||
│ ├── config.json
|
||||
│ ├── preprocessor_config.json
|
||||
│ ├── pytorch_model.bin
|
||||
│ ├── README.md
|
||||
│ ├── tokenizer_config.json
|
||||
│ └── tokenizer.json
|
||||
└── README.md
|
||||
```
|
||||
Reference in New Issue
Block a user