Compare commits

..

3 Commits

Author SHA1 Message Date
Xiaomeng Zhao
973443d6d4 Merge branch 'master' into release-2.1.2 2025-07-22 22:27:37 +08:00
Xiaomeng Zhao
7714e4c41a Merge pull request #3141 from opendatalab/dev
Dev
2025-07-22 22:23:57 +08:00
Xiaomeng Zhao
2bf2337e76 @myhloli has signed the CLA in opendatalab/MinerU#3129 2025-07-22 21:06:30 +08:00
4 changed files with 10 additions and 6 deletions

View File

@@ -43,7 +43,7 @@
</div>
# Changelog
- 2025/07/22 2.1.3 Released
- 2025/07/22 2.1.2 Released
- Bug Fixes
- Fixed the issue of excessive memory consumption during the `MFR` step in the `pipeline` backend under certain scenarios #2771
- Fixed the inaccurate matching between `image`/`table` and `caption`/`footnote` under certain conditions #3129

View File

@@ -43,7 +43,7 @@
</div>
# 更新记录
- 2025/07/22 2.1.3发布
- 2025/07/22 2.1.2发布
- bug修复
- 修复`pipeline`后端中`MFR`步骤在某些情况下显存消耗过大的问题 #2771
- 修复某些情况下`image`/`table``caption`/`footnote`匹配不准确的问题 #3129

View File

@@ -104,10 +104,6 @@ class UnimernetModel(object):
# Create dataset with sorted images
dataset = MathDataset(sorted_images, transform=self.model.transform)
# 如果batch_size> len(sorted_images)则设置为不超过len(sorted_images)的2的幂
batch_size = min(batch_size, 2 ** (len(sorted_images).bit_length() - 1))
dataloader = DataLoader(dataset, batch_size=batch_size, num_workers=0)
# Process batches and store results

View File

@@ -391,6 +391,14 @@
"created_at": "2025-07-16T08:53:24Z",
"repoId": 765083837,
"pullRequestNo": 3070
},
{
"name": "huazZeng",
"id": 125243371,
"comment_id": 3100630363,
"created_at": "2025-07-22T03:04:40Z",
"repoId": 765083837,
"pullRequestNo": 3129
}
]
}