quyuan01
|
591b76b53f
|
Update update_base.yml
|
2024-04-09 21:35:46 +08:00 |
|
quyuan
|
9c842ad2d3
|
CI yaml
|
2024-04-09 21:27:46 +08:00 |
|
quyuan
|
dd29734e9e
|
CI yaml
|
2024-04-09 21:25:19 +08:00 |
|
quyuan
|
35c061ecb8
|
CI yaml
|
2024-04-09 21:10:34 +08:00 |
|
quyuan
|
dd5d4dd875
|
CI yaml
|
2024-04-09 21:08:18 +08:00 |
|
quyuan01
|
1bed454082
|
Update update_base.yml
|
2024-04-09 21:00:12 +08:00 |
|
quyuan
|
0e86cef661
|
CI yaml
|
2024-04-09 20:54:45 +08:00 |
|
quyuan
|
9dd1f7f7bf
|
CI yaml
|
2024-04-09 20:44:44 +08:00 |
|
quyuan
|
39fefab1dd
|
Merge branch 'master' of https://github.com/magicpdf/Magic-PDF
|
2024-04-09 20:24:19 +08:00 |
|
quyuan
|
915b630ae3
|
CI yaml
|
2024-04-09 20:23:57 +08:00 |
|
drunkpig
|
c5424292d7
|
Merge pull request #13 from papayalove/master
更新io modules
|
2024-04-09 19:26:23 +08:00 |
|
liukaiwen
|
94d94e61e4
|
io modules
|
2024-04-09 19:17:37 +08:00 |
|
liukaiwen
|
8f65af9f48
|
io modules
|
2024-04-09 17:33:28 +08:00 |
|
quyuan
|
cfac3b2527
|
CI yaml
|
2024-04-09 16:57:41 +08:00 |
|
quyuan
|
779215e1e5
|
Merge branch 'master' of https://github.com/magicpdf/Magic-PDF
|
2024-04-09 16:20:20 +08:00 |
|
quyuan
|
cdb318fe43
|
CI yaml
|
2024-04-09 16:19:19 +08:00 |
|
drunkpig
|
6532b24fe8
|
Merge pull request #12 from magicpdf/dev-xm
remove spark/s3.py,remove pipeline
|
2024-04-09 16:03:40 +08:00 |
|
quyuan01
|
2f4c76ca81
|
Update benchmark.yml
|
2024-04-09 15:39:55 +08:00 |
|
赵小蒙
|
c81f699e68
|
更新libs/config_reader,删除spark/s3.py
pipeline_cor.py pipeline_txt.py, pipeline.py 移动到code_clean并修复一些依赖关系
|
2024-04-09 15:25:16 +08:00 |
|
quyuan01
|
992b8922fc
|
Update benchmark.yml
|
2024-04-09 14:56:18 +08:00 |
|
quyuan01
|
3e8edeb2f4
|
Update benchmark.yml
|
2024-04-09 14:33:16 +08:00 |
|
quyuan01
|
284ff9b090
|
Update benchmark.yml
|
2024-04-09 14:26:23 +08:00 |
|
quyuan01
|
9e22acfc25
|
Update benchmark.yml
|
2024-04-09 14:24:18 +08:00 |
|
quyuan
|
a4b42f14b8
|
add qa
|
2024-04-09 14:08:41 +08:00 |
|
quyuan01
|
eead46cd96
|
Update benchmark.yml
|
2024-04-09 12:57:45 +08:00 |
|
quyuan01
|
489be94bda
|
Update benchmark.yml
|
2024-04-09 11:36:57 +08:00 |
|
quyuan01
|
3e794aaca5
|
Update benchmark.yml
|
2024-04-09 11:16:55 +08:00 |
|
quyuan01
|
340683a622
|
Update benchmark.yml
|
2024-04-09 11:07:53 +08:00 |
|
quyuan01
|
1ce51ed7f0
|
Update benchmark.yml
|
2024-04-09 10:52:34 +08:00 |
|
drunkpig
|
b75ee676fe
|
Merge pull request #11 from magicpdf/dev-xm
fix logic
|
2024-04-09 10:37:57 +08:00 |
|
赵小蒙
|
4b87a571bf
|
config读写配置更新
|
2024-04-09 10:34:02 +08:00 |
|
quyuan01
|
b4fb6a6828
|
Update benchmark.yml
|
2024-04-09 10:21:46 +08:00 |
|
quyuan01
|
d9d07d3ca3
|
Update benchmark.yml
|
2024-04-09 10:20:37 +08:00 |
|
quyuan01
|
a0b39744f3
|
Update benchmark.yml
|
2024-04-09 10:18:25 +08:00 |
|
quyuan01
|
ff463d8edd
|
Update benchmark.yml
|
2024-04-09 10:07:31 +08:00 |
|
quyuan
|
55ffd959e2
|
CI yaml
|
2024-04-09 10:02:31 +08:00 |
|
quyuan
|
fed7f9ffc0
|
CI yaml
|
2024-04-09 10:01:12 +08:00 |
|
drunkpig
|
022a4e9332
|
Merge pull request #10 from magicpdf/benchmark
tools
|
2024-04-08 22:19:06 +08:00 |
|
Shuimo
|
b66dda3815
|
Improved script to read json in compressed packages
|
2024-04-08 20:34:42 +08:00 |
|
Shuimo
|
015e2bdd81
|
rename the folder
|
2024-04-08 19:54:43 +08:00 |
|
Shuimo
|
bda3444976
|
update the necessary package requirements.txt and add two json files
|
2024-04-08 19:42:55 +08:00 |
|
Shuimo
|
5e05e48553
|
add a ocr_badcase script
|
2024-04-08 19:08:43 +08:00 |
|
赵小蒙
|
aedaeb00fa
|
一些变量命名更新
|
2024-04-08 18:36:15 +08:00 |
|
赵小蒙
|
7e59b4b651
|
实现从本地home目录获取s3config信息
|
2024-04-08 18:21:37 +08:00 |
|
赵小蒙
|
58c191e769
|
将s3的init配置转换成json配置,并保存到home目录下
|
2024-04-08 18:21:04 +08:00 |
|
赵小蒙
|
34bde6d8ec
|
classify后在jso根层级添加_pdf_type标识,同时取消对非文本类pdf的drop
|
2024-04-08 18:18:55 +08:00 |
|
赵小蒙
|
f65be6e094
|
pdf_parse_by_model.py ---> pdf_parse_by_txt.py
|
2024-04-08 15:12:26 +08:00 |
|
赵小蒙
|
0f3bfa1044
|
Merge remote-tracking branch 'origin/master'
|
2024-04-08 14:56:39 +08:00 |
|
赵小蒙
|
f52c6249be
|
更新路径输入和markdown输出逻辑
|
2024-04-08 14:56:13 +08:00 |
|
赵小蒙
|
ca7059e514
|
注释更新
|
2024-04-08 12:03:36 +08:00 |
|