4673 Commits

Author SHA1 Message Date
quyuan01
591b76b53f Update update_base.yml 2024-04-09 21:35:46 +08:00
quyuan
9c842ad2d3 CI yaml 2024-04-09 21:27:46 +08:00
quyuan
dd29734e9e CI yaml 2024-04-09 21:25:19 +08:00
quyuan
35c061ecb8 CI yaml 2024-04-09 21:10:34 +08:00
quyuan
dd5d4dd875 CI yaml 2024-04-09 21:08:18 +08:00
quyuan01
1bed454082 Update update_base.yml 2024-04-09 21:00:12 +08:00
quyuan
0e86cef661 CI yaml 2024-04-09 20:54:45 +08:00
quyuan
9dd1f7f7bf CI yaml 2024-04-09 20:44:44 +08:00
quyuan
39fefab1dd Merge branch 'master' of https://github.com/magicpdf/Magic-PDF 2024-04-09 20:24:19 +08:00
quyuan
915b630ae3 CI yaml 2024-04-09 20:23:57 +08:00
drunkpig
c5424292d7 Merge pull request #13 from papayalove/master
更新io modules
2024-04-09 19:26:23 +08:00
liukaiwen
94d94e61e4 io modules 2024-04-09 19:17:37 +08:00
liukaiwen
8f65af9f48 io modules 2024-04-09 17:33:28 +08:00
quyuan
cfac3b2527 CI yaml 2024-04-09 16:57:41 +08:00
quyuan
779215e1e5 Merge branch 'master' of https://github.com/magicpdf/Magic-PDF 2024-04-09 16:20:20 +08:00
quyuan
cdb318fe43 CI yaml 2024-04-09 16:19:19 +08:00
drunkpig
6532b24fe8 Merge pull request #12 from magicpdf/dev-xm
remove spark/s3.py,remove pipeline
2024-04-09 16:03:40 +08:00
quyuan01
2f4c76ca81 Update benchmark.yml 2024-04-09 15:39:55 +08:00
赵小蒙
c81f699e68 更新libs/config_reader,删除spark/s3.py
pipeline_cor.py pipeline_txt.py, pipeline.py 移动到code_clean并修复一些依赖关系
2024-04-09 15:25:16 +08:00
quyuan01
992b8922fc Update benchmark.yml 2024-04-09 14:56:18 +08:00
quyuan01
3e8edeb2f4 Update benchmark.yml 2024-04-09 14:33:16 +08:00
quyuan01
284ff9b090 Update benchmark.yml 2024-04-09 14:26:23 +08:00
quyuan01
9e22acfc25 Update benchmark.yml 2024-04-09 14:24:18 +08:00
quyuan
a4b42f14b8 add qa 2024-04-09 14:08:41 +08:00
quyuan01
eead46cd96 Update benchmark.yml 2024-04-09 12:57:45 +08:00
quyuan01
489be94bda Update benchmark.yml 2024-04-09 11:36:57 +08:00
quyuan01
3e794aaca5 Update benchmark.yml 2024-04-09 11:16:55 +08:00
quyuan01
340683a622 Update benchmark.yml 2024-04-09 11:07:53 +08:00
quyuan01
1ce51ed7f0 Update benchmark.yml 2024-04-09 10:52:34 +08:00
drunkpig
b75ee676fe Merge pull request #11 from magicpdf/dev-xm
fix logic
2024-04-09 10:37:57 +08:00
赵小蒙
4b87a571bf config读写配置更新 2024-04-09 10:34:02 +08:00
quyuan01
b4fb6a6828 Update benchmark.yml 2024-04-09 10:21:46 +08:00
quyuan01
d9d07d3ca3 Update benchmark.yml 2024-04-09 10:20:37 +08:00
quyuan01
a0b39744f3 Update benchmark.yml 2024-04-09 10:18:25 +08:00
quyuan01
ff463d8edd Update benchmark.yml 2024-04-09 10:07:31 +08:00
quyuan
55ffd959e2 CI yaml 2024-04-09 10:02:31 +08:00
quyuan
fed7f9ffc0 CI yaml 2024-04-09 10:01:12 +08:00
drunkpig
022a4e9332 Merge pull request #10 from magicpdf/benchmark
tools
2024-04-08 22:19:06 +08:00
Shuimo
b66dda3815 Improved script to read json in compressed packages 2024-04-08 20:34:42 +08:00
Shuimo
015e2bdd81 rename the folder 2024-04-08 19:54:43 +08:00
Shuimo
bda3444976 update the necessary package requirements.txt and add two json files 2024-04-08 19:42:55 +08:00
Shuimo
5e05e48553 add a ocr_badcase script 2024-04-08 19:08:43 +08:00
赵小蒙
aedaeb00fa 一些变量命名更新 2024-04-08 18:36:15 +08:00
赵小蒙
7e59b4b651 实现从本地home目录获取s3config信息 2024-04-08 18:21:37 +08:00
赵小蒙
58c191e769 将s3的init配置转换成json配置,并保存到home目录下 2024-04-08 18:21:04 +08:00
赵小蒙
34bde6d8ec classify后在jso根层级添加_pdf_type标识,同时取消对非文本类pdf的drop 2024-04-08 18:18:55 +08:00
赵小蒙
f65be6e094 pdf_parse_by_model.py ---> pdf_parse_by_txt.py 2024-04-08 15:12:26 +08:00
赵小蒙
0f3bfa1044 Merge remote-tracking branch 'origin/master' 2024-04-08 14:56:39 +08:00
赵小蒙
f52c6249be 更新路径输入和markdown输出逻辑 2024-04-08 14:56:13 +08:00
赵小蒙
ca7059e514 注释更新 2024-04-08 12:03:36 +08:00