Commit Graph

4953 Commits

Author SHA1 Message Date
myhloli
f71d5c1d84 feat: implement visualization job handling with context management and logging 2026-03-26 01:49:31 +08:00
Xiaomeng Zhao
b064ebdf69 Merge pull request #27 from myhloli/new_client: feat: increase maximum concurrent requests to 3 and enhance task logg… 2026-03-26 01:16:21 +08:00
Xiaomeng Zhao
b786ba3f76 Merge pull request #26 from myhloli/new_client: New client 2026-03-26 01:12:20 +08:00
myhloli
87960671f8 feat: increase maximum concurrent requests to 3 and enhance task logging with progress tracking 2026-03-26 01:08:52 +08:00
myhloli
fe257fe6a3 feat: update API client to support maximum concurrent requests and improve logging 2026-03-26 00:34:33 +08:00
myhloli
053ae8eb24 feat: implement robust cleanup for temporary API directory with retry mechanism 2026-03-25 21:12:15 +08:00
myhloli
40a52da3cf feat: add API protocol version and default processing window size to client 2026-03-25 20:44:46 +08:00
myhloli
ad35c69a18 feat: simplify PDF byte conversion by integrating page range handling directly in rewrite function 2026-03-25 19:00:10 +08:00
myhloli
8d50bd9b63 feat: refactor PDF byte conversion to utilize pdfium for improved performance and error handling 2026-03-25 18:39:34 +08:00
myhloli
6c52a44143 feat: remove async context manager for PDF handling and simplify locking mechanism 2026-03-25 16:40:43 +08:00
myhloli
c57e36ac54 feat: remove low memory handling and refactor to use processing window size 2026-03-25 15:19:31 +08:00
myhloli
efeb832272 feat: add functions for retrieving image paths and MIME types 2026-03-25 14:26:46 +08:00
myhloli
934fbd03ec feat: add pdfium_guard for thread-safe PDF document handling 2026-03-25 14:16:56 +08:00
myhloli
cf8964c873 feat: refactor PDF handling to utilize pdfium_guard for resource management 2026-03-25 14:11:33 +08:00
myhloli
0eff2b0d70 feat: adjust default GPU memory utilization based on vllm version and GPU memory 2026-03-24 18:45:21 +08:00
myhloli
8861657d18 feat: update OCR detection base batch size and adjust memory requirements in documentation 2026-03-24 17:25:41 +08:00
myhloli
0cff3438f6 feat: refactor PDF conversion to use pypdf and remove threading lock 2026-03-24 16:43:06 +08:00
myhloli
39f7311f5f feat: mark lines in index blocks as list start lines 2026-03-24 16:05:20 +08:00
myhloli
ed3731ba96 feat: update minimum dynamic batch size for MFR processing to 16 2026-03-24 15:42:54 +08:00
myhloli
635775c810 feat: add support for disabling VLM acceleration via environment variable 2026-03-24 15:41:50 +08:00
myhloli
5b640d6580 feat: change default value for low memory mode to true 2026-03-24 11:19:17 +08:00
myhloli
b49b0ce3b8 feat: adjust OCR detection base batch size and optimize batch ratio logic for GPU memory 2026-03-24 11:16:22 +08:00
myhloli
4aada116e8 Merge remote-tracking branch 'origin/dev' into dev 2026-03-24 10:54:07 +08:00
myhloli
6ced3dd6f1 feat: improve text block merging by ensuring both blocks have lines before merging 2026-03-24 10:53:44 +08:00
myhloli
9b68645352 feat: optimize dynamic batch size calculation for MFR processing 2026-03-24 10:33:36 +08:00
myhloli
4aab895c8a feat: enhance document processing by improving file suffix handling and adding progress indicators 2026-03-24 04:58:21 +08:00
myhloli
7dbfb81b08 feat: fix language detection for code blocks by correcting sub_type reference 2026-03-24 00:23:09 +08:00
myhloli
4beb2ad207 feat: enhance code block rendering by adding language support for syntax highlighting 2026-03-24 00:08:21 +08:00
myhloli
d3e79967df feat: enhance content list generation by adding support for SEAL and CHART block types 2026-03-23 23:36:53 +08:00
myhloli
3c6bab713c feat: enhance markdown rendering by refactoring text and hyperlink handling 2026-03-23 18:50:26 +08:00
myhloli
c30d88f618 feat: enhance PDF classification by implementing hybrid and legacy strategies 2026-03-23 17:13:26 +08:00
myhloli
e4995cfd84 feat: implement synchronous parsing endpoint and enhance task management 2026-03-23 16:11:47 +08:00
myhloli
42c278a79f feat: enhance batch group finalization by adding dynamic splitting and merging logic 2026-03-23 15:23:04 +08:00
myhloli
21aa9f7b7c feat: enhance bounding box processing by adding support for CHART and REF_TEXT block types 2026-03-23 12:03:10 +08:00
myhloli
6eb91d3632 feat: optimize batch processing by implementing dynamic batch grouping and enhancing formula item handling 2026-03-23 01:46:00 +08:00
myhloli
cbbabcb347 feat: refactor prediction methods to streamline batch processing and enhance error handling 2026-03-23 00:44:48 +08:00
myhloli
7423c135d1 feat: enhance formula number processing by appending tags to interline equations 2026-03-22 23:47:54 +08:00
myhloli
01d8e18a13 feat: add support for SEAL block type in bounding box processing 2026-03-22 23:20:32 +08:00
myhloli
fb7246540c feat: improve paragraph splitting logic by adding conditions for block positioning and line count 2026-03-22 23:09:28 +08:00
myhloli
7a365d92c9 feat: enhance PDF generation by preserving original image raster content and optimizing image handling 2026-03-22 03:06:26 +08:00
myhloli
e07820a12c feat: extend model initialization to support MFR alongside Layout 2026-03-21 19:33:36 +08:00
myhloli
7685afc4de feat: enhance table processing with inline object extraction and base64 image handling 2026-03-21 18:44:56 +08:00
myhloli
4d57a0fe58 feat: refactor batch analysis by removing unused methods and optimizing table processing logic 2026-03-21 16:01:54 +08:00
myhloli
db2f76d556 feat: integrate vertical crop rotation in OCR image processing for improved alignment 2026-03-21 04:23:11 +08:00
myhloli
6562d440db feat: sort blocks by index in rebuilt page blocks for consistent ordering 2026-03-21 04:18:21 +08:00
myhloli
28eeebb95d feat: optimize PDF rendering process with dynamic thread allocation and page range calculation 2026-03-21 03:53:10 +08:00
myhloli
09fc22fcc2 feat: add support for SEAL block type in bbox drawing and update image handling in markdown content 2026-03-21 03:33:34 +08:00
myhloli
01ab656487 feat: replace shape attribute with len for box count and update return type for ragged arrays 2026-03-21 03:14:12 +08:00
myhloli
7409e645f3 feat: update license from AGPL-3.0 to Apache License 2.0 and reflect changes in documentation 2026-03-21 03:04:31 +08:00
Xiaomeng Zhao
09cd7b16cf Merge pull request #25 from myhloli/add_ppdoclayout: Add ppdoclayout 2026-03-21 02:48:39 +08:00