myhloli | bcb30fe79c | fix: simplify VRAM size retrieval and improve error handling in memory management | 2025-12-01 18:31:07 +08:00
myhloli | 44fdeb663f | Refactor async function and improve output directory handling in prediction | 2025-10-13 11:32:28 +08:00
myhloli | 3ec6479462 | fix: update backend comment to reflect renaming from sglang-engine to vlm-vllm-engine | 2025-09-15 02:00:58 +08:00
zhanluxianshen | 1671e68367 | fix error logs for multi_gpu endpoint. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com> | 2025-08-26 10:26:10 +08:00
Xiaomeng Zhao | d3f6736e0a | Update _config_endpoint.py | 2025-07-05 04:33:49 +08:00
Xiaomeng Zhao | 07b4cbc0ec | Update projects/multi_gpu_v2/_config_endpoint.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> | 2025-07-05 04:32:31 +08:00
Xiaomeng Zhao | c08a86d6c7 | Update projects/multi_gpu_v2/server.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> | 2025-07-05 04:29:36 +08:00
Xiaomeng Zhao | ea9336c0c1 | Update server.py | 2025-07-05 04:14:58 +08:00
ca1yz | 3f32f2a587 | Update README.md | 2025-06-18 19:08:39 +08:00
ca1yz | dbfd392f05 | add updated example project based on 2.0 | 2025-06-18 19:07:53 +08:00