ollama/runner/ollamarunner/runner.go at 810d4f9c22319491cd3ac360afed6d2cae6be99a

mirror of https://github.com/ollama/ollama.git synced 2026-03-27 02:58:43 +07:00

Files

easonysliu 810d4f9c22 runner: fix swallowed error in allocModel graph reservation

In allocModel(), the first call to reserveWorstCaseGraph(true) had its
error silently discarded — `return nil` was used instead of `return err`.

This meant that if the prompt-sized graph reservation failed (e.g. due
to insufficient memory), the error was swallowed, allocModel reported
success, and the model appeared to load correctly. Subsequent inference
would then fail in unexpected ways because the worst-case graph was
never properly reserved.

Fix: return the actual error so the caller can handle the failure
(retry with reduced parallelism, report OOM, etc.).

Co-Authored-By: Claude (claude-opus-4-6) <noreply@anthropic.com>

2026-03-16 15:48:45 -07:00

40 KiB

Raw Blame History

View Raw

40 KiB Raw Blame History

40 KiB

Raw Blame History