mirror of
https://github.com/ollama/ollama.git
synced 2026-03-27 02:58:43 +07:00
When a zstd-compressed request (e.g. from Codex CLI) hits /v1/responses with a cloud model the request failed. Fix by decompressing zstd bodies before model extraction, so cloud models are detected and proxied directly without the writer being wrapped.