mirror of
https://github.com/ollama/ollama.git
synced 2026-03-27 02:58:43 +07:00
Claude Code sends an x-anthropic-billing-header that changes on every request. This is embedded in the system prompt and consequently breaks the KV cache for every request. Given the size of the prompts that Claude Code usees, this has significant performance impact.