mirror of
https://github.com/ollama/ollama.git
synced 2026-03-27 02:58:43 +07:00
This change fixes a problem in the token cache logic to avoid panics caused by empty token arrays by ensuring at least one token remains on full cache hits in the relevant function. The happens if there is an exact match in the cache on subsequent generations.