Files
ollama/model
Jeffrey Morgan 255579aaa7 qwen3next: fix issue in delta net (#14075)
gDiffExp was being broadcast across the wrong axis when multiplying with k. This fix reshapes gDiffExp to [1, chunkSize, nChunks, ...]
2026-02-04 13:40:38 -08:00
..
2025-11-18 16:11:37 -08:00
2025-03-11 14:35:08 -07:00