Commit Graph

  • 7b62c41060 cmd/config: use envconfig.Host() for base API in launch config packages (#13937) Gabe Goodhart 2026-01-27 14:30:00 -07:00
  • 26acab64b7 docs: add clawdbot (#13925) Parth Sareen 2026-01-26 21:32:54 -05:00
  • e0f03790b1 parsers/ministral: fix nested tool call parsing by counting brace nesting (#13905) Gyungrai Wang 2026-01-27 08:03:43 +09:00
  • 3ab842b0f5 cmd: clawdbot config fixes (#13922) v0.15.2 Parth Sareen 2026-01-26 17:34:29 -05:00
  • b8e8ef8929 cmd: ollama launch clawdbot (#13921) Parth Sareen 2026-01-26 16:40:59 -05:00
  • 465d124183 cmd: fix opencode config (#13894) v0.15.1 Parth Sareen 2026-01-24 21:42:56 -05:00
  • d310e56fa3 cmd: add fallback for claude (#13892) Parth Sareen 2026-01-24 21:26:01 -05:00
  • a1ca428c90 glm4moelite: fix attention scale calculation (#13893) Jeffrey Morgan 2026-01-24 17:48:09 -08:00
  • 16750865d1 glm4moelite: quantize more tensors to q8_0 and avoid double BOS token (#13891) v0.15.1-rc1 Jeffrey Morgan 2026-01-24 16:33:54 -08:00
  • f3b476c592 build: add -O3 optimization to CGO flags (#13877) v0.15.1-rc0 Jeffrey Morgan 2026-01-24 10:55:38 -08:00
  • 5267d31d56 docs: ollama launch (#13852) Parth Sareen 2026-01-24 02:18:50 -05:00
  • b44f56319f README: Update the "Ollama for ruby" to the most popular and maintained ruby gem. (#13855) Stillhart 2026-01-24 07:24:52 +01:00
  • 0209c268bb llama: fix CUDA MMA errors in release build (#13874) v0.15.0-rc6 v0.15.0 Jeffrey Morgan 2026-01-23 20:10:04 -08:00
  • 8e22b09e2c ggml-cuda: fix fattn build for GLM 4.7 flash support fix-cuda12-fattn-shmem Jeffrey Morgan 2026-01-24 03:12:49 +00:00
  • 912d984346 llama: fix fattn-tile shared memory overflow on sm_50/52 (#13872) v0.15.0-rc5 Jeffrey Morgan 2026-01-23 19:22:32 -08:00
  • aae6ecbaff cmd: rename ollama config to ollama launch (#13871) v0.15.0-rc4 Parth Sareen 2026-01-23 21:40:40 -05:00
  • 64737330a4 Re-apply "model: add MLA absorption for glm4moelite" with fix (#13870) Jeffrey Morgan 2026-01-23 18:40:28 -08:00
  • 2eda97f1c3 Revert "model: add MLA absorption for glm4moelite (#13810)" (#13869) v0.15.0-rc3 Jeffrey Morgan 2026-01-23 17:14:15 -08:00
  • 66831dcf70 x/imagegen: fix image editing support (#13866) v0.15.0-rc2 Jeffrey Morgan 2026-01-23 15:37:17 -08:00
  • 1044b0419a model: add MLA absorption for glm4moelite (#13810) Jeffrey Morgan 2026-01-23 14:47:42 -08:00
  • 771d9280ec cmd: ollama config fix droid model name configuration (#13856) Parth Sareen 2026-01-23 14:44:22 -05:00
  • 862bc0a3bf x/imagegen: respect stream=false in /api/generate (#13853) Jeffrey Morgan 2026-01-22 22:16:39 -08:00
  • c01608b6a1 x/imagegen: add image edit capabilities (#13846) v0.15.0-rc1 Jeffrey Morgan 2026-01-22 20:35:08 -08:00
  • 199c41e16e cmd: ollama config command to help configure integrations to use Ollama (#13712) Parth Sareen 2026-01-22 23:17:11 -05:00
  • 3b3bf6c217 x/imagegen: replace memory estimation with actual weight size (#13848) Jeffrey Morgan 2026-01-22 18:32:41 -08:00
  • f52c21f457 fix: handle Enter key pressed during model loading (#13839) Parth Sareen 2026-01-22 21:32:02 -05:00
  • 8b4410633d Add image generation documentation ollama-imagegen-docs jmorganca 2026-01-22 13:51:56 -08:00
  • c73feaf73d Clean up the manifest and modelpath (#13807) parth/fix-multiline-inputs Patrick Devine 2026-01-21 11:46:17 -08:00
  • cc3ac5fee3 docs: update instructions for ollama config command brucemacd/config-docs Bruce MacDonald 2026-01-21 17:03:41 -08:00
  • b5d0f72f16 x/imagegen: remove qwen_image and qwen_image_edit models (#13827) v0.15.0-rc0 Jeffrey Morgan 2026-01-21 13:37:08 -08:00
  • 148a1be0a3 Clean up the manifest and modelpath (#13807) Patrick Devine 2026-01-21 11:46:17 -08:00
  • e4d0a8f016 draft: model manifest file interface mxyng/model-files Michael Yang 2026-01-19 13:34:24 -08:00
  • 268c2a1df1 fix: remove multiline option in non-experimental mode sunyongyue 2026-01-22 02:02:54 +08:00
  • 110b072200 chore: simplify runner.Execute mxyng/simple-execute Michael Yang 2026-01-19 14:00:05 -08:00
  • d6dd430abd x/imagegen: respect OLLAMA_MODELS for manifests and blobs (#13797) v0.14.3 next-n 2026-01-21 03:31:52 +06:30
  • ae78112c50 test: add lfm2.5-thinking coverage (#13802) Daniel Hiltgen 2026-01-20 12:57:02 -08:00
  • 91a3b77553 x/imagegen: respect OLLAMA_MODELS environment variable fix-imagegen-ollama-models jmorganca 2026-01-20 12:46:20 -08:00
  • 01cf7445f3 model: add lfm2 architecture and LFM2.5-1.2B-Thinking support (#13792) v0.14.3-rc3 Jeffrey Morgan 2026-01-20 12:20:53 -08:00
  • 9ef2106b47 cmd: create blob in parallel with checksum mxyng/async-upload Michael Yang 2026-01-20 09:53:56 -08:00
  • 31085d5e53 fix: use api.GenerateRequest for image generation test (#13793) Jeffrey Morgan 2026-01-20 03:23:31 -08:00
  • 20776ced38 x/imagegen: skip eval when no dtype conversion needed jmorganca/lazy-no-dtype-changes jmorganca 2026-01-19 16:25:37 -08:00
  • 5dc20e91d0 simplify imagegen-auto-detect-create jmorganca 2026-01-19 16:15:15 -08:00
  • c42e9d244f test: add image gen test case (#13698) Daniel Hiltgen 2026-01-19 16:01:31 -08:00
  • e98b5e8b4e /api/show: default to empty model_info (#13785) Devon Rifkin 2026-01-19 15:26:17 -08:00
  • bda8cb7403 cmd: auto-detect image generation models during create jmorganca 2026-01-19 14:43:14 -08:00
  • 68e00c7c36 fix: prevent image generation models from loading during deletion (#13781) v0.14.3-rc2 Jeffrey Morgan 2026-01-19 12:48:34 -08:00
  • 4f138a1749 model: add Glm4MoeLiteForCausalLM architecture to support GLM-4.7-Flash (#13779) Jeffrey Morgan 2026-01-19 12:47:17 -08:00
  • 03bf241c33 x/imagegen: add FP4 quantization support for image generation models (#13773) Jeffrey Morgan 2026-01-19 00:54:54 -08:00
  • a887406c24 x/imagegen: add preliminary support for FLUX.2-klein model (#13772) Jeffrey Morgan 2026-01-18 22:30:49 -08:00
  • d51e95ba7e server: prevent image generation models from reloading on every request (#13771) Jeffrey Morgan 2026-01-18 20:50:04 -08:00
  • 6b2abfb433 server: add tests and fix isHuggingFaceURL edge case parth/decrease-concurrent-download-hf Parth Sareen 2026-01-18 16:45:17 -08:00
  • 805ed4644c server: reduce download concurrency for HuggingFace URLs Parth Sareen 2026-01-18 16:38:49 -08:00
  • 582d93ab22 fix: lazy init MLX for quantization and improve library discovery fix-mlx-quantize-init jmorganca 2026-01-17 22:25:31 -08:00
  • 3d01f2aa34 parsers: refactor Nemotron parser to reuse Qwen3Coder for tool calls (#13764) Jeffrey Morgan 2026-01-17 18:28:52 -08:00
  • 634c416645 Add experimental image generation fields to /api/generate (#13753) Jeffrey Morgan 2026-01-17 18:27:41 -08:00
  • b220bfa0b9 x: clean up some experimental mlx backend code jmorganca/x-cleanup jmorganca 2026-01-17 01:27:12 -08:00
  • 57de86cc61 docs: update claude code docs (#13757) Michael 2026-01-16 22:41:34 -08:00
  • 12719b6e87 MLX - dynamic loading of mlx-c (#13735) v0.14.3-rc1 Daniel Hiltgen 2026-01-16 16:34:22 -08:00
  • a077d996e3 Fix create and show commands for experimental models (#13741) Patrick Devine 2026-01-16 14:31:55 -08:00
  • c23d5095de x/imagegen: clean up image generation code (#13725) Jeffrey Morgan 2026-01-16 12:19:25 -08:00
  • 7601f0e93e server: reject unexpected auth hosts (#13738) Bruce MacDonald 2026-01-16 14:10:36 -05:00
  • aad3f03890 app: allow macOS app to terminate during system shutdown (#13737) v0.14.3-rc0 Eva H 2026-01-16 09:05:04 -05:00
  • d132315276 uip api: expose usage data usage jmorganca 2026-01-08 22:28:33 -08:00
  • 55d0b6e8b9 integration: fix tools_test.go for ToolCallFunctionArguments API change (#13731) v0.14.2 Gyungrai Wang 2026-01-16 09:08:09 +09:00
  • 38eac40d56 openai: tweak v1/responses to conform better (#13736) v0.14.2-rc1 Devon Rifkin 2026-01-15 15:46:36 -08:00
  • 80f3f1bc25 readme: add instructions to build with MLX (#13733) v0.14.2-rc0 Jeffrey Morgan 2026-01-15 11:03:52 -08:00
  • b1a0db547b docs: add env var needed for claude code in docs (#13721) Parth Sareen 2026-01-15 10:11:00 -08:00
  • bb1a5617b6 readme: add instructions to build with MLX imagegen-readme jmorganca 2026-01-15 09:52:56 -08:00
  • 75d7b5f926 cmd: enable multi-line input and shift enter (#13694) Parth Sareen 2026-01-14 17:52:46 -08:00
  • 349d814814 docs: add marimo integration (#13326) vincent d warmerdam 2026-01-15 02:37:38 +01:00
  • c8743031e0 docs: add onyx integration (#13135) Yuhong Sun 2026-01-14 17:32:05 -08:00
  • 0d3648c1be glm-image wip jmorganca/glm-image jmorganca 2026-01-14 16:46:50 -08:00
  • 4adb9cf4bb scripts: fix macOS auto-update signature verification failure (#13713) v0.14.1 Jeffrey Morgan 2026-01-14 07:48:10 -08:00
  • 74f475e735 Revert "Documentation edits made through Mintlify web editor" (#13688) Daniel Hiltgen 2026-01-14 07:42:34 -08:00
  • 875cecba74 docs: update default context window size to 4096 tokens (#13709) Maternion 2026-01-14 14:31:28 +05:30
  • 7d411a4686 docs: update web search param in examples (#13711) Josh Daniel Bañares 2026-01-14 16:38:39 +08:00
  • 02a2401596 mlx: bundle openblas dependency (#13706) v0.14.0-rc11 v0.14.0 Daniel Hiltgen 2026-01-13 15:29:47 -08:00
  • e4b488a7b5 CI: dedup cuda libraries to reduce payload size (#13704) v0.14.0-rc10 Daniel Hiltgen 2026-01-13 11:25:31 -08:00
  • 98079ddd79 ci: add missing mlx components to release build (#13702) v0.14.0-rc9 Daniel Hiltgen 2026-01-13 09:13:09 -08:00
  • d70942f47b x/imagegen/cli: skip local model check (#13699) v0.14.0-rc8 Jeffrey Morgan 2026-01-12 22:38:10 -08:00
  • 58e4701557 scripts: increase notarization timeout to 20m (#13697) v0.14.0-rc7 Jeffrey Morgan 2026-01-12 20:38:38 -08:00
  • dbf47ee55a cmake: use CMAKE_SYSTEM_PROCESSOR instead of CMAKE_OSX_ARCHITECTURES for mlx.metallib install (#13696) v0.14.0-rc6 Jeffrey Morgan 2026-01-12 20:05:11 -08:00
  • af7ea6e96e x/imagegen: install mlx.metallib and fix macOS rpath handling, add mlx library directories to LD_LIBRARY_PATH (#13695) v0.14.0-rc5 Jeffrey Morgan 2026-01-12 19:03:11 -08:00
  • 8f1e0140e7 x/imagegen: fix mlx build in Dockerfile and macOS build script (#13693) v0.14.0-rc4 Jeffrey Morgan 2026-01-12 15:52:43 -08:00
  • 35c3c9e3c2 anthropic: allow non-thinking models when using Anthropic API (#13692) Parth Sareen 2026-01-12 15:13:26 -08:00
  • d06acbcb19 x/cmd: enable web search and web fetch with flag (#13690) v0.14.0-rc3 Parth Sareen 2026-01-12 13:59:40 -08:00
  • 9667c2282f x/imagegen: add naive TeaCache and FP8 quantization support (#13683) Jeffrey Morgan 2026-01-12 13:45:22 -08:00
  • a937a68317 server: fix slow 'ollama rm' of models with many layers (#13680) Jeffrey Morgan 2026-01-12 13:17:48 -08:00
  • 2185112d84 x/cmd: connect /set flags to behavior in experimental mode (#13684) Parth Sareen 2026-01-12 00:40:44 -08:00
  • 91926601dc x: add missing /set, /show, /load, /save commands to experimental mode (#13682) Parth Sareen 2026-01-11 23:12:31 -08:00
  • 361d6c16c2 x/imagegen/transfer: fix timeout and progress reporting (#13679) Jeffrey Morgan 2026-01-11 15:33:53 -08:00
  • 7e2496e88e Fix cmake install command in README (#13678) Patrick Devine 2026-01-11 13:16:42 -08:00
  • 5b84e29882 docs: fix troubleshooting page (#13674) WhatToPutHere 2026-01-11 21:58:07 +13:00
  • e23ddd84b8 x/grammar: add experimental GPU accelerated constrained decoding package mlx-gpu-cd jmorganca 2026-01-10 16:42:45 -08:00
  • 87cb080a91 support other modelfile commands for image generation models jmorganca/imagegen-modelfile jmorganca 2026-01-10 12:39:44 -08:00
  • 7cc2a653f2 dockerfile: remove unused COPY command (#13664) v0.14.0-rc2 Jeffrey Morgan 2026-01-09 23:07:15 -08:00
  • 2584940016 Add z-image image generation prototype (#13659) v0.14.0-rc1 Jeffrey Morgan 2026-01-09 21:09:46 -08:00
  • c6d4c0c7f2 Documentation edits made through Mintlify web editor Michael 2026-01-09 21:29:03 -05:00
  • 1ef4241727 x: request access for all commands, add welcome message (#13662) Parth Sareen 2026-01-09 18:20:39 -08:00
  • 68fafd3002 x: improve approval selector with clearer labels (#13663) Parth Sareen 2026-01-09 17:08:12 -08:00