Commit Graph

  • 57653b8e42 cmd/launch: show WSL guidance on Windows instead of handing off (#15637) main v0.21.0-rc1 v0.21.0 Parth Sareen 2026-04-16 17:18:04 -07:00
  • a50ce61c54 launch: skip unchanged managed-single rewrite (#15633) Parth Sareen 2026-04-16 16:20:42 -07:00
  • 2bb7ea00d2 create: avoid gc race with create (#15628) Daniel Hiltgen 2026-04-16 13:29:16 -07:00
  • 55fa80d07a mlx: additional gemma4 cache fixes (#15607) Daniel Hiltgen 2026-04-16 13:07:19 -07:00
  • b9cb535407 mlx: fix gemma4 cache to use logical view (#15617) v0.21.0-rc0 Daniel Hiltgen 2026-04-16 11:54:30 -07:00
  • 031baef094 mlx: fix imagegen lookup (#15588) Daniel Hiltgen 2026-04-16 10:39:00 -07:00
  • 7d271e6dc9 cmd/launch: add Copilot CLI integration (#15583) Mike Wallio 2026-04-15 20:22:53 -04:00
  • c88dae2d6b Merge pull request #15612 from ollama/drifkin/gemma4-split-templates Devon Rifkin 2026-04-15 17:15:35 -07:00
  • a67e30cf4e Update docs launch-copilot-cli ParthSareen 2026-04-15 15:37:58 -07:00
  • 283b393ed9 docs(readme): add Copilot CLI launch integration Mike Wallio 2026-04-14 10:28:08 -04:00
  • 1b3a200c25 docs(integrations): add Copilot CLI guide Mike Wallio 2026-04-14 10:16:37 -04:00
  • f4438d8215 feat(launch): add Copilot CLI integration Mike Wallio 2026-04-14 10:16:37 -04:00
  • 9e3618d663 make empty block conditional Devon Rifkin 2026-04-15 15:35:25 -07:00
  • 5d920cc6bc Keep Gemma4 router projection in source precision (#15613) Daniel Hiltgen 2026-04-15 15:04:23 -07:00
  • e585ecd11f gemma4: render differently based on model size Devon Rifkin 2026-04-15 14:37:16 -07:00
  • cdddea0592 launch: always list cloud recommendations first (#15593) Eva H 2026-04-15 13:17:35 -07:00
  • 43f90def04 launch: add hermes (#15569) Parth Sareen 2026-04-15 12:00:23 -07:00
  • 06ae6367bd mlx: fix RotatingKVCache.concat() dropping context on mid-rotation (#15591) Daniel Hiltgen 2026-04-14 18:29:06 -07:00
  • 48ad7085c4 mlx: Improve gemma4 performance with fused operations (#15587) Daniel Hiltgen 2026-04-14 18:04:04 -07:00
  • e1e3cec8d0 models: fuse MLP activation functions via mlx_compile Jesse Gross 2026-04-13 12:20:39 -07:00
  • d3e67e305c mlx: add compiled closure support Jesse Gross 2026-04-13 12:20:33 -07:00
  • 7a3ed0a1b4 adding test hoyyeva/opencode-thinking Eva Ho 2026-04-13 15:37:04 -07:00
  • 03f9e57274 add test Eva Ho 2026-04-08 15:09:17 -07:00
  • 30d9100fff launch: add thinking capability detection to opencode Eva Ho 2026-04-08 14:42:57 -07:00
  • 698e04a14b launch: OpenCode inline config (#15586) Eva H 2026-04-14 15:08:42 -07:00
  • 1d9537bc33 launch/openclaw: fix --yes flag behaviour to skip channels configuration (#15589) Eva H 2026-04-14 13:57:35 -07:00
  • 27d7bd37a7 models: fuse MLP activation functions via mlx_compile origin/jessegross/closures Jesse Gross 2026-04-13 12:20:39 -07:00
  • 3f8e0af045 mlx: add compiled closure support Jesse Gross 2026-04-13 12:20:33 -07:00
  • 8ddbd9bf60 launch: fetch recommended models from server endpoint brucemacd/launch-fetch-reccomended Bruce MacDonald 2026-04-13 20:27:18 -07:00
  • 120424d832 Revert "launch/opencode: use inline config (#15462)" (#15568) Eva H 2026-04-13 18:40:17 -07:00
  • 5818001610 launch: skip unchanged integration rewrite configration (#15491) Eva H 2026-04-13 17:18:56 -07:00
  • 2cba7756c5 Gemma4 on MLX (#15244) v0.20.8-rc0 Daniel Hiltgen 2026-04-13 16:36:51 -07:00
  • 8d0dcf4b6d Merge pull request #15561 from ollama/drifkin/backport v0.20.7-rc1 v0.20.7 release_v0.20.7 Devon Rifkin 2026-04-13 15:17:26 -07:00
  • 2b456af804 gemma4: restore e2b-style nothink prompt (#15560) Devon Rifkin 2026-04-13 14:26:15 -07:00
  • 90e2ac0c41 Revert "gemma4: fix nothink case renderer (#15553)" (#15556) Devon Rifkin 2026-04-13 13:12:18 -07:00
  • c2762b5a0d Revert "gemma4: add nothink renderer tests (#15554)" (#15555) Devon Rifkin 2026-04-13 13:00:59 -07:00
  • bf2a421727 gemma4: restore e2b-style nothink prompt (#15560) Devon Rifkin 2026-04-13 14:26:15 -07:00
  • f3cf6b75fb launch/opencode: use inline config (#15462) Eva H 2026-04-13 13:41:31 -07:00
  • 5dfac387a6 Revert "gemma4: fix nothink case renderer (#15553)" (#15556) Devon Rifkin 2026-04-13 13:12:18 -07:00
  • a99e5d9c22 mac: prevent generate on cross-compiles (#15120) Daniel Hiltgen 2026-04-13 13:04:58 -07:00
  • 0abf3aca36 cgo: suppress deprecated warning to quiet down go build (#15438) Daniel Hiltgen 2026-04-13 13:04:11 -07:00
  • ee0266462a Revert "gemma4: add nothink renderer tests (#15554)" (#15555) Devon Rifkin 2026-04-13 13:00:59 -07:00
  • c88fb286ec mlx: add op wrappers for Conv2d, Pad, activations, trig, and masked SDPA (#14913) Daniel Hiltgen 2026-04-13 11:43:24 -07:00
  • d3da29cbfc mlx: mixed-precision quant and capability detection improvements (#15409) Daniel Hiltgen 2026-04-13 11:43:07 -07:00
  • 1b70bb8a10 gemma4: add nothink renderer tests (#15554) v0.20.7-rc0 Devon Rifkin 2026-04-13 11:38:19 -07:00
  • ec29ce4ce3 gemma4: fix compiler error on metal (#15550) Daniel Hiltgen 2026-04-13 11:32:00 -07:00
  • 4d75f5da03 gemma4: fix nothink case renderer (#15553) Devon Rifkin 2026-04-13 11:23:19 -07:00
  • 798fd09bfe Update to ROCm 7.2.1 (#15483) saman-amd 2026-04-12 15:11:58 -04:00
  • 9330bb9120 gemma4: be less strict about whitespace before bare keys (#15494) v0.20.6-rc1 v0.20.6 Devon Rifkin 2026-04-11 16:30:27 -07:00
  • 40a1317dfd gemma4: update renderer to match new jinja template (#15490) v0.20.6-rc0 Devon Rifkin 2026-04-10 15:45:27 -07:00
  • fdfe9cec98 model/parsers: fix missing parallel tool call indices (#15467) Devon Rifkin 2026-04-10 15:23:21 -07:00
  • 9517864603 app/ui: re-validate image attachments when selected model changes (#15272) Matteo Celani 2026-04-10 23:03:51 +02:00
  • 8e6d86dbe3 docs: add hermes agent integration guide (#15488) Bruce MacDonald 2026-04-10 13:13:36 -07:00
  • 7c9213aac4 test: align launch expectations after rebase parth-auto-save-backup ParthSareen 2026-04-08 18:00:08 -07:00
  • cdd0bc48a3 launch: remove banner and warn only when backup-relevant configs change (#15124) Jeffrey Morgan 2026-03-29 00:27:24 -07:00
  • 04e41ddcfb launch: emit backup notice after config write ParthSareen 2026-03-28 15:04:34 -07:00
  • d94d683c32 launch: backup configs for integrations automatically ParthSareen 2026-03-28 11:06:49 -07:00
  • 80d3744c5d launch: update openclaw channel message (#15463) v0.20.5-rc2 v0.20.5 Parth Sareen 2026-04-09 15:20:30 -07:00
  • 2a94f03823 launch: add re-run hint to dependency error message (#15439) v0.20.5-rc1 Eva H 2026-04-09 09:51:34 -07:00
  • eb97274e5c modelfiles: fix /save command and add shortname for safetensors based models (#15413) Patrick Devine 2026-04-08 21:05:39 -07:00
  • 6b5db12aa2 mlx: remove stale x86 libmlx library (#15443) Daniel Hiltgen 2026-04-08 20:51:47 -07:00
  • f69453457d hi parth-test ParthSareen 2026-04-08 18:25:11 -07:00
  • 612f0a17d3 fix: improve error message for unknown input item type in responses API (#15424) v0.20.5-rc0 7. Sun 2026-04-09 01:41:12 +01:00
  • 673726fa0e app: restore launch default and refine launch sidebar open for app (#15437) Parth Sareen 2026-04-08 16:59:21 -07:00
  • b5918f9785 pull/push: refine safetensors (#14946) Daniel Hiltgen 2026-04-08 14:15:39 -07:00
  • d17f482d50 launch/opencode: detect curl installed opencode at ~/.opencode/bin (#15197) Eva H 2026-04-08 13:54:51 -07:00
  • 4e16f562c0 launch: add openclaw channels setup (#15407) Parth Sareen 2026-04-08 13:25:27 -07:00
  • 55308f1421 launch: update ctx length for glm-5.1 and gemma4 (#15411) Parth Sareen 2026-04-08 12:11:50 -07:00
  • d64812eb5d cmd: improve multi-select sorting and selection status (#15200) Eva H 2026-04-08 10:39:18 -07:00
  • f86a969f27 responses: add support for fn call output arrays (#15406) v0.20.4 Devon Rifkin 2026-04-07 16:47:30 -07:00
  • 9fa80a1660 app/ui: fix lint errors for unused vars, prefer-const, and empty catch (#15282) Matteo Celani 2026-04-08 01:28:36 +02:00
  • dde09129d1 gemma4: Disable FA on older GPUs where it doesn't work (#15403) v0.20.4-rc2 Daniel Hiltgen 2026-04-07 14:54:25 -07:00
  • 780556c4d0 mlx: use default http client (#15405) Patrick Devine 2026-04-07 14:53:23 -07:00
  • dfae363b5b gemma4: add missing file (#15394) v0.20.4-rc1 Daniel Hiltgen 2026-04-07 09:18:01 -07:00
  • 30fdd229a4 create: Clean up experimental paths, fix create from existing safetensor model (#14679) v0.20.4-rc0 Daniel Hiltgen 2026-04-07 08:12:57 -07:00
  • e823bff873 gemma4: enable flash attention (#15378) Daniel Hiltgen 2026-04-07 08:12:36 -07:00
  • 8968740836 mlx: Improve M5 performance with NAX (#15345) Daniel Hiltgen 2026-04-07 08:12:24 -07:00
  • 8c8f8f3450 model/parsers: add gemma4 tool call repair (#15374) v0.20.3-rc0 v0.20.3 Devon Rifkin 2026-04-06 18:47:17 -07:00
  • 82f0139587 launch/openclaw: patch approvedScopes baseline for TUI pairing (#15375) Parth Sareen 2026-04-06 18:00:12 -07:00
  • 26a58b294c app: update featured models (#15373) Bruce MacDonald 2026-04-06 16:35:35 -07:00
  • 34a790a2e6 model/parsers: suppress extra gemma4 closing tool tags (#15370) Devon Rifkin 2026-04-06 12:41:33 -07:00
  • 4589fa2cf5 app: default app home view to new chat instead of launch (#15312) v0.20.2 Jeffrey Morgan 2026-04-03 21:50:55 -07:00
  • 2beb5445a4 mlxrunner: replace TextGenerationPipeline with scheduler jessegross/batching Jesse Gross 2026-04-03 10:57:55 -07:00
  • 98615b86a3 mlxrunner: per-sequence trie paths and seqID-parameterized cache operations Jesse Gross 2026-04-03 10:50:53 -07:00
  • d8067801c3 mlxrunner: multi-sequence KVCache, RotatingKVCache, and RecurrentCache Jesse Gross 2026-04-02 15:11:32 -07:00
  • 02fe50c90c mlxrunner: SDPA, GatedDelta, and RecurrentConv1d with KVHistory Jesse Gross 2026-04-02 12:09:50 -07:00
  • 1ea8e70d94 mlxrunner: positions tensor and RoPEWithBase Jesse Gross 2026-04-02 12:08:20 -07:00
  • b7b2aa5d4e mlxrunner: Cache.Update takes ForwardBatch and returns KVHistory Jesse Gross 2026-04-02 12:05:35 -07:00
  • 987f74c8a5 mlxrunner: introduce ForwardBatch for model forward pass Jesse Gross 2026-04-01 16:40:32 -07:00
  • 30915b6b44 mlxrunner: tokenize prompts in request handler goroutines Jesse Gross 2026-03-31 14:15:09 -07:00
  • d137b850b6 mlx: make array management thread-safe Jesse Gross 2026-03-31 14:15:04 -07:00
  • 4bc2728047 Revert "enable flash attention for gemma4 (#15296)" (#15311) v0.20.1 Daniel Hiltgen 2026-04-03 17:44:44 -07:00
  • f474a632ab mlxrunner: tokenize prompts in request handler goroutines jessegross/tokenize Jesse Gross 2026-04-03 16:25:33 -07:00
  • 82b0205061 mlx: make array management thread-safe Jesse Gross 2026-04-03 16:25:28 -07:00
  • 49d5fd5a3e model/parsers: rework gemma4 tool call handling (#15306) v0.20.1-rc2 Devon Rifkin 2026-04-03 14:35:00 -07:00
  • 3cd2b03a5e ggml: fix ROCm build for cublasGemmBatchedEx reserve wrapper v0.20.1-rc1 Jesse Gross 2026-04-03 13:26:50 -07:00
  • c8e0878814 enable flash attention for gemma4 (#15296) v0.20.1-rc0 Daniel Hiltgen 2026-04-03 12:46:18 -07:00
  • bb0c58e134 ggml: skip cublasGemmBatchedEx during graph reservation Jesse Gross 2026-04-03 11:26:03 -07:00
  • 036ed1b9b5 model/parsers: fix gemma4 arg parsing when quoted strings contain " (#15254) Devon Rifkin 2026-04-02 22:52:51 -07:00
  • 3536ef58f6 bench: add prompt calibration, context size flag, and NumCtx reporting (#15158) Daniel Hiltgen 2026-04-02 14:23:53 -07:00