Commit Graph

  • f622b0c5fc launch: disable claude attribution header to preserve KV cache Jesse Gross 2026-03-17 16:35:57 -07:00
  • 5d0000634c cmd/launch: check for both npm and git before installing OpenClaw (#14888) Bruce MacDonald 2026-03-17 18:20:05 -07:00
  • 676d9845ba launch: register websearch for openclaw (#14914) v0.18.2-rc0 Parth Sareen 2026-03-17 15:03:15 -07:00
  • e37a9b4c01 cloud_proxy: for the web_search legacy path, flush on newlines (#14897) Devon Rifkin 2026-03-17 13:30:17 -07:00
  • d727aacd04 mlx: quantized embeddings, fast SwiGLU, and runtime fixes (#14884) Patrick Devine 2026-03-17 11:21:38 -07:00
  • fa69b833cd mlx: add prequantized tensor packing + changes for qwen35 (#14878) Patrick Devine 2026-03-17 11:21:18 -07:00
  • bbbad97686 sched: Model eviction for MLX Jesse Gross 2026-03-02 15:27:34 -08:00
  • bcf6d55b54 launch: fix web search, add web fetch, and enable both for local (#14886) v0.18.1-rc1 v0.18.1 Parth Sareen 2026-03-16 16:26:19 -07:00
  • 810d4f9c22 runner: fix swallowed error in allocModel graph reservation easonysliu 2026-03-14 10:35:40 +08:00
  • 856c047a6c cmd/launch: skip --install-daemon when systemd is unavailable (#14883) v0.18.1-rc0 Bruce MacDonald 2026-03-16 13:50:04 -07:00
  • 79c1e93c00 bench: improve benchmarking tool (#14240) Daniel Hiltgen 2026-03-15 11:47:31 -07:00
  • f8b657c967 cmd/launch: add guards for headless mode (#14837) Parth Sareen 2026-03-14 00:10:02 -07:00
  • 10fefe0d57 config: use native OpenClaw Ollama onboarding (#14829) Bruce MacDonald 2026-03-13 16:28:40 -07:00
  • 2f9a68f9e9 rocm: doc driver constraints (#14833) Daniel Hiltgen 2026-03-13 15:53:35 -07:00
  • 3980c0217d server: decompress zstd request bodies in cloud passthrough middleware (#14827) v0.18.0-rc2 v0.18.0 Bruce MacDonald 2026-03-13 15:06:47 -07:00
  • 870599f5da launch: remove warning for default policy (#14830) Parth Sareen 2026-03-13 15:01:38 -07:00
  • abf8e8e9c8 middleware: handle non-JSON error responses gracefully (#14828) Bruce MacDonald 2026-03-13 14:50:49 -07:00
  • f3f31a8192 anthropic: close thinking block before tool_use when no text in between (#14825) v0.18.0-rc1 Shivam Tiwari 2026-03-14 01:42:05 +05:30
  • 9e7ba835da cmd: still populate ollama ls when using ollama run <model:cloud> (#14824) v0.18.0-rc0 Devon Rifkin 2026-03-13 12:24:45 -07:00
  • 347f17b8d1 launch: add compact window for claude code (#14823) Parth Sareen 2026-03-13 12:09:23 -07:00
  • 081b9eb423 api/create: always propagate :cloud source for cloud models (#14822) Devon Rifkin 2026-03-13 11:58:00 -07:00
  • bb867c6fdb launch: fix headless --yes integration flow and policy scoping (#14815) Parth Sareen 2026-03-13 11:45:36 -07:00
  • 81f4506a61 docs: document reasoning_effort support in OpenAI-compatible API (#14821) Cadu 2026-03-13 14:57:14 -03:00
  • 76925f1284 cmd: TUI model ordering (#14814) Parth Sareen 2026-03-13 10:19:22 -07:00
  • f676231de9 server: remove experimental aliases support (#14810) v0.17.8-rc4 Devon Rifkin 2026-03-12 20:27:24 -07:00
  • af5f7c0a9e cmd: refactor tui and launch (#14609) Parth Sareen 2026-03-12 18:39:06 -07:00
  • a6b27d776b ci: fix missing windows zip file (#14807) v0.17.8-rc3 Daniel Hiltgen 2026-03-12 16:14:00 -07:00
  • 5db08d47e8 docs: add image generation capability page mintlify/image-generation-1773352582 mintlify[bot] 2026-03-12 21:56:23 +00:00
  • 539741199e mlx: perf improvements (#14768) v0.17.8-rc2 Daniel Hiltgen 2026-03-12 12:01:28 -07:00
  • 010af4e730 clean up hoyyeva/server-context-length-local-config Eva Ho 2026-03-12 14:32:19 -04:00
  • 6287a80587 add tests Eva Ho 2026-03-12 14:31:15 -04:00
  • ebd4d0e498 server: use server's context length to set as part of config for local models Eva Ho 2026-03-12 14:26:35 -04:00
  • 8f45236d09 middleware: enable local tool model for web search (#14787) Eva H 2026-03-11 17:51:39 -04:00
  • 97013a190c openai: split mixed thinking stream chunks via ToChunks (#14648) Parth Sareen 2026-03-11 14:21:29 -07:00
  • c222735c02 mlx: only log load errors when MLX is needed (#14764) Daniel Hiltgen 2026-03-11 10:31:31 -07:00
  • 87d21c7fc0 MLX: harden for init failures (#14777) Daniel Hiltgen 2026-03-10 22:52:23 -07:00
  • 7563f51fdf sample: fix lint issues in penalty sampler jmorganca/faster-reptition-penalties jmorganca 2026-03-10 22:08:39 -07:00
  • 4ad9f16d68 Fix repeat penalty default jmorganca 2026-03-10 21:45:29 -07:00
  • d64d0d235d Add sampler reset on reprocess jmorganca 2026-03-10 21:37:09 -07:00
  • 54e05172a0 Revert "runner: add token history sampling parameters to ollama runner (#14537)" (#14776) Jeffrey Morgan 2026-03-10 21:07:52 -07:00
  • 4fdeb59325 convert: handle layers_block_type config field jmorganca/convert-nemotron jmorganca 2026-03-10 20:51:52 -07:00
  • 464186e995 config: qwen3.5 recommendations (#14758) Parth Sareen 2026-03-10 18:04:57 -07:00
  • 8c4d5d6c2f cloud_proxy: send ollama client version (#14769) Devon Rifkin 2026-03-10 15:53:25 -07:00
  • bc72b14016 docs: update claude code docs (#14770) Parth Sareen 2026-03-10 15:52:41 -07:00
  • 61086083eb server: add experimental web search and web fetch routes (#14753) Parth Sareen 2026-03-09 21:52:12 -07:00
  • 62d1f01ab4 ci: Fix windows build (#14754) v0.17.8-rc1 Daniel Hiltgen 2026-03-09 19:27:59 -07:00
  • 10e51c5177 MLX: add header vendoring and remove go build tag (#14642) v0.17.8-rc0 Daniel Hiltgen 2026-03-09 17:24:45 -07:00
  • 3e06bde643 mlx: get parameters from modelfile during model creation (#14747) Patrick Devine 2026-03-09 15:33:24 -07:00
  • 6be2de8214 app: auto update should be enabled when reset to defaults (#14741) Eva H 2026-03-09 15:02:36 -04:00
  • ebb1b9ec14 rocm: update linux to v7.2 (#14391) Daniel Hiltgen 2026-03-09 08:26:55 -07:00
  • d126467d5d x/mlxrunner: replace sampler interface chain with single stateful Sampler (#14652) Patrick Devine 2026-03-07 17:50:57 -08:00
  • afb4c62fbf cloud_proxy: handle stream disconnects gracefully (#14685) Devon Rifkin 2026-03-06 19:18:52 -08:00
  • e790dc435b mlx: int4 groupsize 64 (#14682) Patrick Devine 2026-03-06 16:39:47 -08:00
  • 288077c3a3 build: smarter docker parallelism (#14653) Daniel Hiltgen 2026-03-06 16:36:22 -08:00
  • 4425c54eda create: fix localhost handling (#14681) Daniel Hiltgen 2026-03-06 16:35:58 -08:00
  • 778899a5d2 docs: format compat docs (#14678) Michael Yang 2026-03-06 14:53:17 -08:00
  • 4eab60c1e2 Reapply "don't require pulling stubs for cloud models" again (#14608) Jeffrey Morgan 2026-03-06 14:27:47 -08:00
  • 1af850e6e3 parsers: repair unclosed arg_value tags in GLM tool calls (#14656) Bruce MacDonald 2026-03-06 14:08:34 -08:00
  • 9b0c7cc7b9 cmd: override stale entries for context window pi (#14655) v0.17.7-rc2 v0.17.7 Parth Sareen 2026-03-05 16:30:24 -08:00
  • 6928630601 mlx: prevent remote creation mismatch (#14651) Daniel Hiltgen 2026-03-05 14:59:00 -08:00
  • 9896e3627f cmd/config: fix cloud model limit lookups in integrations (#14650) v0.17.7-rc1 Parth Sareen 2026-03-05 13:57:28 -08:00
  • 15732f0ea7 cmd: use native Ollama API endpoint for OpenClaw (#14649) Bruce MacDonald 2026-03-05 13:29:17 -08:00
  • 562c76d7cc cmd: add qwen3.5 context length for launch (#14626) v0.17.7-rc0 Parth Sareen 2026-03-04 14:10:52 -08:00
  • 122c68c151 server: loosen thinking level constraint (#14625) Parth Sareen 2026-03-04 13:42:18 -08:00
  • 947200698f cmd/pi: fix happy path install parth-pi-thinking ParthSareen 2026-03-04 03:00:31 -08:00
  • 82848a7806 model: fix renderer and parser for qwen3.5 (#14605) v0.17.6 Jeffrey Morgan 2026-03-03 20:58:29 -08:00
  • 39982a954e Revert "Reapply "don't require pulling stubs for cloud models"" (#14606) Jeffrey Morgan 2026-03-03 20:56:10 -08:00
  • e9f6ea232f Add qwen3.5-next-moe support to MLX runner and models (#14417) Patrick Devine 2026-03-03 16:39:22 -08:00
  • 110eff01a9 chore: remove old imagegen LLMs models (#14597) Patrick Devine 2026-03-03 13:23:40 -08:00
  • 799e51d419 Reapply "don't require pulling stubs for cloud models" Jeffrey Morgan 2026-03-03 13:17:10 -08:00
  • e8fcb29586 model/renderers: fix glm-ocr image tags in renderer prompts (#14584) Victor-Quqi 2026-03-04 04:51:34 +08:00
  • 97d2f05a6d Revert "don't require pulling stubs for cloud models (#14574)" (#14596) Jeffrey Morgan 2026-03-03 12:51:23 -08:00
  • 8207e55ec7 don't require pulling stubs for cloud models (#14574) Devon Rifkin 2026-03-03 10:46:33 -08:00
  • ad16bffc7d mlx: Remove peak memory from the API Jesse Gross 2026-03-02 11:37:52 -08:00
  • c1e3ef4bcc mlxrunner: Refcount pinned tensors Jesse Gross 2026-03-02 12:48:02 -08:00
  • a3093cd5e5 cmd/opencode: rename provider from "Ollama (local)" to "Ollama" (#14566) Parth Sareen 2026-03-02 14:17:18 -08:00
  • 23d4cad1a2 server: verify digest is not empty on create (#14555) Bruce MacDonald 2026-03-02 13:43:35 -08:00
  • 86513cb697 runner: add token history sampling parameters to ollama runner (#14537) v0.17.5 Jeffrey Morgan 2026-03-01 19:16:07 -08:00
  • 3490e9590b model/qwen3next: avoid crash in DeltaNet when offloading (#14541) Jeffrey Morgan 2026-03-01 18:44:04 -08:00
  • 67ce53b9b5 wip sampling pdevine/sampling-penalties Patrick Devine 2026-02-27 21:43:19 -08:00
  • dd497534c4 allow think/nothink in mlxrunner Patrick Devine 2026-02-26 23:29:04 -08:00
  • 560626fb43 cleanup Patrick Devine 2026-02-26 17:28:38 -08:00
  • 1a23c1a810 add qwen3.5 Patrick Devine 2026-02-25 16:21:42 -08:00
  • a6c1aa4da5 smaller recurrent cache Patrick Devine 2026-02-25 15:50:41 -08:00
  • 330b19b73f server: chunk quantization writes to reduce create memory usage jmorganca/fix-create-quantization-memory jmorganca 2026-02-28 23:21:37 -08:00
  • 8da09b1e7e qwen3next: add compatibility with imported GGUF models (#14517) Jeffrey Morgan 2026-02-28 14:21:42 -08:00
  • a60b9adcce mlxrunner: Fix prompt eval timing and count metrics Jesse Gross 2026-02-26 14:45:14 -08:00
  • a16f96658b mlxrunner: Enforce model context limit Jesse Gross 2026-02-25 15:07:09 -08:00
  • 18ab09b431 mlxrunner: Propagate pipeline errors to client via api.StatusError Jesse Gross 2026-02-26 12:23:06 -08:00
  • 638faeac54 mlxrunner: Report actual memory usage from runner Jesse Gross 2026-02-25 15:06:37 -08:00
  • 241ac6e003 check with HEAD before upload dongchen/resumable_transfer_fix Dong Chen 2026-02-27 14:29:17 -08:00
  • 17dc6918cf handle edge case when a blob has been uploaded earlier Dong Chen 2026-02-11 09:34:00 -08:00
  • dd5eb6337d mlxrunner: Fix panic on full KV cache hit Jesse Gross 2026-02-25 17:03:31 -08:00
  • 79917cf80b show peak memory usage (#14485) Patrick Devine 2026-02-26 18:38:27 -08:00
  • 857cffd22a bugfix: fix crash bug in token cache logic pdevine/sampling-cache-error Patrick Devine 2026-02-26 18:29:40 -08:00
  • cc90a035a0 model/parsers: add stable tool call indexing for glm47 and qwen3 parsers (#14484) v0.17.4 Parth Sareen 2026-02-26 18:14:29 -08:00
  • d98dda4676 model: fix qwen3 tool calling in thinking (#14477) v0.17.3 Jeffrey Morgan 2026-02-26 16:13:18 -08:00
  • d69ddc1edc fix: window app crash on startup when update is pending (#14451) v0.17.2 Eva H 2026-02-26 16:47:12 -05:00
  • 9bf41969f0 app: fix first update check delayed by 1 hour (#14427) v0.17.1 Eva H 2026-02-25 18:29:55 -05:00
  • 4d5ff25724 mlxrunner: Report actual memory usage from runner jessegross/mlx-usage Jesse Gross 2026-02-25 15:06:37 -08:00