ollama/x/mlxrunner/mlx at 4eab60c1e2e25aed9df102de40e9c6f0b1200bdb - ollama - Git Idra Informatica

starred/ollama

mirror of https://github.com/ollama/ollama.git synced 2026-04-23 09:15:44 +02:00

Files

History

Patrick Devine e9f6ea232f Add qwen3.5-next-moe support to MLX runner and models (#14417 )

This change adds support for qwen3.5-next-moe models (qwen3-next/qwen3.5-next/qwen3-coder) to the MLX runner. It also:

* introduces recurrent cache support and related MLX ops
* updates pipeline/runner integration and adds tests
* properly quantizes stacked expert tensors
* a Gated Delta Metal kernel for fast SSM inference
* adds new MLX calls for Conv1d, DepthwideConv1d, Contiguous, Exp, Log, SoftmaxAxis

2026-03-03 16:39:22 -08:00

..

update mlx-c bindings to 0.5.0 (#14380 )

2026-02-23 16:44:29 -08:00

.gitignore

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

act.go

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

array_test.go

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

array.go

mlxrunner: Refcount pinned tensors

2026-03-02 15:56:06 -08:00

CMakeLists.txt

update mlx-c bindings to 0.5.0 (#14380 )

2026-02-23 16:44:29 -08:00

dtype.go

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

dynamic.c

mlx: remove noisy error output from dynamic library loading (#14346 )

2026-02-20 23:46:07 -08:00

dynamic.go

mlx: try loading library via rpath before searching directories (#14322 )

2026-02-19 10:55:02 -08:00

dynamic.h

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

fast.go

mlxrunner: Fix memory leaks with pin/sweep lifecycle management

2026-02-23 09:50:07 -08:00

gated_delta.go

Add qwen3.5-next-moe support to MLX runner and models (#14417 )

2026-03-03 16:39:22 -08:00

generated.c

update mlx-c bindings to 0.5.0 (#14380 )

2026-02-23 16:44:29 -08:00

generated.h

update mlx-c bindings to 0.5.0 (#14380 )

2026-02-23 16:44:29 -08:00

io.go

mlxrunner: Fix memory leaks with pin/sweep lifecycle management

2026-02-23 09:50:07 -08:00

memory.go

show peak memory usage (#14485 )

2026-02-26 18:38:27 -08:00

mlx.go

Add qwen3.5-next-moe support to MLX runner and models (#14417 )

2026-03-03 16:39:22 -08:00

nn.go

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00

ops_extra.go

Add qwen3.5-next-moe support to MLX runner and models (#14417 )

2026-03-03 16:39:22 -08:00

ops.go

mlxrunner: Fix memory leaks with pin/sweep lifecycle management

2026-02-23 09:50:07 -08:00

random.go

mlxrunner: Fix memory leaks with pin/sweep lifecycle management

2026-02-23 09:50:07 -08:00

slice.go

mlxrunner: Fix memory leaks with pin/sweep lifecycle management

2026-02-23 09:50:07 -08:00

stream.go

Add MLX runner with GLM4-MoE-Lite model support (#14185 )

2026-02-10 14:57:57 -08:00