starred/ollama
Mirror of https://github.com/ollama/ollama.git (synced 2026-04-26 18:55:53 +02:00)
9e3618d663e39e1cbddf5181be657d94cc1a8e52
ollama/x/models
Latest commit: 48ad7085c4 by Daniel Hiltgen, 2026-04-14 18:04:04 -07:00
mlx: Improve gemma4 performance with fused operations (#15587)
* mlx: Improve gemma4 performance with fused operations
* review comments
| Name | Last commit | Date |
| --- | --- | --- |
| gemma3 | mlx: quantized embeddings, fast SwiGLU, and runtime fixes (#14884) | 2026-03-17 11:21:38 -07:00 |
| gemma4 | mlx: Improve gemma4 performance with fused operations (#15587) | 2026-04-14 18:04:04 -07:00 |
| glm4_moe_lite | models: fuse MLP activation functions via mlx_compile | 2026-04-14 16:38:32 -07:00 |
| llama | models: fuse MLP activation functions via mlx_compile | 2026-04-14 16:38:32 -07:00 |
| nn | mlx: add mxfp4/mxfp8/nvfp4 importing (#15015) | 2026-03-24 13:45:44 -07:00 |
| qwen3 | models: fuse MLP activation functions via mlx_compile | 2026-04-14 16:38:32 -07:00 |
| qwen3_5 | models: fuse MLP activation functions via mlx_compile | 2026-04-14 16:38:32 -07:00 |
| qwen3_5_moe | MLX: add header vendoring and remove go build tag (#14642) | 2026-03-09 17:24:45 -07:00 |