mirror of
https://github.com/ollama/ollama.git
synced 2026-04-17 23:54:05 +02:00
* MLX - dynamic loading of mlx-c Create a wrapper layer to indirect the dependency on mlx-c so the main ollama binary does not have a load-time dependency on mlx-c, mlx, and on linux, cuda. Lazy load the library via dlopen so we can adjust the path to ensure the dependencies are found and fail gracefully if not present. * review comments * fix broken tests
MLX Engine
Experimental MLX backend for running models on Apple Silicon and CUDA.
Build
go build -tags mlx -o engine ./x/imagegen/cmd/engine
Text Generation
./engine -model /path/to/model -prompt "Hello" -max-tokens 100
Options:
-temperature- sampling temperature (default 0.7)-top-p- nucleus sampling (default 0.9)-top-k- top-k sampling (default 40)
Supports: Llama, Gemma3, GPT-OSS
Image Generation
./engine -zimage -model /path/to/z-image -prompt "a cat" -output cat.png
Options:
-width,-height- image dimensions (default 1024x1024)-steps- denoising steps (default 9)-seed- random seed (default 42)