mirror of
https://github.com/ollama/ollama.git
synced 2026-04-21 16:25:42 +02:00
* WIP - MLX backend with gemma3 * MLX: add cmake and go tag build toggles To build the new MLX backend code: cmake --preset MLX cmake --build --preset MLX --parallel cmake --install build --component MLX go build -tags mlx . Note: the main.go entrypoint for the MLX engine will change in a follow up commit. * add experimental image generation runtime * add experimental image generation runtime * MLX: wire up cuda build for linux * MLX: get dependencies correct and dedup This is still too large for a unified github artifact, but is now "correct" for the mlx_cuda_v13 directory. * fix relative link bug in dedup * Add darwin build and readme * add go build tag for mlx dependent code and wire up build_darwin.sh * lint cleanup * macos: build mlx for x86 This will be CPU only. * cuda build instructions and fix drift from mlx bump * stale comment * Delete agent helper doc * Clean up readme.md * Revise README for tokenizer clarity and details Updated README to clarify tokenizer functionality and removed correctness section. --------- Co-authored-by: jmorganca <jmorganca@gmail.com>
25 lines
748 B
Markdown
25 lines
748 B
Markdown
# Experimental Features
|
|
|
|
## MLX Backend
|
|
|
|
We're working on a new experimental backend based on the [MLX project](https://github.com/ml-explore/mlx)
|
|
|
|
Support is currently limited to MacOS and Linux with CUDA GPUs. We're looking to add support for Windows CUDA soon, and other GPU vendors. To build:
|
|
|
|
```
|
|
cmake --preset MLX
|
|
cmake --build --preset MLX --parallel
|
|
cmake --install --component MLX
|
|
go build -tags mlx .
|
|
```
|
|
|
|
On linux, use the preset "MLX CUDA 13" or "MLX CUDA 12" to enable CUDA with the default Ollama NVIDIA GPU architectures enabled.
|
|
|
|
## Image Generation
|
|
|
|
Based on the experimental MLX backend, we're working on adding imagegen support. After running the cmake commands above:
|
|
|
|
```
|
|
go build -o imagegen ./x/imagegen/cmd/engine
|
|
```
|