mirror of
https://github.com/ollama/ollama.git
synced 2026-04-27 19:25:55 +02:00
This implements the Open Compute Microscaling (MX) FP4 format as a tensor type with backend implementations focusing on mulmat and mulmatid on CPU, CUDA, and Metal.