mirror of
https://github.com/ollama/ollama.git
synced 2026-04-24 09:46:01 +02:00
Replace the raw *mlx.Array token input with a ForwardBatch struct that carries InputIDs alongside sequence metadata (SeqIDs, SeqLens). InputIDs remain [1, N] shaped — all model code is unchanged beyond the signature.