mirror of
https://github.com/ollama/ollama.git
synced 2026-04-26 02:36:09 +02:00
When we later have a large batch running purely on a CPU, this results the error: GGML_ASSERT(talloc->buffer_id >= 0) Disabling this means that we will incrementally reallocate memory as the graph grows. Fixes #10410