Mirror of https://github.com/ollama/ollama.git (synced 2026-04-25 02:06:11 +02:00)
Remove static VRAM estimation; report actual tensor weight size for `ollama ps`

The static estimation (EstimateVRAM, CheckMemoryRequirements) wasn't helpful. Instead, report the actual tensor weight size from the manifest.

- Remove the memory estimation check from runner startup
- Remove EstimateVRAM, CheckMemoryRequirements, and modelVRAMEstimates
- Add TotalTensorSize() to get the actual weight size from the manifest
- Use the weight size for Server.vramSize instead of estimates

Note: this is better than showing 0 or inaccurate estimates, but the weight size is a drastic underestimation of actual memory usage, since it doesn't account for activations, intermediate tensors, or MLX overhead. Future work should query real-time memory from MLX (e.g., MetalGetActiveMemory) for accurate reporting.
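A minimal sketch of what a TotalTensorSize() helper could look like. The Manifest and Layer struct shapes and the exact media-type string are assumptions for illustration; ollama's real manifest types may differ:

```go
package main

import "fmt"

// Layer and Manifest are simplified stand-ins for ollama's manifest
// structures (hypothetical shapes, for illustration only).
type Layer struct {
	MediaType string
	Size      int64
}

type Manifest struct {
	Layers []Layer
}

// TotalTensorSize sums the sizes of the model-weight layers recorded in
// the manifest. This is the on-disk tensor weight size only: it does not
// account for activations, intermediate tensors, or MLX overhead, so it
// underestimates true runtime memory use.
func (m *Manifest) TotalTensorSize() int64 {
	var total int64
	for _, l := range m.Layers {
		// Assumed media type for weight blobs; other layers
		// (templates, params, etc.) are skipped.
		if l.MediaType == "application/vnd.ollama.image.model" {
			total += l.Size
		}
	}
	return total
}

func main() {
	m := &Manifest{Layers: []Layer{
		{MediaType: "application/vnd.ollama.image.model", Size: 4000000000},
		{MediaType: "application/vnd.ollama.image.template", Size: 512},
	}}
	fmt.Println(m.TotalTensorSize())
}
```

The result would then be assigned to Server.vramSize in place of the removed estimate, which is why `ollama ps` reports a lower-bound figure rather than true VRAM occupancy.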