ollama

mirror of https://github.com/ollama/ollama.git synced 2026-04-18 06:54:09 +02:00

Author	SHA1	Message	Date
Daniel Hiltgen	79c1e93c00	bench: improve benchmarking tool (#14240 ) New features: - Warmup phase to eliminate cold-start outliers - time-to-first-token measured in each epoch - VRAM/memory tracking to identify CPU spillover - Controlled prompt length - Defaults to 6 epochs and 200 tokens max Benchstat fixes: - ns/request instead of ns/op — non-standard unit created a separate group instead of grouping with timing metrics - Token count as the N field — benchstat interprets N as iteration count for statistical weighting, not as a token count	2026-03-15 11:47:31 -07:00
Eloi Torrents	dac4f17fea	cmd/bench: fix binary name in README (#13276 )	2025-12-10 14:16:58 -08:00
Julia Scheaffer	56b8fb024c	cmd/bench: fix options table in cmd/bench/README.md (#13216 )	2025-12-10 14:07:48 -08:00
Patrick Devine	d7fd72193f	tests: basic benchmarking test framework (#12964 ) This change adds a basic benchmarking test framework for Ollama which can be used to determine the prefill, eval, load duration, and total duration for running a given model or models.	2025-11-15 18:17:40 -08:00