mirror of
https://github.com/ollama/ollama.git
synced 2026-04-18 00:54:05 +02:00
When a zstd-compressed request (e.g. from Codex CLI) hits /v1/responses with a cloud model the request failed. Fix by decompressing zstd bodies before model extraction, so cloud models are detected and proxied directly without the writer being wrapped.