Fixing OpenCode Prompt Cache Misses When Using GPT via a Third-Party Proxy

March 26, 2026 · 539 words · 3 min

While using OpenCode with GPT 5.3 Codex for daily development, I noticed abnormally high token consumption: around 69K input tokens per request, with virtually zero cache hits. The same model and the same proxy worked fine with Codex CLI, where caching functioned as expected, which pointed to OpenCode's requests rather than the proxy or the model. This post documents the full debugging and resolution process.