Fixing OpenCode Prompt Cache Misses When Using GPT via Third-Party Proxy
March 26, 2026 · 539 words · 3 min
While using OpenCode with GPT 5.3 Codex for daily development, I noticed abnormally high token consumption — around 69K input tokens per request with virtually zero cache hits. The same model and proxy worked fine with Codex CLI, where caching functioned as expected. This post documents the full debugging and resolution process.