What changed under the hood, why it matters for your bills, and how to use the new bits.
Upstream prompt-cache pass-through, semantic cache, smart router, batch API, content moderation, PII redaction, vision token math fix, 7 new models behind 3 new providers, prompt templates, fine-tune lifecycle, reserved throughput, and Prometheus/Grafana observability — all additive. Existing OpenAI SDK code keeps working without a single line changed.
Read post →