The future of AI is cache-shaped
AI's future is cache-shaped: reuse answers to cut costly inference and deliver fast, stable, predictable responses.
AI feels like magic: ask, answer. But every trick has a cost, and in AI the cost is inference.
Inference is the thinking bit, and thinking at scale is expensive in both money and energy. Multiply that by millions of questions and the bill climbs faster than a teenager on mobile data.
So the future looks familiar. When computation gets pricey, we cache. We reuse. We store answers, stop recomputing, and move on.
AI is about to rediscover its inner 1998.
The return of the canned answer
Today’s chatbots behave like overeager interns: every question triggers a full performance. Even if they’ve heard it seven times, they do the dance from scratch.
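What caching looks like here is almost embarrassingly simple. Below is a minimal sketch in Python, with a hypothetical call_model standing in for the expensive inference step: hash the (normalized) question, and only think if the answer isn't already on the shelf.

```python
import hashlib

# Hypothetical stand-in for the real, expensive model call.
def call_model(prompt: str) -> str:
    return f"(a freshly computed answer to: {prompt!r})"

cache: dict[str, str] = {}

def cached_answer(prompt: str) -> str:
    # Key on a normalized prompt so trivial variations hit the same entry.
    key = hashlib.sha256(prompt.strip().lower().encode("utf-8")).hexdigest()
    if key not in cache:
        cache[key] = call_model(prompt)  # think once...
    return cache[key]                    # ...answer from the shelf after that

print(cached_answer("What is inference?"))    # computed
print(cached_answer("  what is INFERENCE?"))  # served from cache
```

In practice you'd likely key on embeddings rather than exact strings, so near-duplicate questions count as hits too, but the shape of the idea is the same.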