In a chat setting you hit the cache every time you add a new prompt: all histori...

jonhohle · 2025-12-19T21:45:37 1766180737

I’m shocked that this hasn’t been a thing from the start. That seems like table stakes for automating repetitive tasks.

qeternity · 2025-12-19T22:36:04 1766183764

It has been a thing. In a single request, this same cache is reused for each forward pass.

It took a while for companies to start metering it and charging accordingly.

Also companies invested in hierarchical caches that allow longer term and cross cluster caching.