I get what you're saying and the reasoning behind it, but older hardware has never stayed useful in settings where power consumption is part of the equation.
This is the biggest threat to the GPU economy – software breakthroughs that enable inference on commodity CPU hardware or specialized ASIC boards that hyperscalers can fabricate themselves. Google has a stockpile of TPUs that seem fairly effective, although it’s hard to tell for certain because they don’t make it easy to rent them.
I don't think we'll need to wait for anything as unpredictable as a breakthrough. Optimizing inference for the most clearly defined tasks, which are also the ones where value is easiest to quantify (coding, for example), is already underway.