It isn’t even that unreasonable for the AI companies to not be profitable at the moment (they are probably betting they can decrease costs before they run out of money, and want to offer people something like what the final experience will be). But it’s totally bizarre that people are comparing the cost of running locally to the current investor-subsidized remote costs.
Eventually, these things should get closer. Eventually the hosted solutions have to make money. Then we’ll see if the costs of securing everything and paying some tech company CEO’s wage are higher than the benefits of centrally locating the inference machines. I expect local running will win, but the future is a mystery.
Locally, I'm paying for my GPU hardware 24/7. There's some electricity, but at my scale it's mostly hardware cost (plus I have excess free energy to burn).
Remotely, I probably use less than an hour of compute a day, and only on workdays.
Combine that with batching being computationally more efficient, and it's hard to see anything other than local inference ALWAYS being roughly 10x more expensive than data-centre inference.
(I'd hope and love to be proven wrong about this as it plays out, but that's the way I see it now.)
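The utilization argument above can be sketched as a back-of-envelope calculation. All the numbers here (hardware price, lifetime, hours of use, batching gain) are illustrative assumptions, not measured figures; the point is only that low utilization dominates the local cost per useful hour:

```python
# Back-of-envelope: amortized hardware cost per hour of actual inference work,
# local (low utilization) vs. data centre (high utilization plus batching).
# All inputs are illustrative assumptions, not measured figures.

def cost_per_useful_hour(hardware_cost, lifetime_years,
                         useful_hours_per_week, throughput_factor=1.0):
    """Hardware cost divided by total useful inference hours over its life."""
    total_useful_hours = lifetime_years * 52 * useful_hours_per_week
    return hardware_cost / (total_useful_hours * throughput_factor)

# Local: a $2,000 GPU, 3-year life, ~5 useful hours/week (1 h on workdays).
local = cost_per_useful_hour(2000, 3, 5)

# Data centre: same class of hardware, ~80% of 168 h/week utilized,
# and (assumed) 2x effective throughput from batching requests.
remote = cost_per_useful_hour(2000, 3, 0.8 * 168, throughput_factor=2.0)

print(f"local:  ${local:.2f} per useful hour")
print(f"remote: ${remote:.4f} per useful hour")
print(f"ratio:  {local / remote:.0f}x")
```

With these particular assumptions the gap comes out well above 10x; the exact multiple moves around with the inputs, but any scenario where the local card sits idle most of the week points the same direction.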