Interesting that the hardware is NVIDIA Blackwell, not Google TPUs. That means Google will likely keep an energy-efficiency and cost advantage, and keep their proprietary hardware out of other people's reach.
Getting a whole business set up to sell TPU hardware to third parties (design, manufacturing, sales, support, etc.) is probably not worth it when demand for TPUs in their own cloud already exceeds supply.
Businesses running their own hardware probably prefer CUDA as well, since it is more generally useful.
Part of the reason is likely customers' preference to have CUDA available, which TPUs do not support. TPUs are superior for many use cases, but customers like the portability of targeting CUDA.
My limited understanding is that CUDA wins on smaller batches and jobs while TPUs win on larger ones: CUDA is easier to use and better at typical small workloads, but at some point, for bigger training and inference loads, TPUs start making sense.
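One concrete reason the small-vs-large distinction matters less than it used to on the framework level: JAX-style code is backend-agnostic, so the same program compiles via XLA for CPU, GPU, or TPU, and larger batches simply amortize the compile/dispatch overhead better on big accelerators. A minimal sketch (a toy layer, not any real workload; the function name and shapes are made up for illustration):

```python
# Sketch: identical JAX code runs on whatever backend is present (CPU/GPU/TPU);
# XLA handles the target. Larger batches amortize fixed dispatch overhead,
# which is part of why big accelerators favor big jobs.
import jax
import jax.numpy as jnp

@jax.jit
def forward(w, x):
    # Toy "layer": batched matmul followed by a ReLU nonlinearity.
    return jax.nn.relu(x @ w)

key = jax.random.PRNGKey(0)
w = jax.random.normal(key, (128, 64))   # hypothetical weight shape

for batch in (1, 1024):                 # small vs large batch, same code
    x = jnp.ones((batch, 128))
    y = forward(w, x)
    print(batch, y.shape)
```

Nothing here is TPU-specific; that portability (and its absence when code is written directly against CUDA) is the lock-in being described above.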
Not really. Reverse engineering a modern chip is no small feat, and any company capable of it is also capable of designing its own from scratch. Either way, getting something taped out (and debugged) on a modern process is massively expensive.