Interesting! this was already the case with TPUs easily beating A100s. We sell S...

memossy · on March 11, 2024

The v5es and v5ps are pretty amazing at running SD, giving code for SD3 now to optimise it on those.

v5es are particularly interesting given the millions that will land and the large pod sizes, particularly well constructed for million token context windows.

doctorpangloss · on March 11, 2024

But you and I can't buy a TPU. You and I can buy an H100.

isoprophlex · on March 11, 2024

Speak for yourself! I can't even afford 1/10th of an H100.

elorant · on March 11, 2024

He's probably speaking about availability, not affordability.

leansensei · on March 12, 2024

It was an attempt at humor, obviously.

renewiltord · on March 11, 2024

Which TPUs do you use? Cloud-hosted or your own hardware? Interesting insight.

1024core · on March 11, 2024

"TPUs" are a Google-only product, available* only on GCP.

* Notwithstanding the Choral boards

MasterScrat · on March 11, 2024

We started on V3s, now fully moved to V4s with some V5Es, investigating a full move towards V5E & V5P

memossy · on March 11, 2024

We use v4s, v5es & v5ps. Mostly v5ps, very stable int8 training (versus the horror that is fp8 stability)