There's a massive build-out of hardware and energy infrastructure going on. None of that hardware is specialized to run only transformers at this point, so wouldn't that create a huge incentive to find newer and better architectures that get the most out of all this infrastructure?
Hardware that can only run transformers isn't really a meaningful category, because attention boils down to two matrix multiplications (one comparing queries against keys, one mixing the values with the resulting weights), and matrix multiplication is already the standard operation in feed-forward and convolutional layers. Any accelerator built for those gets transformers essentially for free.
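To make that concrete, here's a minimal sketch of scaled dot-product attention in NumPy (names and shapes are illustrative, not from any particular library): strip away the softmax normalization and what remains is two plain matmuls.

    import numpy as np

    def softmax(x, axis=-1):
        # Numerically stable softmax along the given axis.
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def attention(Q, K, V):
        # The core of attention: two matrix multiplications around a softmax.
        d = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d)   # matmul 1: query-key similarity
        weights = softmax(scores)       # elementwise normalization
        return weights @ V              # matmul 2: weighted sum of values

    # Toy example: 4 tokens, 8-dimensional vectors (sizes are arbitrary).
    rng = np.random.default_rng(0)
    Q, K, V = (rng.standard_normal((4, 8)) for _ in range(3))
    out = attention(Q, K, V)            # shape (4, 8)

Everything here except the softmax is a matmul, and the Q/K/V projections that feed it in a real transformer are matmuls too, so any chip that is fast at dense matrix multiplication runs attention well by construction.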