This was my first thought too. Even if transformers turn out to be the holy grail for LLMs, people are still interested in diffusion models for image generation.
I think we’re about to see a lot of interesting specialized silicon for neural nets in the coming years, but locking yourself into a specific kind of model seems a little too specialized right now.
Diffusion models can actually be implemented with transformer backbones (diffusion transformers do exist). What makes a diffusion model unique is its training and sampling procedure, not the network architecture.
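To make that concrete, here's a minimal sketch of a DDPM-style training objective in numpy (the function names and the toy stand-in denoiser are hypothetical, not from any particular library). The point is that the denoiser is just an arbitrary callable: it could be a U-Net, a transformer, or anything that maps a noised input and a timestep to a noise prediction, and the diffusion machinery around it doesn't change.

```python
import numpy as np

def make_schedule(T=1000, beta_min=1e-4, beta_max=0.02):
    """Standard linear beta schedule; returns cumulative alpha-bar values."""
    betas = np.linspace(beta_min, beta_max, T)
    return np.cumprod(1.0 - betas)

def diffusion_loss(denoiser, x0, t, alpha_bars, rng):
    """One objective evaluation: noise x0 to step t via the forward
    process, ask the model to predict the noise, score with MSE."""
    eps = rng.standard_normal(x0.shape)
    ab = alpha_bars[t]
    x_t = np.sqrt(ab) * x0 + np.sqrt(1.0 - ab) * eps  # forward process
    eps_pred = denoiser(x_t, t)  # any architecture goes here
    return np.mean((eps_pred - eps) ** 2)

# Trivial stand-in "model" for illustration; swap in a transformer and
# the training objective above is byte-for-byte unchanged.
zero_denoiser = lambda x_t, t: np.zeros_like(x_t)

rng = np.random.default_rng(0)
alpha_bars = make_schedule()
x0 = rng.standard_normal((4, 8))
loss = diffusion_loss(zero_denoiser, x0, t=500, alpha_bars=alpha_bars, rng=rng)
```

This is also why specialized silicon tied to one backbone is risky: the expensive part (the denoiser) is exactly the part researchers keep swapping out.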