
So it turns out the hardware needed to run a pretrained model is pretty much the same as the hardware needed to train a model. In both cases, it means lots of matrix multiplication.

Of course, training a model takes longer given the same amount of processing power, but for applications like video processing, just applying the model can be pretty demanding.
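
To make the "it's all matrix multiplication" point concrete, here's a rough toy sketch in plain Python. It's my own illustration, not taken from any real framework: the helper names (`matmul`, `transpose`, `forward`, `train_step`) are hypothetical, and it assumes a single linear layer trained with half the squared error as the loss. Inference is one matmul; a training step reuses that same matmul and adds another one for the gradient.

```python
# Toy sketch: one linear layer, matrices stored as lists of rows.
# All function names are hypothetical helpers for illustration only.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    inner = len(b)
    assert all(len(row) == inner for row in a), "shape mismatch"
    cols = len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]

def transpose(m):
    """Swap rows and columns."""
    return [list(col) for col in zip(*m)]

def forward(x, w):
    """Inference: a single matmul, y = x @ w."""
    return matmul(x, w)

def train_step(x, w, target, lr=0.1):
    """One gradient-descent step on the loss 0.5 * ||x @ w - target||^2."""
    y = forward(x, w)                      # same matmul as inference
    err = [[yi - ti for yi, ti in zip(y_row, t_row)]
           for y_row, t_row in zip(y, target)]
    grad_w = matmul(transpose(x), err)     # extra matmul: dW = x^T @ (y - target)
    return [[wij - lr * gij for wij, gij in zip(w_row, g_row)]
            for w_row, g_row in zip(w, grad_w)]
```

A real network stacks many such layers and repeats the training step a huge number of times, which is where the extra training time comes from, but the core operation the hardware has to do fast is the same in both cases.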


