So it turns out the hardware needed to run a pretrained model is pretty much the same as the hardware needed to train one. In both cases, the workload is mostly matrix multiplication.
Of course, training a model takes longer given the same amount of processing power - but for applications like video processing, just applying the model can be pretty demanding.
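To make the matrix multiplication point concrete, here is a minimal sketch of one linear layer in plain Python. The function names (`matmul`, `forward`, `backward`) and the toy numbers are made up for illustration, and real frameworks use optimized GPU kernels rather than nested loops. Applying the layer is one matrix multiplication; a training step runs that same multiplication and then two more to get the gradients.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    inner = len(b)
    assert all(len(row) == inner for row in a), "shape mismatch"
    cols = len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]


def transpose(m):
    """Swap rows and columns."""
    return [list(col) for col in zip(*m)]


def forward(x, w):
    """Applying the layer (inference): y = x @ w, one matrix multiplication."""
    return matmul(x, w)


def backward(x, w, grad_y):
    """Gradient step for the same layer: two more matrix multiplications."""
    grad_w = matmul(transpose(x), grad_y)  # dL/dw = x^T @ dL/dy
    grad_x = matmul(grad_y, transpose(w))  # dL/dx = dL/dy @ w^T
    return grad_x, grad_w


# Toy data (hypothetical): a batch of 2 inputs with 2 features, mapped to 3 outputs.
x = [[1.0, 2.0], [3.0, 4.0]]
w = [[0.5, -1.0, 0.0], [1.5, 2.0, 1.0]]

y = forward(x, w)                         # all that inference needs
grad_y = [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]]
grad_x, grad_w = backward(x, w, grad_y)   # extra work only training needs
```

For this layer a training step does roughly three multiplications where inference does one, and training repeats that over many batches. That is why the same kind of hardware suits both jobs, with training simply needing it for longer.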