1. Partial client side execution of the models, then main model execution in cloud. (Example: text encoder for Stable Diffusion runs local, UNet runs cloud)
2. Local LoRAs for “dynamic ML”
3. WebGPU will be new standard to unlock models in the web
Even though the article had some good calculations and overview, I am not sure word-for-word is how I would describe its relationship to the OP.
Some predictions in the article:
1. Partial client side execution of the models, then main model execution in cloud. (Example: text encoder for Stable Diffusion runs local, UNet runs cloud)
2. Local LoRAs for “dynamic ML”
3. WebGPU will be new standard to unlock models in the web
Even though the article had some good calculations and overview, I am not sure word-for-word is how I would describe its relationship to the OP.