Hacker News

As an example, INT8 support in WebGPU would enable running quantized models, allowing larger LLMs to run locally in the browser.

See Limitations section here: https://fleetwood.dev/posts/running-llms-in-the-browser
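To illustrate the point, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python, the kind of scheme native INT8 support in WebGPU would make efficient to execute on the GPU (the function names are illustrative, not from any particular library):

```python
def quantize_int8(weights):
    # scale maps the largest-magnitude weight onto the int8 range [-127, 127]
    scale = max(abs(w) for w in weights) / 127.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    # approximate reconstruction of the original fp32 weights
    return [v * scale for v in q]

weights = [0.4, -1.27, 0.05, 0.9]
q, scale = quantize_int8(weights)
# each INT8 weight takes 1 byte instead of 4 for FP32,
# so the same memory budget fits a model roughly 4x larger
```

Without INT8 kernels, a browser runtime has to dequantize back to floats before every matmul, which is exactly the overhead the comment is pointing at.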


