Using the hardware decoder requires creating a new decoder for each size of image you're decoding, and round-trip latency to get the frame back and do compositing on the web page. In addition, most hardware decoders have a limited number of decode contexts available, forcing the decodes to be serialized. Not a great prospect for an average website with tens to hundreds of images.