The performance of the chip is matched to the memory size.
I think it’s a U shaped curve.
Beyond 80GB, today, the larger chip would maybe all of these: yield less, scale worse, take too much power, etc.
Like this matching of compute resources to RAM is partly the difference between CPUs and GPUs.
Anyway, it’s just to say that it isn’t a business decision. The extra RAM in the M2 doesn’t help the GPU much for the same tasks the H100 excels at, because it isn’t performant enough to use that RAM anywhere near the same way an H100 would, and if it were, there would have to be less RAM. The H100 doesn’t even have a graphics engine. It’s complicated.
> The performance of the chip is matched to the memory size.
That may be approximately true if you only look at a single generation of consumer graphics cards at a time. If you compare across generations or include non-gaming workloads the correlation falls apart.
I think it’s a U shaped curve.
Beyond 80GB, today, the larger chip would maybe all of these: yield less, scale worse, take too much power, etc.
Like this matching of compute resources to RAM is partly the difference between CPUs and GPUs.
Anyway, it’s just to say that it isn’t a business decision. The extra RAM in the M2 doesn’t help the GPU much for the same tasks the H100 excels at, because it isn’t performant enough to use that RAM anywhere near the same way an H100 would, and if it were, there would have to be less RAM. The H100 doesn’t even have a graphics engine. It’s complicated.