
Wait a few months and they will have a distilled model with the same performance and 1% of the run cost.


A 100X efficiency improvement (doubtful) would still mean that costs grow 200X faster than benchmark performance.


Even assuming that past rates of inference cost scaling hold up, we would only expect a 2 OoM (100X) decrease after about a year or so. And 1% of 3.5b is still a very large number.
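
For scale, a rough back-of-envelope sketch (assuming the 3.5b figure refers to a dollar cost and that "2 OoM" means a 100X reduction):

  # Rough arithmetic only; 3.5e9 is taken from the comment above
  # and assumed to be a dollar amount.
  current_cost = 3.5e9         # assumed ~$3.5B run cost
  reduction = 10 ** 2          # 2 orders of magnitude = 100X cheaper = 1% of the cost
  remaining = current_cost / reduction
  print(f"${remaining:,.0f}")  # $35,000,000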


And to your point: "past performance is not indicative of future results." Extrapolating to infinity is the mindfever of this field.



