This seems very rushed because of DeepSeek's R1 and Anthropic's Claude 3.7 Sonne...

bitshiftfaced · 2025-02-27T21:40:39 1740692439

This strikes me as the opposite of rushed. I get the impression that they've been sitting on this for a while and couldn't make it look as good as previous improvements. At some point they had to say, "welp here it is, now we can check that box and move on."

apsec112 · 2025-02-27T20:18:58 1740687538

At least according to the WSJ, they had planned to release it earlier but struggled to get the model quality up, especially relative to cost.

bhouston · 2025-02-27T20:21:30 1740687690

They do have coding benchmarks; I summarized them here: https://news.ycombinator.com/item?id=43197955