Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This seems very rushed because of DeepSeek's R1 and Anthropic's Claude 3.7 Sonnet. Pretty underwhelming, they didn't even show programming? In the livestream, they struggled to come up with reasons why I should prefer GPT-4.5 over GPT-4o or o1.


This strikes me as the opposite of rushed. I get the impression that they've been sitting on this for a while and couldn't make it look as good as previous improvements. At some point they had to say, "welp here it is, now we can check that box and move on."


At least according to WSJ, they had planned to release it earlier but struggled to get the model quality up, especially relative to cost


they do have coding benchmarks, I summarized them here: https://news.ycombinator.com/item?id=43197955




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: