Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sorry, but this was ChatGPT/o1 with access to code execution (Python) and it used almost 4 minutes to do reasoning. It had done a few checks with smaller numbers, all of which had failed. And it proceeded to make a wrong conclusion (with high confidence).


Of course it failed. Tell it to write a program.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: