But if Gemini 2.5 Pro has lately been considered the strongest coder, does SWE-bench really reflect reality?

