Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The original comment says nothing about benchmarking, they just say that an AI can’t one shot their complex task?


When I read

"My favorite thing to ask the models designed for programming is ....... None of them ever get it right"

I read "benchmark".




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: