Hacker Newsnew | past | comments | ask | show | jobs | submit | hackthegibson2's submissionslogin
1.Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonw.substack.com)
5 points by hackthegibson2 4 months ago | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: