Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The blind test at lmarena.ai does give it a higher Elo than GPT-4o (API), Claude, and Gemini 1.5 Pro. It seems that people do enter real-life scenarios in the arena.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: