Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
lorey
24 days ago
|
parent
|
context
|
favorite
| on:
Without benchmarking LLMs, you're likely overpayin...
Yes, absolutely. This aligns with what we found. It seems to be necessary to be very clear on scoring (at least for Opus 4.5).
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: