Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sure, that does make things easier: one of the reasons Go took so long to solve is that one cannot define an objective score for Go beyond the end result being a boolean win or loose.

But IRL? Lots of measures exist, from money to votes to exam scores, and a big part of the problem is Goodhart's law — that the easy-to-define measures aren't sufficiently good at capturing what we care about, so we must not optimise too hard for those scores.



> Sure, that does make things easier: one of the reasons Go took so long to solve is that one cannot define an objective score for Go beyond the end result being a boolean win or loose.

Winning or losing a Go game is a much shorter term objective than making or losing money at a job.

> But IRL? Lots of measures exist

No, not that are shorter term than winning or losing a Go game. A game of Go is very short, much much shorter than the time it takes for a human to get fired for incompetence.


Time horizon is a completely different question to what I'm responding to.

I agree the time horizon of current SOTA models isn't particularly impressive. Doesn't matter in this point.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: