And using humans as 'the benchmark' is risky in itself, as it can leave us with blind spots on AI behavior. For example, we may find that humans aren't as general as we expected, or run into the "we made the Terminator and it's exterminating mankind, but it's not AGI because it doesn't have feelings" problem.
> Said one park ranger, “There is considerable overlap between the intelligence of the smartest bears and the dumbest tourists.”
[1] https://www.schneier.com/blog/archives/2006/08/security_is_a...