The point of my comment is that even the output distribution itself represents intelligence. If you give the model a tricky Yes/No question and it produces a distribution that puts 99.97% on "Yes" and negligible probability on every other token, that is interesting in its own right. Hofstadter is surprised that any amount of non-trivial reasoning can happen in a single forward pass.