> Cicero participated anonymously in 40 games of Diplomacy in a “blitz” league on webDiplomacy.net
> Cicero ranked in the top 10% of participants who played more than one game and 2nd out of 19 participants in the league that played 5 or more games.
> As part of the league, Cicero participated in an 8-game tournament involving 21 participants, 6 of whom played at least 5 games. Participants could play a maximum of 6 games with their rank determined by the average of their best 3 games. Cicero placed 1st in this tournament.
This bit seems a little more impressive, I think. Being in the top 10% of people who’ve played at least two games might leave a lot of bad players to beat up on. Winning a tournament might (?) mean you have to beat at least a couple players who understand the thing.
It is sort of funny to think about — anyone who gets really legitimately good at anything competitive goes through multiple rounds of being the best in their social group, and then moving on from that group to a new one composed of people who were the top tier of that previous level. It isn’t obvious to me where on that informal ladder this tournament was.
But anyway, maybe the AI will follow the trajectory of chess AIs and quickly race away from human competition.
There's an interesting question of "how much do the literal best humans suck at this?" For example, in chess Magnus Carlsen might be able to beat Stockfish given a handicap of just a pawn or two. An even better computer player than Stockfish might give up three or more pawns, but even a perfect player would likely lose to Carlsen if giving up a rook. -- I'm making this up; I don't think anyone knows the real values, but as far as I know no one is remotely projecting that perfect play could overcome e.g. a queen handicap.
Similarly, in Go it seems unlikely that perfect play could overcome a nine-stone handicap (again, I could be wrong; I'm not remotely a dan-level player).
All to say, it seems likely that Diplomacy is a game where the difference between "the best human play" and "the best possible play" is much larger than in either Go or Chess.
We happened to talk about this at the Go club this evening. The strong chess players more or less agreed with you about the chess predictions, and the dan-level Go players say today's AIs can definitely give the best pros a 3-stone handicap (tried and tested), probably 4 or more, and perfect play is worth a few stones beyond that (unclear how many, but probably not many, so not 9 stones altogether).
I attend the Ramat Gan Go Club, but there are Go clubs everywhere around the world, and they tend to be in the same places HN commenters live, go figure. See e.g. https://www.usgo.org/where-play-go
> All to say, it seems likely that Diplomacy is a game where the difference between "the best human play" and "the best possible play" is much larger than in either Go or Chess.
Definitely, Diplomacy in general is substantially understudied compared to Go or Chess (largely because it's a tiny community). You can play for less than a year and reach top-level performance, and much of the established wisdom/strategy among players is fairly bad.
Even the best Diplomacy players are only scratching the surface of how good someone could be.
I'm a little bit suspicious of this. They're not explicit about the scoring, but taking the average of your top 3 results is a huge advantage for those who played more games.
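To make that concrete, here's a minimal simulation sketch (my own, not from the paper; it assumes each game's score is an independent standard-normal draw, which is not how webDiplomacy actually scores games). It shows that under "average of your best 3 games" ranking, simply playing more games raises your expected rank score:

```python
import random

def best3_average(num_games: int, trials: int = 100_000) -> float:
    """Expected average of a player's top 3 scores over num_games games.

    Hypothetical model: each game's score is an independent draw from a
    standard normal distribution (NOT webDiplomacy's real scoring rule).
    """
    total = 0.0
    for _ in range(trials):
        scores = [random.gauss(0.0, 1.0) for _ in range(num_games)]
        # Rank score is the average of the best 3 games, per the tournament rules.
        total += sum(sorted(scores, reverse=True)[:3]) / 3
    return total / trials

for n in (3, 4, 5, 6):
    print(f"{n} games played -> expected best-3 average: {best3_average(n):+.3f}")
```

Under those assumptions the expected best-3 average climbs with every extra game, because a player who enters 6 games gets more chances to land three good results. Two equally skilled players can end up ranked apart purely by how many games they entered.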
Diplomacy is a bit of a choose-your-own-adventure game too. Like, there's an objective criterion (average supply centers at the agreed end of game), but the human tendency is to try and win individual games. Humans will often choose to play sub-optimal strategies for better entertainment value.
I think the real accomplishment here is the ability to fool humans into thinking they're not playing a bot. That's an impressive thing to do even these days.