So it's basically just GPT-4, according to the benchmarks, with a slight edge for multimodal tasks (ie audio, video).
Google does seem to be quite far behind, GPT-4 launched almost a year ago.
But that's my point. It doesn't matter who's better exactly right now. Let's see how this plays out over the next few years.
Whether one company or another is 10% better or worse than another at some metric right now -- that just couldn't be less relevant in terms of how this will ultimately end up.
This is interesting in that it implies that catching up is possible if you have enough data, engineers and compute. This also potentially implies that adjacent players such as Nvidia could gain an edge long term because they are a leader in one of the three.