OpenAI has been hiding their datasets, and certainly haven't credited me for the...

wrasee · 2025-01-29T17:41:59 1738172519

It is a remarkable achievement. But if “some training data from OpenAI” turns out to essentially be a wholesale distillation of their entire model (along with Llama etc) I do think that somewhat dampens the spirit of it.

We don’t know that of course. OpenAI claim to have some evidence and I guess we’ll just have to wait and see how this plays out.

There’s also a substantial difference between training of the entire internet and one that very specifically targets your competitor's products (or any specific work directly).

ambicapter · 2025-01-29T16:52:01 1738169521

Only weird if you think what OpenAI did should be the norm.

wrasee · 2025-01-29T17:23:35 1738171415

Right. I think many here are enjoying the Schadenfreude against OpenAI, but that hardly makes it right. It just makes it a race to the bottom.