Perhaps, but over enough data pattern matching can becomes generalization ... On...

Perhaps, but over enough data pattern matching can becomes generalization ...

One of the interesting DeepSeek-R results is using a 1st generation (RL-trained) reasoning model to generate synthetic data (reasoning traces) to train a subsequent one, or even "distill" into a smaller model (by fine tuning the smaller model on this reasoning data).

Maybe "Data is all you need" (well, up to a point) ?