Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Perhaps, but over enough data pattern matching can becomes generalization ...

One of the interesting DeepSeek-R results is using a 1st generation (RL-trained) reasoning model to generate synthetic data (reasoning traces) to train a subsequent one, or even "distill" into a smaller model (by fine tuning the smaller model on this reasoning data).

Maybe "Data is all you need" (well, up to a point) ?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: