It's a symptom of asking models for answers that aren't exactly in the training set: the internal interpolation the models do probably hits edge cases where, statistically, it goes down the wrong path.
This is exactly it: it's a result of RLVR (reinforcement learning with verifiable rewards), where we push the model to reason its way to an answer even when that information isn't in its base training.
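For anyone unfamiliar with RLVR, the core idea is that the model only gets rewarded when a checkable final answer comes out right. Here's a minimal sketch of that kind of reward signal, assuming a toy exact-match check (the function and examples are made up for illustration, not any lab's actual implementation):

```python
# Toy sketch of an RLVR-style "verifiable" reward: a binary check on the
# final answer. Note it only scores the answer itself -- abstaining with
# "I don't know" scores no better than a confident wrong guess, which is
# one intuition for why training like this can encourage guessing.

def verifiable_reward(model_answer: str, reference_answer: str) -> float:
    """Return 1.0 if the model's final answer matches the reference, else 0.0."""
    return 1.0 if model_answer.strip().lower() == reference_answer.strip().lower() else 0.0

# Hypothetical rollouts for a question the base model doesn't actually know:
print(verifiable_reward("Paris", "Paris"))         # 1.0 -- a lucky guess is rewarded
print(verifiable_reward("Lyon", "Paris"))          # 0.0
print(verifiable_reward("I don't know", "Paris"))  # 0.0 -- same penalty as being wrong
```

Under that kind of signal, a policy that always guesses something plausible beats one that admits uncertainty, at least on the training distribution.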