Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
mountainriver
8 months ago
|
parent
|
context
|
favorite
| on:
A.I. Is Getting More Powerful, but Its Hallucinati...
This is exactly it, it’s the result of RLVR, where we force the model to reason about how to get to an answer when that information isn’t in its base training.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: