Hacker News new | past | comments | ask | show | jobs | submit login

> outperformed zero-shot GPT-4o

Cool stuff! Does this do RLHF or just pretraining? If the latter, how did you manage to beat GPT 4?




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: