Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

These super large models seem better at keeping track of unsaid nuance. I wonder if that can still be distilled into smaller models or if there is a minimum size for a minimum level of nuance even given infinite training.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: