Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You’re thinking “any data”, he’s thinking “useful data for training an LLM”.


But isn't the underlying hypothesis that any data with sufficient underlying pattern is ultimately useful if you throw enough compute at it? His own argument is that even from the nonsense of the internet LLMs could extract general models of the world...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: