Hacker News

This is reminiscent of “Cramming”, a paper from a few years ago in which the authors tried to find the best language model they could train in one day on a single consumer GPU: https://arxiv.org/abs/2212.14034



