
GitHub has had a bunch of them for years, the best known being Andrej Karpathy's:

https://github.com/karpathy/nanoGPT

Some others have MoE implemented.




nanoGPT is awesome (and I highly recommend his videos on it), but it’s closer to a direct reproduction of GPT-2, so it’s cool to have a really clean implementation of some newer ideas.


nanoGPT contains some new ideas; https://github.com/karpathy/minGPT is the plainer one.



