
I've been working through [0]. Like a lot of math, the notation is daunting, but once you become familiar with it, it really is a nice tool for thought.

[0]: https://arxiv.org/abs/2207.09238



This! It's the best resource I've found for explaining transformers, and the one that made them clear to me. I wish all deep learning papers were written like this, with pseudocode.



