Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>I would be interested in reading a paper that does a good job of explaining what a parameter ends up representing in an LLM model.

https://distill.pub/2020/circuits/ https://transformer-circuits.pub/2025/attribution-graphs/bio...



That's an interesting paper and worth reading. Not sure it has answered my question but I did learn some things from it that I had not considered.

This was the quote I resonated with :-)

"... the discoveries we highlight here only capture a small fraction of the mechanisms of the model."

It sometimes feels a bit like papers on cellular biology with DNA discussions in which descriptions of the enzymes and proteins involved are insightful but the mechanism that operates the reaction remains opaque.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: