Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
10Gb/s Ethernet: what I did to get it working in my home (gilesthomas.com)
223 points by gpjt 2 days ago | past | 168 comments
10Gb Ethernet: what I had to (re)learn (gilesthomas.com)
2 points by ibobev 2 days ago | past | discuss
10Gb Ethernet: what I had to (re)learn (gilesthomas.com)
1 point by gpjt 3 days ago | past | 1 comment
LLM from scratch, part 33 – what I learned from the appendices (gilesthomas.com)
5 points by gpjt 9 days ago | past | discuss
LLM from scratch (32l) – Interventions: updated instruction fine-tuning results (gilesthomas.com)
1 point by gpjt 11 days ago | past | discuss
An LLM becomes more coherent as we train it (gilesthomas.com)
1 point by ibobev 13 days ago | past | discuss
How an LLM becomes more coherent as we train it (gilesthomas.com)
3 points by gpjt 14 days ago | past
LLM from scratch, part 32k – Interventions: gradient accumulation (gilesthomas.com)
2 points by gpjt 16 days ago | past
Interventions: Trying to train a better model in the cloud (gilesthomas.com)
1 point by ibobev 21 days ago | past
LLM from scratch, part 32j – trying to train a better model in the cloud (gilesthomas.com)
2 points by gpjt 22 days ago | past
Writing an LLM from scratch, part 32i – Interventions: what is in the noise? (gilesthomas.com)
2 points by ibobev 23 days ago | past
Writing an LLM from scratch, part 32i – Interventions: what is in the noise? (gilesthomas.com)
1 point by gpjt 24 days ago | past
Writing an LLM from scratch, part 32h – Interventions: full fat float32 (gilesthomas.com)
2 points by ibobev 25 days ago | past
Writing an LLM from scratch, part 32h – Interventions: full fat float32 (gilesthomas.com)
7 points by gpjt 28 days ago | past
Automating starting Lambda Labs instances (gilesthomas.com)
2 points by ibobev 28 days ago | past
Writing an LLM from scratch, part 32g – Interventions: weight tying (gilesthomas.com)
2 points by ibobev 37 days ago | past
Writing an LLM from scratch, part 32g – Interventions: weight tying (gilesthomas.com)
2 points by gpjt 38 days ago | past
Writing an LLM from scratch, part 32f – Interventions: weight decay (gilesthomas.com)
6 points by gpjt 39 days ago | past
Writing an LLM from scratch, part 32e – Interventions: the learning rate (gilesthomas.com)
3 points by ibobev 46 days ago | past
Writing an LLM from scratch, part 32e – Interventions: the learning rate (gilesthomas.com)
3 points by gpjt 52 days ago | past
Writing an LLM from scratch, part 32a – Interventions: training a baseline model (gilesthomas.com)
3 points by ibobev 81 days ago | past
Writing an LLM from scratch, part 32B – Interventions: gradient clipping (gilesthomas.com)
1 point by ibobev 81 days ago | past
Writing an LLM from scratch, part 32c – Interventions: removing dropout (gilesthomas.com)
1 point by ibobev 81 days ago | past
Writing an LLM from scratch, part 32d – Interventions: adding attention bias (gilesthomas.com)
1 point by ibobev 81 days ago | past
Writing an LLM from scratch, part 32d – Interventions: adding attention bias (gilesthomas.com)
6 points by gpjt 84 days ago | past
Writing an LLM from scratch, part 32c – Interventions: removing dropout (gilesthomas.com)
1 point by gpjt 85 days ago | past
Writing an LLM from scratch, part 32B – Interventions: gradient clipping (gilesthomas.com)
2 points by gpjt 86 days ago | past
Writing an LLM from scratch, part 32a – Interventions: training a baseline model (gilesthomas.com)
1 point by gpjt 87 days ago | past
Getting a Custom PyTorch LLM onto the Hugging Face Hub (gilesthomas.com)
1 point by ibobev 3 months ago | past
Getting a Custom PyTorch LLM onto the Hugging Face Hub (gilesthomas.com)
1 point by gpjt 3 months ago | past

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: