| | Understanding and Coding the KV Cache in LLMs from Scratch (sebastianraschka.com) |
|
6 points by sbbq 4 days ago | past | discuss
|
| | Understanding and Coding the KV Cache in LLMs from Scratch (sebastianraschka.com) |
|
2 points by tosh 4 days ago | past | discuss
|
| | Coding LLMs from the Ground Up: A Complete Course (sebastianraschka.com) |
|
4 points by sbbq 10 days ago | past | discuss
|
| | Coding LLMs from the Ground Up: A Complete Course (sebastianraschka.com) |
|
2 points by mdp2021 41 days ago | past
|
| | The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com) |
|
8 points by yaiml 58 days ago | past
|
| | The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com) |
|
9 points by jonbaer 62 days ago | past
|
| | The State of Reinforcement Learning for LLM Reasoning (sebastianraschka.com) |
|
4 points by mdp2021 63 days ago | past
|
| | The State of LLM Reasoning Models (sebastianraschka.com) |
|
2 points by Philpax 88 days ago | past
|
| | The State of Reasoning Models (sebastianraschka.com) |
|
4 points by sbbq 3 months ago | past
|
| | The State of LLM Reasoning Models Part 1: Inference-Time Compute Scaling Methods (sebastianraschka.com) |
|
3 points by yaiml 3 months ago | past
|
| | Understanding Reasoning LLMs (sebastianraschka.com) |
|
473 points by sebg 4 months ago | past | 188 comments
|
| | Understanding Reasoning LLMs (sebastianraschka.com) |
|
4 points by sbbq 4 months ago | past
|
| | Noteworthy LLM Research Papers of 2024 Megapost (sebastianraschka.com) |
|
5 points by yaiml 4 months ago | past
|
| | Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com) |
|
2 points by headalgorithm 5 months ago | past
|
| | Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com) |
|
4 points by sbbq 5 months ago | past
|
| | AI Research Recap 2024: From New Scaling Laws to Scaling Inference Compute (sebastianraschka.com) |
|
1 point by sbbq 5 months ago | past
|
| | Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com) |
|
1 point by birdculture 5 months ago | past
|
| | Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com) |
|
1 point by sbbq 5 months ago | past
|
| | Collection of 1k LLM Research Papers of 2024 (sebastianraschka.com) |
|
4 points by sbbq 5 months ago | past
|
| | LLM Research Papers: The 2024 List (sebastianraschka.com) |
|
5 points by ModelForge 6 months ago | past
|
| | LLM Research Papers: The 2024 List (sebastianraschka.com) |
|
1 point by mdp2021 6 months ago | past
|
| | Understanding Multimodal LLMs (sebastianraschka.com) |
|
2 points by lapnect 7 months ago | past
|
| | Understanding Multimodal LLMs: The Main Techniques and Latest Models (sebastianraschka.com) |
|
4 points by sbbq 7 months ago | past
|
| | Building a GPT-Style LLM Classifier from Scratch (sebastianraschka.com) |
|
2 points by mdp2021 9 months ago | past
|
| | Building LLMs from the Ground Up: A 3-Hour Coding Workshop (sebastianraschka.com) |
|
970 points by mdp2021 9 months ago | past | 136 comments
|
| | Show HN: New LLM Pre-Training and Post-Training Paradigms (sebastianraschka.com) |
|
2 points by rasbt 10 months ago | past
|
| | New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained (sebastianraschka.com) |
|
5 points by sbbq 10 months ago | past
|
| | Developing an LLM: Building, Training, Finetuning (sebastianraschka.com) |
|
1 point by Anon84 on June 13, 2024 | past
|
| | Understanding the LLM Development Cycle: Building, Training, Finetuning (sebastianraschka.com) |
|
3 points by rasbt on June 8, 2024 | past
|
| | The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (sebastianraschka.com) |
|
5 points by rasbt on May 12, 2024 | past
|
|
|
| More |