Hacker News
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by sbbq on April 2, 2024 | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by tosh on April 1, 2024 | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
1 point by Anon84 on March 31, 2024 | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by rasbt on March 31, 2024 | past
AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (sebastianraschka.com)
3 points by rasbt on March 3, 2024 | past
Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (sebastianraschka.com)
96 points by rasbt on Feb 18, 2024 | past | 10 comments
AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (sebastianraschka.com)
20 points by rasbt on Feb 3, 2024 | past
Naive Bayes and Text Classification I – Introduction and Theory (2014) (sebastianraschka.com)
2 points by vikrum on Jan 22, 2024 | past
Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention (sebastianraschka.com)
142 points by rasbt on Jan 14, 2024 | past | 11 comments
Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
128 points by danboarder on Jan 6, 2024 | past | 19 comments
Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
3 points by rasbt on Jan 1, 2024 | past
Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
9 points by lucasus on Dec 30, 2023 | past
Research Papers in November 2023 (sebastianraschka.com)
1 point by Anon84 on Dec 10, 2023 | past
AI Research Papers in November 2023: hallucinations and reasoning capabilities (sebastianraschka.com)
5 points by rasbt on Dec 9, 2023 | past
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) (sebastianraschka.com)
342 points by rasbt on Nov 19, 2023 | past | 27 comments
Why would a famous former university ML professor make his posts paywalled? (sebastianraschka.com)
7 points by behnamoh on Nov 6, 2023 | past | 1 comment
AI and Open Source in 2023 (sebastianraschka.com)
123 points by belter on Nov 4, 2023 | past | 67 comments
AI Research Papers (October 2023) (sebastianraschka.com)
5 points by rasbt on Nov 4, 2023 | past
AI and Open Source in 2023: A Review of the Year's Highs and Lows (sebastianraschka.com)
2 points by rasbt on Oct 23, 2023 | past
AI chips, acquisitions, new "small" open-source LLMs, and new LoRA techniques (sebastianraschka.com)
5 points by rasbt on Oct 9, 2023 | past
AI news editorial from custom AI chips to new "small" LLMs like phi and Mistral (sebastianraschka.com)
1 point by rasbt on Oct 8, 2023 | past
AI research papers summaries and highlights (Aug to Sep) (sebastianraschka.com)
3 points by rasbt on Sept 24, 2023 | past
Optimizing LLMs from a Dataset Perspective (sebastianraschka.com)
138 points by alexmolas on Sept 15, 2023 | past | 24 comments
PyTorch: Cross-Entropy vs. Negative Log Likelihood (sebastianraschka.com)
2 points by auraham on Sept 12, 2023 | past
Training and aligning LLMs with RLHF and RLHF alternatives (sebastianraschka.com)
102 points by rasbt on Sept 10, 2023 | past | 14 comments
Understanding Llama 2 and the New Code Llama LLMs (sebastianraschka.com)
170 points by rasbt on Aug 30, 2023 | past | 34 comments
Llama 2, CodeLlama, and GPT-4 performance: recent LLM developments and research (sebastianraschka.com)
1 point by rasbt on Aug 27, 2023 | past
AI Research Highlights in 3 Sentences or Less (July-August 2023) (sebastianraschka.com)
1 point by rasbt on Aug 12, 2023 | past
Does it beat LLMs? NN+Gzip method reimplemented and explained step-by-step (sebastianraschka.com)
3 points by rasbt on July 30, 2023 | past
State of Computer Vision 2023 (sebastianraschka.com)
1 point by eugenOrl on July 24, 2023 | past
