Submissions from sebastianraschka.com

		Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
		2 points by sbbq on April 2, 2024 \| past
		Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
		2 points by tosh on April 1, 2024 \| past
		Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
		1 point by Anon84 on March 31, 2024 \| past
		Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
		2 points by rasbt on March 31, 2024 \| past
		AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (sebastianraschka.com)
		3 points by rasbt on March 3, 2024 \| past
		Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (sebastianraschka.com)
		96 points by rasbt on Feb 18, 2024 \| past \| 10 comments
		AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (sebastianraschka.com)
		20 points by rasbt on Feb 3, 2024 \| past
		Naive Bayes and Text Classification I – Introduction and Theory (2014) (sebastianraschka.com)
		2 points by vikrum on Jan 22, 2024 \| past
		Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention (sebastianraschka.com)
		142 points by rasbt on Jan 14, 2024 \| past \| 11 comments
		Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
		128 points by danboarder on Jan 6, 2024 \| past \| 19 comments
		Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
		3 points by rasbt on Jan 1, 2024 \| past
		Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
		9 points by lucasus on Dec 30, 2023 \| past
		Research Papers in November 2023 (sebastianraschka.com)
		1 point by Anon84 on Dec 10, 2023 \| past
		AI Research Papers in November 2023: hallucinations and reasoning capabilities (sebastianraschka.com)
		5 points by rasbt on Dec 9, 2023 \| past
		Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) (sebastianraschka.com)
		342 points by rasbt on Nov 19, 2023 \| past \| 27 comments
		Why would a famous former university ML professor make his posts paywalled? (sebastianraschka.com)
		7 points by behnamoh on Nov 6, 2023 \| past \| 1 comment
		AI and Open Source in 2023 (sebastianraschka.com)
		123 points by belter on Nov 4, 2023 \| past \| 67 comments
		AI Research Papers (October 2023) (sebastianraschka.com)
		5 points by rasbt on Nov 4, 2023 \| past
		AI and Open Source in 2023: A Review of the Year's Highs and Lows (sebastianraschka.com)
		2 points by rasbt on Oct 23, 2023 \| past
		AI chips, acquisitions, new "small" open-source LLMs, and new LoRA techniques (sebastianraschka.com)
		5 points by rasbt on Oct 9, 2023 \| past
		AI news editorial from custom AI chips to new "small" LLMs like phi and Mistral (sebastianraschka.com)
		1 point by rasbt on Oct 8, 2023 \| past
		AI research papers summaries and highlights (Aug to Sep) (sebastianraschka.com)
		3 points by rasbt on Sept 24, 2023 \| past
		Optimizing LLMs from a Dataset Perspective (sebastianraschka.com)
		138 points by alexmolas on Sept 15, 2023 \| past \| 24 comments
		PyTorch: Cross-Entropy vs. Negative Log Likelihood (sebastianraschka.com)
		2 points by auraham on Sept 12, 2023 \| past
		Training and aligning LLMs with RLHF and RLHF alternatives (sebastianraschka.com)
		102 points by rasbt on Sept 10, 2023 \| past \| 14 comments
		Understanding Llama 2 and the New Code Llama LLMs (sebastianraschka.com)
		170 points by rasbt on Aug 30, 2023 \| past \| 34 comments
		Llama 2, CodeLlama, and GPT-4 performance: recent LLM developments and research (sebastianraschka.com)
		1 point by rasbt on Aug 27, 2023 \| past
		AI Research Highlights in 3 Sentences or Less (July-August 2023) (sebastianraschka.com)
		1 point by rasbt on Aug 12, 2023 \| past
		Does it beat LLMs? NN+Gzip method reimplemented and explained step-by-step (sebastianraschka.com)
		3 points by rasbt on July 30, 2023 \| past
		State of Computer Vision 2023 (sebastianraschka.com)
		1 point by eugenOrl on July 24, 2023 \| past
		More