These super large models seem better at keeping track of unsaid nuance. I wonder...

		hnuser123456 8 months ago \| parent \| context \| favorite \| on: Hot take: GPT 4.5 is a nothing burger These super large models seem better at keeping track of unsaid nuance. I wonder if that can still be distilled into smaller models or if there is a minimum size for a minimum level of nuance even given infinite training.