Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

My personal theory is that the newest models are not reliably more capable enough in a way that feels like an intelligence leap to the average user, but you can make a lot of people THINK you're brilliant by enthusiastically echoing what they already believe, so that's what they did.


I wonder if this is similar to how that experiment[0] where they attempted to domesticate foxes led to traits like floppy ears.

Which is to say, even when attempting to objectively select for "well aligned" behavior, human tendency to favor non-material signals of "friendliness" still leaks in.

[0] https://en.m.wikipedia.org/wiki/Domesticated_silver_fox




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: