ChatGPT just told me to put the turkey in my toaster oven legs facing the door, and you think it can replace school. Unless there is a massive architectural change that can be provably verified by third parties, this can never be. I’d hate for my unschooled surgeon to check an llm while I’m under.
I see, I overlooked the 'toaster' part. That's a good world model benchmark question for models and a good reading comprehension question for humans. :-P
GPT 5.1 Pro made the same mistake ("Face the legs away from the door.") Claude Sonnet 4.5 agreed but added "Note: Most toaster ovens max out around 10-12 pounds for a whole turkey."
Gemini 3 acknowledged that toaster ovens are usually very compact and that the legs shouldn't be positioned where they will touch the glass door. When challenged, it hand-waved something to the effect of "Well, some toaster ovens are large countertop convection units that can hold up to a 12-pound turkey." When asked for a brand and model number of such an oven, it backtracked and admitted that no toaster oven would be large enough.
Changing the prompt to explicitly specify a 12-pound turkey yielded good answers ("A 12-pound turkey won't fit in a toaster oven - most max out at 4-6 pounds for poultry. Attempting this would be a fire hazard and result in dangerously uneven cooking," from Sonnet.)
Don't worry, someone will put another hack on top the model to teach it to handle this specific case better. That will totally fix the problem, right? Right?
A trained professional making their best guess is far more capable and trustworthy than the slop LLMs put out. So yeah, winging it is a good alternative here.
Yeah, when I started I actually had no intention of doing a chat app, but it was a good way to learn how it works. Eventual goal with this is a sort of hybrid notes app. Not an llm bolted into Notes, either, but kind of a mix. Find myself using the llm app as a knowledge repo and it’s just not built for that. I’ll post a TestFlight link later if you want to try it out.
My money is on a larger iPad (think Microsoft Studio) that takes the place of an iMac. Likely 15” at first, larger later. I really enjoy a keyboard and trackpad with an iPad and an external monitor. This would be my only machine if I could run Xcode/Terminal on it.
There's some hyperbole in what Bezmenov claims will be the consequences, but psyops and information warfare are very much real, and have been in use for decades. If he defected from the USSR and worked as a propagandist, he would be very familiar with what these tools are capable of. And if you take a look at the current state of many Western countries, it aligns well with the effects of psyops. Whether it was done by internal or external forces doesn't matter much, but it would be naive to think enemies of the West aren't engaging in it, just as the West is[1].
There's a very interesting read on the USMC University website that covers political warfare (which this would arguably fall into). Its freely available.
reply