Hacker News

I asked ChatGPT, Claude, Gemini and DeepSeek what AE and OE mean in "Harman AE OE 2018 curve". All of them made up complete bullshit, even for OE (Over Ear). AE is Around Ear. The meaning of OE is absurdly easy to find with even the most basic search skills; it is in fact the fourth result on Google.

The problem with LLMs isn't that they can't do great stuff: it's that you can't trust them to do it consistently. Which means you have to verify what they do, which means you need domain knowledge.

Until the next big evolution in LLMs or a revolution from something else, we'll be alright.



Both Gemini 2.5 Flash and the small built-in model in Kagi's search got this right on the first try.


That is my point though. Gemini got it wrong for me. Which means it is inconsistent.

Say you and I ask Gemini what the perfect internal temperature for a medium-rare steak is. It tells me 72°C, and it tells you 55°C.

Even if it tells 990 people 55°C and only 10 people 72°C, with tens to hundreds of millions of users that is still a gargantuan number of ruined steaks.
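To put rough numbers on that, here is a back-of-the-envelope sketch. It assumes a wrong-answer rate of 10 in 1,000 and that every user asks the question exactly once; the function name and the user counts are just illustrative picks for "tens to hundreds of millions", not real usage figures.

```python
def expected_wrong_answers(users: int, wrong: int = 10, sample: int = 1000) -> int:
    """Expected number of users who get the wrong answer.

    Assumes each user asks once and the wrong-answer rate is wrong/sample
    (10 in 1,000 here, an illustrative figure).
    """
    return users * wrong // sample


# Hypothetical user counts covering "tens to hundreds of millions".
for users in (10_000_000, 100_000_000):
    print(f"{users:,} users -> {expected_wrong_answers(users):,} wrong answers")
```

Under those assumptions, a 1% failure rate works out to somewhere between a hundred thousand and a million overcooked steaks.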


I know what you're saying. I guess it depends on the use case and the context. It's pretty much like asking someone off the street something random: ask about an apple, and some will say a computer and others a fruit.

But you're right though.



