
The thing is that since these models aren't actually doing reasoning and don't possess internal world models, you're always going to end up having to rely on your own understanding at some point. They can fill in more of the map with the things they can do, but they can't ever make it complete. There will always be cul-de-sacs they end up stuck in, messes they make, or mistakes they keep making, whether consistently or stochastically. So, although that's rather neat, I don't think it really changes my point.


I understand they don't have a logic engine built into them, i.e. no deduction, but I do think their inference is a weak form of reasoning, and I'm not sure about the world model part.

I suppose it depends on the definition of model.

I currently do consider the transformer weights to be a world model, but having a rigid one based on statistical distributions tends to create pretty wonky behavior at times.

That's why I do agree: relying on your own understanding of the code is the best way.

It's amazing seeing these things produce some beautiful functions and designs, then promptly forget they exist and begin writing incompatible, half re-implemented, non-idiomatic code.

If you're blind to what they are doing, it's just going to be layers upon layers of absolute dreck.

I don't think they will get out of cul-de-sacs without a true deductive engine and a core of hard, testable facts to build on. (I'm honestly a bit surprised that this behavior didn't emerge early in training.)

Though I think human minds are the same way in this respect, and fall for the same sorts of traps. At least our neurons can rewire themselves on the fly.

I know a LOT of people who only sparingly use their more advanced reasoning faculties, and instead primarily rely on vibes or pre-trained biases, even though I KNOW they are capable of better.


Good comment. I'm pretty much on the same page; my only disagreement is that transformers, if they are a world model, are a model of some sort of semiotic shadow world, not of an experiential, physically consistent world like ours, so they're not equipped to model our world.


Semiotic shadow world works for me. I think that's a great description.



