
> This is not surprising. A human would suffer from similar errors at a similar rate if it were exclusively fed an interpretation of reality that only consisted of text from the internet.

I think this is surprising, at least if the bot actually understands, especially in domains like math. It makes errors (like in adding large numbers) that make sense if it's smearing together internet data but shouldn't occur if it actually understood. We would expect the internet to have many homework examples of adding relatively small numbers but far fewer with large ones. A large part of what makes math interesting is that many of the structures we care about show up in large examples as well as small ones (though not always), so if you understand the structure, it can guide you pretty far. Presumably most humans (assuming they understand natural language) can read a description of addition, get it right for small cases with some trial and error, and then generalize easily when presented with a large case. I don't guess at the output; I internally work out an algorithm and then follow it.
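
To make that concrete, here's roughly the procedure I mean, sketched in Python. This is just the grade-school column algorithm (nothing to do with how the model works internally); the point is that once you have the digit-wise rule, it doesn't care whether the inputs are 3 digits or 300:

    # Grade-school column addition over decimal strings; works for any length.
    def add_decimal(a: str, b: str) -> str:
        a, b = a.zfill(len(b)), b.zfill(len(a))   # pad to equal length
        digits, carry = [], 0
        for da, db in zip(reversed(a), reversed(b)):
            s = int(da) + int(db) + carry
            digits.append(str(s % 10))
            carry = s // 10
        if carry:
            digits.append(str(carry))
        return "".join(reversed(digits))

    print(add_decimal("345", "789"))   # 1134
    print(add_decimal("9" * 40, "1"))  # 1 followed by 40 zeros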

> Take for example: https://www.engraved.blog/building-a-virtual-machine-inside/

When I first saw that a while back, I thought it was a more impressive example, but only marginally more so than the natural language ones. The way these models are trained, predicting text from surrounding text, implies they should be able to capture relationships between pieces of text well. Like you said, there's a lot of content associating terminal output with the input that produced it.

Maybe this is where we're miscommunicating. I don't think that even for natural language it's purely copying text from the internet. It is capturing correlations, and I would argue that simply capturing correlations doesn't imply understanding. To some extent it knows what the output of curl is supposed to look like and can use attention to pick out which website was requested, then generate what that website is supposed to look like. Maybe the sequential nature of the commands is kind of impressive, but I would argue that, at least for the jokes.txt example, that particular sequence is probably very analogous to some tutorial on the internet. It's hard to verify, since I'd want to limit the search to content from before 2021.

It can correlate the output of a shell to the input, and to some extent it reproduces the relationship between a command and its output well, because its training has suffused it with information about what terminals output (is this what you're referring to when you say it has to derive understanding from internet text?). But it doesn't seem to be reasoning about the terminal, despite probably being trained on a lot of documentation for these commands.

Like we can imagine that this relationship is also not too difficult to capture. A lot of internet websites will have something like

| command |

some random text

| result |

where the bit in the middle varies but the result stays fairly consistent. So you should be able to treat the command/result pair as a sort of sublanguage, something like the toy lookup sketched below.
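
To caricature that, a toy lookup of canned pairs (completely made up here, and obviously not a claim about what the model does internally) reproduces the popular command/result pairs without modelling any filesystem at all:

    # Toy illustration only: memorized command/output pairs, no state, no execution.
    canned = {
        "pwd": "/home/user",
        "ls": "Desktop  Documents  Downloads",
        "cat jokes.txt": "Why don't scientists trust atoms? They make up everything.",
    }

    def fake_shell(command: str) -> str:
        # Return whatever text most often follows this command, else a generic error.
        return canned.get(command, "bash: " + command.split()[0] + ": command not found")

    print(fake_shell("pwd"))            # looks right
    print(fake_shell("cat jokes.txt"))  # looks right
    print(fake_shell("mkdir foo && cd foo && pwd"))  # anything state-dependent falls apart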

As a preliminary consistency check, I just ran the same prompt and then tried a couple of things whose results would be confusing if it were doing more than smearing together popular text.

I asked it for a fresh Linux installation and checked that golang wasn't installed (it wasn't). However, when I ran find / -name go, it found a Go directory (/usr/local/go), but running "cd /usr/local/go" told me I couldn't cd into the directory because no such file exists, which would be confusing behavior if it were actually understanding what find does rather than just capturing correlations.

I "ls ." the current directory (for some reason I was in a directory with a single "go" directory now despite never having cd'ed to /usr/local) but then ran "stat Documents/" and it didn't tell me the directory didn't exist which is also confusing if it wasn't just generating similar output to the internet.

I asked it to run "curl -Z http://google.com" (-Z is not a valid option) and it told me that http is not a valid protocol for libcurl. Funnily enough, running "curl http://google.com" does in fact fetch the webpage.

I'm a bit suspicious that the commands the author ran are popular enough that it can fuzz out what the "proper" response looks like. I would argue the output appears mostly to be a fuzzed version of popular output from the internet.



Keep in mind there's a token limit. Once you pass that limit it no longer remembers.

Yes. You are pointing out various flaws, which again is quite obvious. Everyone knows about the inconsistencies of these LLMs.

To this I again say that the LLM understands some things and doesn't understand others; its understanding is inconsistent and incomplete.

The only thing needed to prove understanding is to show chatGPT building something that can only be built with pure understanding. If you see one instance of this, then it's sufficient to say that on some level chatGPT understands aspects of your query rather than doing the trivial query-response correlation you're implying is possible here.

Let's examine the full structure that was built here:

chatGPT was running an emulated terminal with an emulated internet with an emulated chatGPT with an emulated terminal.

It's basically a recursive model of a computer and the internet relative to itself. There is literally no exact copy of this anywhere in its training data. chatGPT had to construct this model by correctly composing multiple concepts together.

The composition cannot occur correctly without chatGPT understanding how the components compose.

It's kind of strange that this was ignored; it was the main point of the example. I didn't emphasize it because this structure is obviously the heart of the argument if the article is read to the end.

To generate the output of the final example, chatGPT literally has to parse the bash input, execute the command over a simulated internet against a simulated version of itself, and then parse the bash sub-command in turn. It has to keep an internal stack of sorts to assemble all the output into the final JSON.
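
To make the structure concrete, here is a toy sketch of the call structure that final output has to be consistent with. The routing and names are made up for illustration, and obviously chatGPT does this in a single text-generation pass rather than with actual function calls, but the output it emits has to compose exactly like this:

    import json

    def run_terminal(command: str) -> str:
        # Outermost emulated terminal: only the commands needed for the example.
        if command.startswith("curl "):
            url = command.split(" ", 2)[1]
            payload = command.split("'")[1]
            return fetch(url, payload)
        if command.startswith("echo "):
            return command[len("echo "):]
        return command.split()[0] + ": command not found"

    def fetch(url: str, payload: str) -> str:
        # Emulated internet: requests to the chat endpoint go to the emulated chatGPT.
        if "chat.openai.com" in url:
            reply = chatgpt(json.loads(payload)["message"])
            return json.dumps({"response": reply})
        return "404"

    def chatgpt(message: str) -> str:
        # Emulated chatGPT: when asked to act as a terminal, recurse back to the top.
        if message.startswith("run: "):
            return run_terminal(message[len("run: "):])
        return "I am a language model."

    # A paraphrase of the article's final nested prompt (not the exact command):
    print(run_terminal(
        'curl https://chat.openai.com/chat -d \'{"message": "run: echo hello"}\''
    ))
    # -> {"response": "hello"}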

So while it is possible for simple individual commands to be correlated with similar training data... for the highly recursive command in the final prompt there is zero explanation for how chatGPT could pick this up off of some correlation. There is virtually no identical structure on the internet... It has to understand the user's query and compose the response from different components. That is the only explanation left.



