I see what you’re getting at but it does feel like goalposts are being moved, no? By and large we can ask a computer today a question and it will almost certainly spit back a sensible (!= correct) answer. We can ask what the words mean and ask it to translate it to other languages, and we can have a conversation.
How are we disentangling comprehension of natural language itself from comprehension of the subject matter being discussed via said language sample?
I think that by most reasonable metrics LLMs can reasonably be said to comprehend natural language itself. However they clearly are deficient in logic and reasoning, as well as comprehension of many of the concepts that the natural language is used to express.