I think these scenarios are compatible if we view LLMs as "fragile reasoners": they can occasionally reason, but it is an intermittent state that is easily disturbed. In such a world, we would expect that people who want LLMs to work can, with some effort, make them work, and that people who want or expect LLMs to fail can make them fail easily - or, phrased less adversarially, one can generate examples of either outcome.
>> And yet the practitioners of CoT swear that any and every problem can be solved with LLM by giving it a bit of a CoT help.
For example, see this arXiv paper:
Generalized Planning in PDDL Domains with Pretrained Large Language Models
https://arxiv.org/abs/2305.11014
where the authors conclude:
In this work, we showed that GPT-4 with CoT summarization and automated debugging is a surprisingly strong generalized planner in PDDL domains.
The author of the tweet is an expert on planning, and he's responding to that kind of claim.
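For anyone who hasn't read the paper, here is a rough sketch of what "CoT summarization and automated debugging" amounts to as a pipeline: prompt the model to summarize the domain and propose a strategy, have it emit a planning program, run that program on training problems, and feed any failures back for another attempt. Everything in the sketch (the `query_llm` and `validate` callables, the prompts, the function names) is an illustrative placeholder of mine, not the authors' actual code.

```python
# Minimal sketch of a CoT-summarization + automated-debugging loop for
# generalized planning, in the spirit of the cited paper. All names and
# prompts are hypothetical placeholders.

from typing import Callable, List, Tuple


def synthesize_planner(
    query_llm: Callable[[str], str],          # wraps whatever LLM you use
    domain_pddl: str,                         # PDDL domain description
    train_problems: List[str],                # a few example PDDL problems
    validate: Callable[[str, str], Tuple[bool, str]],  # (plan, problem) -> (ok, error)
    max_debug_rounds: int = 4,
) -> str:
    """Ask the LLM for a domain summary, a strategy, and a planning program,
    then iteratively repair the program using validation feedback."""
    # Step 1: chain-of-thought style summarization of the domain.
    summary = query_llm(
        f"Summarize the objects, predicates and actions in this PDDL domain:\n{domain_pddl}"
    )
    # Step 2: ask for a general strategy before any code is written.
    strategy = query_llm(
        f"Given this summary:\n{summary}\nDescribe, step by step, a strategy "
        "that solves every problem in this domain."
    )
    # Step 3: ask for a program implementing the strategy.
    program = query_llm(
        "Implement the strategy below as a Python function plan(problem_pddl) "
        f"that returns a list of ground action strings.\nStrategy:\n{strategy}"
    )
    # Step 4: automated debugging - run the program on training problems and
    # feed any failure (exception or invalid plan) back to the model.
    for _ in range(max_debug_rounds):
        feedback = []
        for problem in train_problems:
            try:
                namespace: dict = {}
                exec(program, namespace)          # in practice, sandbox this
                plan = namespace["plan"](problem)
                ok, error = validate("\n".join(plan), problem)
                if not ok:
                    feedback.append(f"Invalid plan on problem:\n{problem}\n{error}")
            except Exception as exc:              # generated program crashed
                feedback.append(f"Exception on problem:\n{problem}\n{exc!r}")
        if not feedback:
            return program                        # passes all training problems
        program = query_llm(
            f"The program below failed. Fix it.\nProgram:\n{program}\n"
            f"Feedback:\n{feedback[0]}"
        )
    return program  # best effort once the debugging budget is spent
```

The point of the sketch is just to show how much scaffolding (validation, retries, error feedback) sits around the "LLM reasons about the domain" step in results like this.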