
Wait, so the trick is that they reach into the context and replace '</think>' with 'Wait', and that makes the model carry on thinking?



Not sure if your pun was intended, but 'wait' probably works so well because the models were trained on text structured like your comment, where 'wait' is followed by a deeper understanding.


Yes, that's explicitly mentioned in the blog post:

>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".
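A minimal sketch of what that intervention might look like with Hugging Face transformers. The model name, the literal "</think>" delimiter, and the retry budget here are illustrative assumptions, not the exact s1 setup:

    # Sketch of the "keep thinking" trick: each time the model emits the
    # end-of-thinking marker, splice in "Wait" and resume generation.
    # Model name and delimiter are placeholders, not the s1 configuration.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_name = "Qwen/Qwen2.5-0.5B-Instruct"  # placeholder model
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)

    text = "How many primes lie between 10 and 30? <think>"
    budget = 2  # how many times to force continued thinking

    for _ in range(budget):
        ids = tok(text, return_tensors="pt")
        out = model.generate(**ids, max_new_tokens=256)
        # Keep special tokens so "</think>" survives decoding if the
        # tokenizer treats it as one.
        text = tok.decode(out[0], skip_special_tokens=False)
        if "</think>" not in text:
            break  # the model is still thinking on its own
        # Drop everything after the marker and swap it for "Wait",
        # nudging the model to re-examine its reasoning so far.
        text = text.rsplit("</think>", 1)[0] + "Wait"

    # One final pass to let the model close its reasoning and answer.
    ids = tok(text, return_tensors="pt")
    out = model.generate(**ids, max_new_tokens=256)
    print(tok.decode(out[0], skip_special_tokens=True))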


Yes, that's one of the tricks.



