>In s1, when the LLM tries to stop thinking with "</think>", they force it to keep going by replacing it with "Wait".