Maybe they need to evoke a sort of sleep so they can clear these out while dreaming, sorta like if humans don’t sleep enough hallucination start penetrating waking life…
Not at all, but I think a lot of these companies have something in place which is roughly equivalent to a budget of resources they are willing to put towards processing your requests in a given time frame (independently of context windows) that artificially acts that way.
I can get a couple of hours of good responses out of Gemini (with a fixed price monthly payment) working on a project per day before quality takes a serious nosedive.