Yeah, I figure this is also why it often says “Ah, I found the problem! Let me c...

adastra22 · 2025-09-05T16:36:07 1757090167

We don’t know how Claude code is internally implemented. I would not be surprised at all if they literally inject that string as an alternative context and then go with the higher probability output, or if RLHF was structured in that way and so it always generates the same text.

data-ottawa · 2025-09-05T18:02:27 1757095347

Very likely RLHF, based only on how strongly aligned open models repeatedly reference a "policy" despite there being none in the system prompt.

I would assume that priming the model to add these tokens ends up with better autocomplete as mentioned above.

steveklabnik · 2025-09-05T18:52:33 1757098353

Claude Code is a big pile of minified Typescript, and some people have effectively de-compiled it.

sejje · 2025-09-05T19:30:47 1757100647

So how does it do it?

steveklabnik · 2025-09-05T19:37:39 1757101059

I haven't read this particular code, I did some analysis of various prompts it uses, I didn't hear about anything specific like this. Mostly wanted to say "it's at least possible to dig into it if you'd like," not that I had the answer directly.

Aeolun · 2025-09-06T00:22:20 1757118140

Couldn’t you have claude itself de-minify it?

steveklabnik · 2025-09-06T15:05:36 1757171136

Maybe. It’s not something I have enough of an interest in to out the time into trying it out.