Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yeah, I figure this is also why it often says “Ah, I found the problem! Let me check the …”. It hasn’t found the problem, but it’s more likely to continue with the solution if you jam that string in there.


We don’t know how Claude code is internally implemented. I would not be surprised at all if they literally inject that string as an alternative context and then go with the higher probability output, or if RLHF was structured in that way and so it always generates the same text.


Very likely RLHF, based only on how strongly aligned open models repeatedly reference a "policy" despite there being none in the system prompt.

I would assume that priming the model to add these tokens ends up with better autocomplete as mentioned above.


Claude Code is a big pile of minified Typescript, and some people have effectively de-compiled it.


So how does it do it?


I haven't read this particular code, I did some analysis of various prompts it uses, I didn't hear about anything specific like this. Mostly wanted to say "it's at least possible to dig into it if you'd like," not that I had the answer directly.


Couldn’t you have claude itself de-minify it?


Maybe. It’s not something I have enough of an interest in to out the time into trying it out.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: