
I wonder if this will survive distillation. I vaguely recall that most open models answer "I am ChatGPT" when asked who they are, since they're heavily trained on OpenAI outputs. If the version of ChatGPT used to generate the training data had a watermark, a sufficiently powerful function approximator would just learn the watermark.


Are you expecting a distilled model to be sufficiently powerful to capture the watermark? I wouldn’t.

Additionally, I don’t think the watermark has to be deterministic.
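For what it's worth, a non-deterministic watermark usually means something statistical: bias sampling toward a pseudorandom "green" subset of the vocabulary at each step, then detect by counting how many green tokens show up. Here's a toy sketch in that spirit (loosely after the Kirchenbauer et al. green-list scheme); the hash, vocabulary, candidate count, and thresholds are all made up for illustration and aren't whatever scheme the article actually describes.

  import hashlib
  import math
  import random

  GAMMA = 0.5  # fraction of the vocabulary treated as "green" at each step

  def is_green(prev_token: str, token: str) -> bool:
      """Pseudorandomly partition the vocabulary, seeded by the previous token."""
      h = hashlib.sha256(f"{prev_token}|{token}".encode()).digest()
      return h[0] < 256 * GAMMA

  def detect(tokens: list[str]) -> float:
      """Z-score: how far the green-token count deviates from chance."""
      n = len(tokens) - 1
      if n <= 0:
          return 0.0
      green = sum(is_green(p, t) for p, t in zip(tokens, tokens[1:]))
      expected = GAMMA * n
      std = math.sqrt(n * GAMMA * (1 - GAMMA))
      return (green - expected) / std

  random.seed(0)
  vocab = [f"tok{i}" for i in range(1000)]

  # Unwatermarked: uniform sampling, z-score should land near 0.
  plain = [random.choice(vocab) for _ in range(500)]

  # "Watermarked": prefer a green candidate at each step, z-score should be large.
  marked = [random.choice(vocab)]
  for _ in range(499):
      candidates = [random.choice(vocab) for _ in range(8)]
      greens = [c for c in candidates if is_green(marked[-1], c)]
      marked.append(greens[0] if greens else candidates[0])

  print(detect(plain), detect(marked))

The detector never needs to reproduce the exact sampling decisions, only the green/red partition, which is why a distilled model that merely imitates the surface text can still carry (or dilute) the statistical signal.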



