
pelorat on Oct 27, 2024 | on: ZombAIs: From Prompt Injection to C2 with Claude C...

> I've seen very little convincing discussion about what to do about this problem.

I think we will need adversarial AI agents whose task is to monitor other agents for anything suspicious. Every input and output would be scrutinized and either approved or rejected.

MattPalmer1086 on Oct 27, 2024

They will also be vulnerable to the same attack though.

kchr on Oct 27, 2024

It's AI agents all the way down
