
For anyone wondering if they're fake: I'm extremely familiar with these systems and believe most of these screenshots. We found similar flaws with automated red teaming in our 2022 paper: https://arxiv.org/abs/2202.03286


I wonder if Google is gonna be ultra conservative with Bard, betting that this ultimately blows up in Microsoft's face.



