Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wonder, as AI becomes more and more complex and incomprehensible, and the risk of loss of control becomes greater and greater, that the solution will simply be two adversarial AI's, one which generates, the other to detect deception/misalignment, and that at some point, when things have become so advanced beyond comprehension, we just have to trust the ying yang balance of good vs evil AI gods. /s


Program is that detector AI can be misaligned too, especially if it is at similar capability level as generator AI. https://www.youtube.com/watch?v=0pgEMWy70Qk




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: