

SubiculumCode 5 months ago | on: Trustworthy AI Without Trusted Data

I wonder whether, as AI becomes more and more complex and incomprehensible, and the risk of loss of control becomes greater and greater, the solution will simply be two adversarial AIs, one that generates and another that detects deception/misalignment, and whether at some point, when things have advanced beyond comprehension, we will just have to trust the yin-yang balance of good vs. evil AI gods. /s

tintor 5 months ago

The problem is that the detector AI can be misaligned too, especially if it is at a similar capability level to the generator AI. https://www.youtube.com/watch?v=0pgEMWy70Qk
