Hacker News: submissions from apolloresearch.ai
Scheming Reasoning Evaluations (apolloresearch.ai)
2 points by matthberg 7 days ago | past | discuss
Towards Safety Cases for AI Scheming (apolloresearch.ai)
2 points by doener 5 months ago | past
Scheming Reasoning Evaluations (apolloresearch.ai)
2 points by cglong 5 months ago | past | 1 comment
An evaluation of frontier AI models: OpenAI's o1 was capable of scheming (apolloresearch.ai)
1 point by seraphsf 5 months ago | past | 1 comment
Scheming reasoning evaluations – o1 results (apolloresearch.ai)
4 points by amrrs 5 months ago | past | 1 comment
The Evals Gap (apolloresearch.ai)
4 points by sundarurfriend 6 months ago | past
Research on strategic deception presented at the UK's AI Safety Summit (apolloresearch.ai)
2 points by ek750 on Nov 4, 2023 | past