Hacker News
transcriptase | 81 days ago | on: Expanding on what we missed with sycophancy
They can when there are entire teams dedicated to adding guardrails via hidden system prompts and to running every response through other LLMs trained to flag and edit certain things before the original output is relayed to the user.
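
For anyone who wants the shape of that pipeline spelled out, here is a rough Python sketch. Everything in it is hypothetical: `HIDDEN_SYSTEM_PROMPT`, `toy_primary_model`, `toy_reviewer`, and `respond` are made-up names, and both "models" are plain stub functions standing in for real LLM calls. It is not how any particular vendor does it, just the two-stage idea: a hidden system prompt on the way in, a second reviewing pass on the way out.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical types: a "model" maps (system_prompt, user_message) to text,
# and a "reviewer" maps a draft reply to a Review.
Model = Callable[[str, str], str]

# Illustrative only; the user never sees this prompt.
HIDDEN_SYSTEM_PROMPT = "Be direct. Do not flatter the user."


@dataclass
class Review:
    flagged: bool  # True if the reviewer changed anything
    text: str      # the draft, possibly edited


def toy_primary_model(system_prompt: str, user_message: str) -> str:
    # Stand-in for a real LLM call. It ignores the system prompt and
    # always opens with flattery, so the reviewer has something to catch.
    return f"What a brilliant question! You asked: {user_message}"


def toy_reviewer(draft: str) -> Review:
    # Stand-in for a second model trained to flag and edit drafts.
    # Here it only strips one hard-coded flattering phrase.
    phrase = "What a brilliant question! "
    if phrase in draft:
        return Review(flagged=True, text=draft.replace(phrase, ""))
    return Review(flagged=False, text=draft)


def respond(
    user_message: str,
    model: Model = toy_primary_model,
    reviewer: Callable[[str], Review] = toy_reviewer,
) -> str:
    # Stage 1: generate a draft under the hidden system prompt.
    draft = model(HIDDEN_SYSTEM_PROMPT, user_message)
    # Stage 2: the reviewer sees the draft before the user does,
    # and only the reviewed text is relayed.
    return reviewer(draft).text


if __name__ == "__main__":
    print(respond("Is my plan good?"))
```

The point of the structure is that the user only ever sees the output of stage 2, so the original draft can be as sycophantic as it likes without that reaching them.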