Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
AvAn12
7 months ago
|
parent
|
context
|
favorite
| on:
Emergent Misalignment: Narrow Finetuning Can Produ...
Is the opposite testable? Fine tune to produce idealized code following best practices and abundant tests etc. Does this lead to highly ethical responses to general prompts? And are their other dimensions in addition to good-vs-malicious code?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: