> "Constitutional AI isn’t free energy; it’s not ethics module plugged back into the ethics module. It’s the intellectual-knowledge-of-ethics module plugged into the motivation module."
while 'what is ethical' is a broad, difficult, multifaceted question, applying the model's 'intellectual' world model (that it's built from everything it's read) to it's motivation/training reward at least doesn't seem to collapse the nuance of the question.
And for sure, if the model's 'world understanding' is limited when it comes to [constitutional principle x] that will impact/limit the extent to which it gets closer to behaving according to a nuanced understanding of [constitutional principle x].
> "Constitutional AI isn’t free energy; it’s not ethics module plugged back into the ethics module. It’s the intellectual-knowledge-of-ethics module plugged into the motivation module."
while 'what is ethical' is a broad, difficult, multifaceted question, applying the model's 'intellectual' world model (that it's built from everything it's read) to it's motivation/training reward at least doesn't seem to collapse the nuance of the question.
And for sure, if the model's 'world understanding' is limited when it comes to [constitutional principle x] that will impact/limit the extent to which it gets closer to behaving according to a nuanced understanding of [constitutional principle x].