> and then extensive fine-tuning through human feedback
How extensive is the work involved in turning a model that's willing to talk about Tiananmen Square into one that isn't? What's involved in editing Llama to tell me how to make cocaine/bombs/etc.?
It's not so extensive as to require an army of subcontractors to provide large-scale human feedback.