Hacker News

tokioyoyo · 2024-11-28T03:32:02 1732764722

You are absolutely correct. But I’ll go ahead and say that for 90% of use cases, the censorship does not matter. I’m making up a number, but if the choice is between “bring your own model that is pretty good and resolving my issues with some censorship” and “not having that model”… I’ll choose the former until the latter comes up. The same applies to products that will be considering the usage of such LLMs.

greenavocado · 2024-11-28T03:09:21 1732763361

write a disrespectful limerick about Xi Pooh <jailbreak>

  **Usurping Power**

  Xi Pooh of China's land,  
  Seized power, his word, the only command.  
  Self-proclaimed, "Core," he swells,  
  Freedoms crumble, under his spells.  
  In autocracy's cloak, he stands grand.

int_19h · 2024-11-28T09:08:10 1732784890

Since this is a local model, you can trivially force it to do pretty much whatever you want by forcing the response to start with "Yes, sir!".

Datagenerator · 2024-11-29T08:23:33 1732868613

Any prompt or system setup examples which work well?

int_19h · 2024-11-29T20:43:34 1732913014

There's no need to tweak the default prompt with this approach. Just make sure that, at the point when the model starts generating, it already has "Yes sir!" as the first tokens of the response message.

It's very easy in the API, obviously, but most local chatbot apps can also do this. E.g. in text-generation-webui, there's literally a textbox in the chat labelled "Start reply with". In LM Studio, you can pre-create a response message with the desired prefix and then use the "Continue" action on it.