I was using one of the smaller models (7b), but I was able to bypass its interna...

rahimnathwani · 2025-01-31T20:28:44 1738355324

The model you were using was created by Qwen, and then finetuned for reasoning by Deepseek.

- Deepseek didn't design the model architecture

- Deepseek didn't collate most of the training data

- Deepseek isn't hosting the model

jscheel · 2025-02-01T12:49:52 1738414192

Yes, 100%. However, the distilled models are still pretty good at sticking to their approach to censorship. I would assume that the behavior comes from their reasoning patterns and fine tuning data, but I could be wrong. And yes, DeepSeek’s hosted model has additional guardrails evaluating the output. But those aren’t inherent to the model itself.

inglor_cz · 2025-01-31T20:56:01 1738356961

Poisoning the censorship machine by truth, that is poetic.