I’m talking about pruning a local LLM not using their service. There are plenty of ways to prune and distill. Heck DeepSeek was distilled from other models. You could simply run a distillation using Hex, then convert those outputs back to the target language.