Anything other than their 671b model are just distilled models on top of Qwen and Llama using their 671b reasoning data output, right?
Anything other than their 671b model are just distilled models on top of Qwen and Llama using their 671b reasoning data output, right?