Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There’s a huge difference both in capabilities and in meaning between “variations of r1” and “r1 distill”. ollama is intentionally misleading people on this but the distills are much much worse


They're really not? Both subjectively and in benchmarks there is no world in which the delta between the models deserves a "much much".




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: