Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They're not 3.5 models, which are 175B param, they're 220B and the idea is the fine tuning is different for each which is where the 'expert' comes in.


Ah, that makes sense, thanks for clarifying




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: