Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

We already have really strong models that run on a consumer GPU, and really strong frameworks and libraries to support them.

The issue is (1) the extra size supports extra knowledge/abilities for the model. (2) a lot of the open source models are trained in a way to not compete with the paid offerings, or lack the data set of useful models.

Specifically, it seems like the tool-use heavy “agentic” work is not being pushed to open models as aggressively as the big closed models. Presumably because that’s where the money is.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: