Wait, it's a 100M model. It's certainly very good they open sourced it, but its definitely not big considering we have 330B models. Perhaps it's big for this type of a model?
I for one would love to see a lot more highly capable small models that can be run on mobile devices and if on desktops, they don't need fiber to download.
I for one would love to see a lot more highly capable small models that can be run on mobile devices and if on desktops, they don't need fiber to download.