Hacker News new | past | comments | ask | show | jobs | submit login

Yeah, the license thing is definitely a problem. It's hard to get excited about an academic research license for a 3B or 8B model when the Llama 3.1 and 3.2 models are SO good, and are licensed for commercial usage.



to be clear- these ministal model are also licensed for commercial use, but not freely licensed for commercial use. and meta also has restrictions on commercial use (have to put “Built with Meta Llama 3” and need to pay meta if you exceed 700 million monthly users)


You need to pay meta if you have 700 million users as of the Llama 3 release date. Not at any time going forward.


... or presumably if you build a successful company and then try to sell that company to Apple, Microsoft, Google or a few other huge companies.


> need to pay meta if you exceed 700 million monthly users

Seems like a good problem to have


Qwen 2.5 models are better than Llama and Mistral.


I disagree. I tried the small ones but they too frequently output Chinese when the prompt is English.


I never had this problem but i guess it depends on the prompt.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: