Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's cheaper because you are unlikely to run your local AI at top capacity 24/7 so you have unused capacity which you are paying for.


The calculation shows it's cheaper even if you run local AI 24/7


They are specifically referring to usage of APIs where you just pay by the token, not by compute. In this case, you aren’t paying for capacity at all, just usage.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: