
You think they are caching? Even though temperature is one of the parameters? That's a can of worms, and if true it should be reflected in the pricing. Don't get me started if they are charging per token for cached responses.

I just don't see it.



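You can keep the KV cache from previous generations around, which significantly lowers the cost of processing prompts that share a prefix with earlier ones. That is not the same as caching responses: only the prompt's attention keys and values are reused, and sampling (temperature included) still runs fresh on every request.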
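To make that concrete, here is a toy sketch of prefix reuse. It is not how any particular provider implements it; `PrefixKVCache`, `prefill`, and `_compute_kv` are hypothetical names, and the "KV" values are placeholder numbers standing in for real attention key/value tensors. The only point is that a second prompt sharing a prefix with an earlier one pays for the new tokens only, and that temperature never enters the cache key.

```python
from typing import Dict, List, Tuple

KV = Tuple[float, float]  # placeholder for one token's (key, value) tensors


class PrefixKVCache:
    """Toy stand-in for a KV cache keyed by token prefix (hypothetical)."""

    def __init__(self) -> None:
        self._store: Dict[Tuple[int, ...], List[KV]] = {}
        self.tokens_computed = 0  # counts how much "prefill" work was done

    def _compute_kv(self, token: int, position: int) -> KV:
        # Placeholder for the real key/value projections. In a real model a
        # token's KV depends on everything before it, which is why reuse is
        # only valid for an exactly matching prefix.
        self.tokens_computed += 1
        return (token * 0.5 + position, token * 0.25 - position)

    def prefill(self, tokens: List[int]) -> List[KV]:
        # Find the longest previously seen prompt that is a prefix of this one.
        cached_len = 0
        for n in range(len(tokens), 0, -1):
            if tuple(tokens[:n]) in self._store:
                cached_len = n
                break

        # Reuse the cached entries, then compute only the remaining tokens.
        kv = list(self._store.get(tuple(tokens[:cached_len]), []))
        for pos in range(cached_len, len(tokens)):
            kv.append(self._compute_kv(tokens[pos], pos))

        self._store[tuple(tokens)] = list(kv)
        return kv


if __name__ == "__main__":
    cache = PrefixKVCache()
    cache.prefill([1, 2, 3, 4])        # cold: all four tokens computed
    cache.prefill([1, 2, 3, 4, 5, 6])  # warm: only tokens 5 and 6 computed
    print(cache.tokens_computed)
```

Sampling happens after this step, on the logits, so two requests with the same prompt and different temperatures can share the cached prefix and still produce different completions.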



