Hacker News new | past | comments | ask | show | jobs | submit login

How did you chose the 8192 token thinking budget? I've often seen Deepseek R1 use way more than that.



Arbitrary, and even with this budget it is already more verbose (and slower) overall than all the other thinking models - check tokens and latency in the table.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: