Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Do you mean that there is no correlation between confidence and false positives or other errors?


elzbardico is pointing out how the author is having the confidence value generated in the output of the response rather than it being the confidence of the output.


Is there research solid knowledge on this?


this trick is being used by many apps (including Github copilot reviews). The way I see it, is that if the agent has an eager-to-please problem, then you give it a way out


Thanks. I was talking about the confidence measure.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: