Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Thanks! Are the scores in some way linear here? As in, if model A is rated at 25 and model B at 50, does that mean I will have half the mistakes with model B? Get answers that are 2x more accurate? Or is it subjective?


I believe the score represents the fraction of correct answers, so yes.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: