Hacker News new | past | comments | ask | show | jobs | submit login

There was a guy who followed a tutorial about how to fine tune mistral with DPO, who has zero computer science skills and his model ended up at the top of the hugging face leader board among the opensource models with 7 billion parameters. Some random guy managed to outdo the creators of the LLM.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: