Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The tokenizer in llama.cpp probably needs fixing then or it has some other bug.


Definitely. I tried gemma2:27B model with phrases like "translate the following sentence to language X" and it even failed to understand the task and spat out completely irrelevant things, like math formulas.

OTOH, smaller model did it perfectly.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: