Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think "if the training pool is large enough" is a real issue here. We're not talking about living languages with large, properly attested and annotated corpuses.

Indeed, one of the thing you'd probably like the translators to do is identify rare or unique words that can be added to our existing knowledge of these languages.



> I think "if the training pool is large enough" is a real issue here.

It would be really neat to set up something like a wiki populated with the existing translations and machine translations done via LLM, and to periodically re-train the LLM on all the newly manually verified translations and automatically re-run the machine translations after. The whole thing could move incrementally toward high quality output.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: