Hacker News

> using a language model on top of character level OCR

But if you know you're going to run a language model after the OCR, then you don't OCR to a single character, but rather to a distribution over candidate characters ranked by similarity (e.g. keeping only the most similar candidates, or clipping at a certain similarity threshold). Then the language model has more to work with (although TBH its job becomes more complicated).
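
To make that concrete, here's a rough sketch (not any particular OCR engine's API). Everything in it is made up for illustration: the per-position similarity scores, the 0.08 clipping threshold, and the "language model", which is just a toy table of word priors. The point is only that the top-1 character per position can give one word while OCR distribution plus language prior gives another.

```python
import math

def clip(dist, threshold):
    """Drop candidates scoring below the threshold, then renormalize."""
    kept = {ch: p for ch, p in dist.items() if p >= threshold}
    total = sum(kept.values())
    return {ch: p / total for ch, p in kept.items()}

def greedy(dists):
    """Plain character-level OCR: take the single best character per position."""
    return "".join(max(d, key=d.get) for d in dists)

def rescore(dists, word_prior):
    """Pick the word maximizing log P(word) + sum of log P_ocr(char at position).

    Returns None if no word in word_prior fits the surviving candidates.
    """
    best_word, best_score = None, -math.inf
    for word, prior in word_prior.items():
        if len(word) != len(dists):
            continue
        score = math.log(prior)
        for ch, d in zip(word, dists):
            p = d.get(ch, 0.0)
            if p == 0.0:  # character was clipped away or never proposed
                score = -math.inf
                break
            score += math.log(p)
        if score > best_score:
            best_word, best_score = word, score
    return best_word

# Hypothetical OCR output for a three-glyph word (made-up numbers).
ocr_candidates = [
    {"c": 0.50, "e": 0.45, "o": 0.05},
    {"a": 0.45, "o": 0.55},
    {"t": 0.90, "l": 0.10},
]

# Toy stand-in for a language model: prior probability of each word.
word_prior = {"cat": 0.60, "eat": 0.30, "cot": 0.05, "col": 0.05}

dists = [clip(d, threshold=0.08) for d in ocr_candidates]

print(greedy(dists))               # cot
print(rescore(dists, word_prior))  # cat
```

A real system would swap the word table for an actual language model and search with beam search or Viterbi rather than enumerating words, which is where the extra complexity comes in.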



