I find it interesting that there's all these independent AI-OCR Projects but sti...

Annatar01 · 2025-10-20T10:51:26 1760957486

I dont know, but maybe existing commercial OCR is still on top, and also using ML. Recently tried a free trial for OCR/reading Sütterlin and it was a weird feeling being so outclassed in reading.

rsolva · 2025-10-20T11:50:21 1760961021

Mistral offers their OCR commercially through their API and in their Chat services, at least.

https://mistral.ai/news/mistral-ocr

simlevesque · 2025-10-20T13:48:50 1760968130

https://cloud.google.com/document-ai

prats226 · 2025-10-20T20:09:37 1760990977

https://docstrange.nanonets.com/ as well, wrapper on top of 7B version of https://huggingface.co/nanonets/Nanonets-OCR2-3B

daemonologist · 2025-10-20T14:26:19 1760970379

There are commercial OCR offerings from the big cloud providers (plus, like, Adobe). In my experience they generally outperform anything open-weights, although there's been a lot of improvement in VLMs in the past year or two.

aleinin · 2025-10-20T15:58:55 1760975935

One that I’ve seen recently is https://reducto.ai It appears to be an OCR wrapper.

Eisenstein · 2025-10-20T10:54:55 1760957695

It is because the AI is not actually doing OCR. It is giving an interpretation of what the text in an image is by ingesting vision tokens and mapping them onto text tokens.

So you either have to be fine with a lot of uncertainty as to the accuracy of that interpretation or you have to wait for an LLM that can do it in a completely reproducible way every time.