I'd imagine many nested named capturing groups may trip even the best automated ...

janfoeh · 2025-03-07T15:54:54 1741362894

Thanks for sharing! I have to admit I do not have the necessary brain cycles to spare today, but OCR processing is indeed of interest to me, and I will take a more in-depth look in the coming days.

The idea of an exclusionary approach sounds interesting as well. I'll have to think about that a bit.

dleeftink · 2025-03-08T05:12:26 1741410746

Check out WordNinja in case regex doesn't cut it! [0]

[0]: https://github.com/keredson/wordninja

janfoeh · 2025-03-08T11:23:39 1741433019

Will do, thanks again!