I'd ask that you consider local alternatives, or at the very least make it opt-in. It would be very annoying to have some awesome open-source document manager send off my documents to the cloud.
There are some surprisingly powerful OCR libraries you could run locally, Tesseract.js not the least of them.
Cheers, and awesome project!
http://tesseract.projectnaptha.com/