I've been working on some structured OCR tools recently (in the context of reading resume pdfs and allowing much more useful search filters over them than our ATS system allows) and I've found Gemini with structured outputs capable of doing a fantastic job. I'm curious, do you have any rough pointers for how to do this self-hosted?