Hacker News

I've been working on some structured OCR tools recently (reading resume PDFs so we can offer much more useful search filters over them than our ATS allows), and I've found that Gemini with structured outputs does a fantastic job. I'm curious: do you have any rough pointers on how to do this self-hosted?

