To be 100% honest it's been a while since I looked into libraries for it, so I couldn't say.
Your second comment rings true, and in my opinion, we are there. Highly recommend throwing some PDFs at AWS Textract and checking out the quality, it wasn't there a few years ago, can safely state it's there now though. I threw stuff at it that previously would just spit out trash, and it handled it fairly well, specifically for table data extraction (I was looking at public stock market quarterly reports).
Cost is the kicker for me, 1000 pages for $15, adds up fairly quickly at any sort of scale!
Your second comment rings true, and in my opinion, we are there. Highly recommend throwing some PDFs at AWS Textract and checking out the quality, it wasn't there a few years ago, can safely state it's there now though. I threw stuff at it that previously would just spit out trash, and it handled it fairly well, specifically for table data extraction (I was looking at public stock market quarterly reports).
Cost is the kicker for me, 1000 pages for $15, adds up fairly quickly at any sort of scale!