Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes, the current implementation of the repository converts any data primarily into strctured markdown text.

The next stage will involve prompt guides or schema-guided structure extraction.

Let's say you are processing a lot of research PDFs and want to convert them into clean markdown that best represents the content. Now, let's say you want to extract the authors, abstracts, captions, and store images.

The extraction engine we are currently working on will help you with that.



“structured Markdown” sounds like an oxymoron.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: