Yes, the current implementation of the repository converts any data primarily into strctured markdown text.
The next stage will involve prompt guides or schema-guided structure extraction.
Let's say you are processing a lot of research PDFs and want to convert them into clean markdown that best represents the content. Now, let's say you want to extract the authors, abstracts, captions, and store images.
The extraction engine we are currently working on will help you with that.
The next stage will involve prompt guides or schema-guided structure extraction.
Let's say you are processing a lot of research PDFs and want to convert them into clean markdown that best represents the content. Now, let's say you want to extract the authors, abstracts, captions, and store images.
The extraction engine we are currently working on will help you with that.