This is a wicked cool site, but you need to put in screenshots of the input (how...

dpapathanasiou · on Jan 28, 2011

I'm working on an FAQ/Help page which will show some of those features in more detail.

The algorithm I use is a variation of the code described here: http://denis.papathanasiou.org/?p=343 except that the output is HTML, not text, so that I can take into account things like font sizes and paragraph breaks.
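
To give a rough idea of why font sizes and paragraph breaks matter, here is a minimal sketch. It is not the site's actual parser. It assumes each page has already been extracted into a list of (text, font_size, gap_above) tuples; that input shape, the function name `lines_to_html`, and both thresholds are made up for illustration.

```python
from html import escape
from statistics import median


def lines_to_html(lines, heading_ratio=1.2, gap_ratio=1.5):
    """Turn one page of extracted lines into simple HTML.

    lines: list of (text, font_size, gap_above) tuples. This shape is a
    hypothetical stand-in for whatever a real PDF extractor reports.
    """
    if not lines:
        return ""

    # Treat the median font size and line gap as the "body text" baseline.
    body_size = median(size for _, size, _ in lines)
    body_gap = median(gap for _, _, gap in lines)

    parts = []
    paragraph = []

    def flush():
        # Emit the paragraph collected so far, if there is one.
        if paragraph:
            parts.append("<p>" + escape(" ".join(paragraph)) + "</p>")
            paragraph.clear()

    for text, size, gap in lines:
        if size >= body_size * heading_ratio:
            # Noticeably larger type: assume it is a heading.
            flush()
            parts.append("<h2>" + escape(text) + "</h2>")
            continue
        if gap > body_gap * gap_ratio:
            # Extra vertical space: assume a paragraph break.
            flush()
        paragraph.append(text)

    flush()
    return "\n".join(parts)


page = [
    ("Chapter 1", 18, 0),
    ("It was a dark", 11, 20),
    ("and stormy night.", 11, 13),
    ("The rain fell", 11, 26),
    ("in torrents.", 11, 13),
]
print(lines_to_html(page))
```

With plain text output the heading and the paragraph break in that sample would be lost; keeping them as HTML is what makes the later book layout possible.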

If you sign up and try it (it's free for the first 3 days), you'll see that the parser renders each PDF page as text, and it's up to you to decide which range of pages you want to use in your book.

Feel free to contact me through the form on that site, and I can reply in more detail.