Hi,
I have some PDFs with two columns of text on each page. I am extracting the text using the Raw option, which is great because it extracts the text in reading order without trying to retain any columnar layout. It would be even better to convert the PDF to Markdown, preserving meaningful formatting such as headings and bullets. Unfortunately, the Markdown conversion mixes up lines from the adjacent columns. Is there any way to do a Markdown conversion in reading order, similar to the text Raw mode?
Thanks,
David R.