How to format two column OCR PDF into one column

0

I am having trouble reading a PDF on my Kindle because the OCR text is displayed in two columns per page. (IE The book was scanned two pages at a time, them OCR'd.) How can I format this into one column (or any style for that matter) so that I can read it on my Kindle?

grossmae

Posted 2012-05-12T03:43:59.727

Reputation: 137

Answers

5

The latest version of k2pdfopt can either keep the OCR with native PDF output option (-mode 2col) or restore the OCR using its own OCR. Also, running the k2pdfopt output through Calibre, as some have suggested, is not necessary if you use the right settings in k2pdfopt.

Edit: v2.x of k2pdfopt, released 3-Sep-2013, will preserve previously OCR'd text by default even when not used in native PDF output mode (e.g. when re-flowing text).

willus

Posted 2012-05-12T03:43:59.727

Reputation: 476

1

Also note that it is possible to run k2pdfopt directly on your kindle using kindlepdfviewer or its forks. Further information and links are given here.

– Tim – 2013-09-25T13:20:03.250

1

Found this guide. Basically using k2pdfopt to convert the two columns into just one. And then running that through Calibre for a cleaner version.

This is a partial solution as it removes the OCR benefits. The resulting file is only images and thus very large.

grossmae

Posted 2012-05-12T03:43:59.727

Reputation: 137