How to make a searchable PDF document from a scan AND a source Word document?


I have a scanned PDF with some slight changes made by hand, and the source file. I want to make a PDF that is searchable (based on the text from the source; the handwritten changes would remain as they are).

I am looking for free (and, even better, portable) software that would let me somehow "combine" the images from the scan with the text from the source DOC file, so that the image SEEMS selectable and searchable.

UPD: use case: I have the source DOC file. I printed it, made some notes by hand on the printed sheets, and then scanned them. What I want is a PDF with the scanned images in which the text on the images is selectable and searchable. Like the "OCR" feature of Acrobat, but without doing actual OCR, because I have the original source text, and using freeware, portable software.

Evengard

Posted 2012-03-28T10:56:56.767

Reputation: 1 500

Take a look at the pdfsandwich command; here is the website of the project.

– Pablo A – 2018-04-05T05:59:05.140

Why not just make the doc into a pdf (perhaps with the scan as a background)? – soandos – 2012-03-28T11:41:21.240

The text would be doubled that way. Well, I am already thinking about somehow making the letters transparent and inserting the scans as a background... But isn't there a better solution?

And I have no idea how to make text transparent in Office Word – Evengard – 2012-03-28T11:47:12.320

I'm having some trouble understanding your question. Can you please clarify it? For example, please reword "some slightly changes made by hand and a source file"... I'm not explaining properly... Please just make your question easier to understand. – wizlog – 2012-03-29T16:29:58.197

I mean: I have the source DOC file. I printed it, made some notes by hand on the printed sheets, and then scanned them. What I want is a PDF with the scanned images in which the text on the images is selectable and searchable. Like the "OCR" feature of Acrobat, but without doing actual OCR, because I have the original source text, and using freeware, portable software. – Evengard – 2012-03-31T18:57:34.233

Answers

0

eHow Tech posted three methods of converting Word documents to PDF (Portable Document Format). I am sure two of them work fine; I'm not sure about Zamzar.

  1. Go to the Zamzar website. Zamzar provides free conversion to and from different formats. This option works well if you don't need to convert Word documents to PDF frequently.

  2. Purchase and install Adobe Acrobat. At the time of publication, Adobe Acrobat Standard was selling for approximately $300 (now only $139). A new "Save as PDF" option is added to [Microsoft Word] after installing Acrobat. Most libraries, schools, Sony PCs, and work laptops (the ones provided by your company) already have Adobe Acrobat installed.

  3. Microsoft Office Add-in: Microsoft Save as PDF or XPS. This add-in allows you to export and save to the PDF and XPS formats in eight 2007 Microsoft Office programs.

wizlog

Posted 2012-03-28T10:56:56.767

Reputation: 12 320

  4. OpenOffice.org reads Word docs and writes PDF. – RedGrittyBrick – 2012-03-28T12:32:36.730

    This is not exactly what I wanted. Sure, I can convert a plain DOC to PDF; that is not a problem at all. But I need to add a "selection layer" or something like it to an existing PDF document (or just to the images of this document), so that the scanned document can be selected and searched.

    It seems that Acrobat creates such a layer when we use its "OCR Recognition" feature, but I don't need OCR, since I already have the original text. And I can't use Acrobat because of some restrictions. I need freeware, portable software. – Evengard – 2012-03-29T05:30:46.263
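
For what it's worth, the "selection layer" described above is usually just ordinary PDF text drawn in text rendering mode 3 (neither filled nor stroked, so invisible) on top of the page image. Below is a minimal sketch of that idea using only the Python standard library. It is an illustration under assumptions, not a finished tool: the function names (`build_sandwich_pdf`, `escape_pdf_text`, `stream`) are hypothetical, the scan is assumed to be already available as raw 8-bit grayscale pixels, the source text is assumed to be already extracted from the DOC as plain lines, and the invisible lines simply flow down the page rather than being aligned with the printed words. The built-in Helvetica font used here only covers basic Latin text.

```python
def escape_pdf_text(text: str) -> bytes:
    """Escape backslashes and parentheses for a PDF literal string."""
    for ch in ("\\", "(", ")"):
        text = text.replace(ch, "\\" + ch)
    # Assumption: Latin-1 is enough; other characters become "?".
    return text.encode("latin-1", "replace")


def stream(dict_entries: bytes, data: bytes) -> bytes:
    """Wrap raw data as an uncompressed PDF stream object body."""
    header = b"<< " + dict_entries + b"/Length %d >>\nstream\n" % len(data)
    return header + data + b"\nendstream"


def build_sandwich_pdf(gray_pixels, width, height, lines,
                       page_w=595, page_h=842):
    """Return a one-page PDF: the scan as a full-page image, plus the
    given text lines drawn invisibly (rendering mode 3) on top of it."""
    # Draw the image scaled to the whole page, then the hidden text.
    ops = [b"q %d 0 0 %d 0 0 cm /Im0 Do Q" % (page_w, page_h)]
    ops.append(b"BT 3 Tr /F1 11 Tf 14 TL 72 %d Td" % (page_h - 72))
    for line in lines:
        ops.append(b"(" + escape_pdf_text(line) + b") Tj T*")
    ops.append(b"ET")
    content = b"\n".join(ops)

    page = (b"<< /Type /Page /Parent 2 0 R /MediaBox [0 0 %d %d] "
            b"/Contents 4 0 R /Resources << /Font << /F1 5 0 R >> "
            b"/XObject << /Im0 6 0 R >> >> >>" % (page_w, page_h))
    image_dict = (b"/Type /XObject /Subtype /Image /Width %d /Height %d "
                  b"/ColorSpace /DeviceGray /BitsPerComponent 8 "
                  % (width, height))
    objects = [
        b"<< /Type /Catalog /Pages 2 0 R >>",
        b"<< /Type /Pages /Kids [3 0 R] /Count 1 >>",
        page,
        stream(b"", content),
        b"<< /Type /Font /Subtype /Type1 /BaseFont /Helvetica >>",
        stream(image_dict, bytes(gray_pixels)),
    ]

    # Serialize the objects and record byte offsets for the xref table.
    out = bytearray(b"%PDF-1.4\n")
    offsets = []
    for num, body in enumerate(objects, start=1):
        offsets.append(len(out))
        out += b"%d 0 obj\n" % num + body + b"\nendobj\n"
    xref_pos = len(out)
    out += b"xref\n0 %d\n" % (len(objects) + 1)
    out += b"0000000000 65535 f \n"
    for off in offsets:
        out += b"%010d 00000 n \n" % off
    out += (b"trailer\n<< /Size %d /Root 1 0 R >>\nstartxref\n%d\n%%%%EOF\n"
            % (len(objects) + 1, xref_pos))
    return bytes(out)
```

To try it, call `build_sandwich_pdf` with the pixel bytes of one scanned page and the matching lines of source text, then write the returned bytes to a `.pdf` file opened in binary mode. A usable tool would additionally need to position each line where it appears on the scan, which this sketch does not attempt.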