2
I'm pasting selected portions from a few different PDFs into MS Word, and in some cases entire sentences get pasted together as one word without spaces in between (eg: whatImeanistextlikethis). Manually inserting spaces between every single word gets exhausting. Is there any built-in way or a plugin to automatically do such a thing? MS Word obviously knows what words are supposed to be, from its dictionary, so I'm hoping there's a plugin or macro out there that utilizes this to do an intelligent splitting.
Update: I've now tried Ctrl+V, Paste Text Only, Paste Special (Unformatted Text) and Paste Special (Unformatted Unicode Text), with no difference. Pasting into plain old notepad also pastes the words clumped together.
Update2: Here's some relevant links for anyone else looking into splitting words automatically:
1Yes, Word does recognize words spelled similar to those stored in it's dictionary, and tries to guess what you mean. In the case you described, I seriously doubt it can differentiate the words within a sentence without spaces. It likely sees it as one big misspelled word. Are the words together in the PDF? If not, you might want to try a different way to paste them into Word. – CharlieRB – 2014-04-07T19:27:24.750
No, the words aren't together in the PDF, but come out together when copied and pasted. Could you give me an example of a different way to paste them? – sundar - Reinstate Monica – 2014-04-08T06:59:50.570
You've not shared what method you are using now; context menu, Ctrl+V, menu, text only, merged format, etc. There are many methods to go about it. Tell us what you've tried. And without having the PDF file, I can not try anything to tell you what will work. You might want to post a link to the file if it doesn't contain any private information. – CharlieRB – 2014-04-08T11:34:46.427
I've now tried Ctrl-V, Paste Text Only, Paste Special (Unformatted Text) and Paste Special (Unformatted Unicode Text), no difference. Pasting into plain old notepad also pastes the words clumped together. The PDF cannot be shared though, I'll look into writing a macro myself to do this. (And of course will update with an answer if I do manage it!) – sundar - Reinstate Monica – 2014-04-08T12:15:17.747
I'd suggest using OCR to capture the text from the PDF. It will save you a lot of time and frustration trying to write a macro that will know where to separate the words. – CharlieRB – 2014-04-08T12:20:12.213
Have you tried another PDF viewer to copy from? Fixing the problem of non-spaced text would be quite hard and error prone. – cjb110 – 2014-04-08T14:08:48.810
Have you tried saving the PDF as a .docx file? – Karen927 – 2014-04-08T19:56:48.840
@Karen927 How do I do that? – sundar - Reinstate Monica – 2014-04-08T20:26:48.617
If you have Adobe Pro, you can save PDFs as Word files (just select "save as" and choose the Word extension). If you don't have it, you could find a relatively cheap license for version 9.0. – Karen927 – 2014-04-09T20:13:11.837