This has happened to me many times in Microsoft Word when trying to export a simple manuscript to PDF. A 5–8 page Word document, ~50 KB in size, will end up as a 10+ MB PDF file, which is far too large to reasonably email to someone.
Rene's answer is on the right track—the problem is that fonts get embedded into the document—but just using one of the standard typefaces won't necessarily solve the problem.
All of my documents were in Times New Roman, using nothing fancier than bold and italics. Or so I thought. It turns out that I have automatic kerning enabled in my default template (for obvious reasons). When exporting to PDF, Word was actually embedding each of those ligatures as a separate font object into the document, bloating it beyond all belief.
The fix is simple, you just have to remember to do it each time:
- Select all of the text in the document.
- Format → Font → Advanced
- Uncheck "Kerning for fonts"
![](../../I/static/images/0e0d2a9c273ca528a4f84aa42fddfb5e06fe5c414b8867aa9d7afc8e5f492c55.png)
Interestingly, you can leave ligatures, contextual alternatives, and other advanced typography features enabled; they have no perceptible effect on the size of the resulting PDF.
Re-export the document as a PDF, and it's down to a hundred or so KB. Unfortunately, the kerning is sub-par, so I wouldn't recommend printing this way, but it works fine for emailing a document.
28My guess is that the PDF embeds the font, which is necessary if a document is to be truly portable. – AFH – 2015-09-30T08:29:14.710
you can open properties to see if the font is embedded or not – phuclv – 2015-09-30T08:31:41.780
Can you add a link to the
pdf
and maybe thedocx
too? – Hastur – 2015-09-30T08:37:04.3232Yes, the font subset is embedded. That might be it. I've tried to repeat the same sentence a few hundred times and the PDF file size only grew by 4 kB which is just about right. (DOCX stayed on 12kB which is no surprise as that is a zipped format and repeated text will take hardly any new bytes.) – Borek Bernard – 2015-09-30T08:37:05.217
The setting "Minimum size (publishing online)" probably only affects the quality of embedded images, not of fonts. – Arjan – 2015-09-30T09:07:23.080
@AFH Spot on! I wonder if it will also embed common fonts such as Arial – MonkeyZeus – 2015-09-30T12:38:48.197
1
@AFH It does not embed Arial. http://i.stack.imgur.com/aUZgt.png
– MonkeyZeus – 2015-09-30T12:43:03.6371Thinking about it from a Kolmogorov complexity standpoint, Microsoft Word is larger than your average PDF viewer, by much more than a few hundred kB. – hobbs – 2015-09-30T15:06:39.667
8I think the real question is why your wordprocessing format is so much bigger than the equivalent LaTeX ... :-p – Toby Speight – 2015-09-30T16:40:34.603
1Also, remember that DOCX is really just a zip file so you have built-in compression at the document level. PDF has some internal compression techniques (streams) but there's lots a preamble (tokens/names) surrounding those that don't get any compression applied. – Chris Haas – 2015-09-30T20:51:04.663