A simple 5-page text document should not be 30MB. But sometimes it is — and the reason isn't always obvious. PDF files can contain several types of hidden bloat that inflate their size far beyond what's visible on screen. This guide breaks down the five main causes, how to identify which one is affecting your file, and exactly how to fix each one for free.
This is by far the most common cause. When you scan a document or embed a photo into a PDF, the image is stored at whatever resolution it was captured at. Scanners default to 300 DPI — which is appropriate for printing but 2–4× higher resolution than any screen actually displays.
A single A4 page scanned at 300 DPI in colour is roughly 1–3MB. A 20-page scanned document is therefore 20–60MB before any other factors. If your PDF is unexpectedly large and contains scanned pages or photos, this is almost certainly the cause.
How to identify it: Open the PDF and zoom in to 200%. If the images look very sharp at high zoom, they are high-resolution. If they look slightly blurry even at 100%, they may already be compressed.
PDF files can embed images in multiple formats — JPEG (compressed), PNG (lossless compressed), or raw bitmap (uncompressed). Some PDF creation tools — particularly older software, certain CAD programs, and some Microsoft Office export settings — embed images as uncompressed bitmaps.
An uncompressed bitmap is 3 bytes per pixel. A 1000×1000 pixel image is 3MB uncompressed. The same image as a JPEG at medium quality is typically 50–200KB — a 15–60× difference. A PDF with several uncompressed images can be enormous even if the images aren't high-resolution.
How to identify it: If your PDF was created from a Word document, PowerPoint, or design software and is much larger than you'd expect, this is a likely cause. A 5-page Word document exported to PDF should be under 1MB; if it's 15MB, uncompressed images are almost certainly present.
PDFs embed font files to ensure the document looks the same on any device, even if the recipient doesn't have the fonts installed. This is a feature, not a bug — but it adds size.
A single font file can be 200KB–2MB. A document that uses many different fonts — or fonts with large character sets (Chinese, Arabic, Devanagari) — can accumulate several megabytes in embedded font data alone.
Well-behaved PDF creators embed only the characters actually used ("font subsetting"), which keeps sizes small. Poorly-behaved tools embed the entire font file. Design software like InDesign and Illustrator sometimes defaults to embedding complete font files, particularly for licensed fonts.
How to identify it: If your PDF is text-only with minimal images and still unusually large, fonts may be the cause. Check with a PDF inspector tool — Adobe Acrobat's Document Properties shows embedded fonts and their sizes.
Every time a PDF is edited and saved, many tools store the full revision history — not just the current version of the document. This means every draft, every deleted paragraph, every repositioned image may still be inside the file, invisible but consuming space.
Additionally, PDFs can carry extensive metadata: author name, creation software, edit timestamps, comments, annotations, form field data, and tool-specific data from the application that created them. Design software like Illustrator and Affinity Publisher is particularly prone to embedding large amounts of metadata and preview data.
How to identify it: Compare the file size to what you'd expect based on visible content. A 2-page document with mostly text should be under 200KB. If it's 5MB, there's hidden data. This is common with PDFs that have been through multiple rounds of editing.
PDFs created from vector design tools — Adobe Illustrator, InDesign, Affinity Designer, CorelDRAW — often contain multiple layers, hidden objects that aren't visible in the final document, and complete versions of earlier states.
A design file exported to PDF may retain the original artwork layers, alternative text versions, hidden bleed marks, or working elements that were turned off but not deleted. The PDF viewer only shows what's enabled, but all the hidden content is in the file.
How to identify it: If the PDF was created by a designer or exported from design software and the visible content seems simple, this is likely. Re-exporting from the source with "Flatten Layers" and "Discard Hidden Data" options enabled before compressing often yields dramatic size reductions.
For most large PDFs — regardless of the underlying cause — compression addresses the symptoms effectively:
Try Maximum compression first — it addresses causes 1–4 effectively. If the result is still unexpectedly large, the file likely has cause 5 (design layers) and needs to be re-exported from the source.
No registration. No file size limit. Your file never leaves your browser.
📚 Compress My PDF Now