how-to redaction security

How to Verify Your PDF Redaction Actually Worked

TaxRedact Team
| | 6 min read

You’ve redacted your PDF. The black boxes look solid. The sensitive data appears to be hidden. But is it really gone?

Before you share that document, run these four simple tests to verify your redaction actually worked.

Test 1: The Copy-Paste Test

This is the quickest way to check for fake redactions.

How to Do It

  1. Open your redacted PDF
  2. Press Ctrl+A (Windows) or Cmd+A (Mac) to select all text
  3. Press Ctrl+C / Cmd+C to copy
  4. Open a text editor (Notepad, TextEdit, etc.)
  5. Paste the content

What to Look For

  • If redaction worked: The sensitive text won’t appear in the pasted content
  • If redaction failed: The “hidden” text will appear in your paste, fully readable

Why This Works

Fake redactions (drawn shapes, highlights) only add visual elements on top of text. The underlying text remains selectable. True redaction removes the text from the content stream, so there’s nothing to select or copy.

Test 2: The Search Test

Use your PDF viewer’s search function to hunt for sensitive data.

How to Do It

  1. Open the redacted PDF
  2. Open the search dialog (Ctrl+F / Cmd+F)
  3. Search for specific words or numbers that should have been redacted
    • Social Security Number digits
    • Names that were supposed to be hidden
    • Account numbers
    • Any unique text that should be gone

What to Look For

  • If redaction worked: Search returns no results
  • If redaction failed: Search finds and highlights the “redacted” text, often visibly underneath the black box

Pro Tip

Search for partial strings. If redacting “123-45-6789”, search for “6789” or “123-45” separately. The full string might not match due to formatting, but partial matches reveal the problem.

Test 3: The File Size Test

True redaction typically reduces file size because data is removed.

How to Do It

  1. Note the file size of your original PDF
  2. Note the file size of your redacted PDF
  3. Compare

What to Look For

  • If redaction worked: File size should decrease (or stay roughly the same if very little was redacted)
  • If redaction failed: File size increases because you’ve added elements (black boxes) without removing anything

Caveats

This test isn’t definitive. File size can change for other reasons:

  • Recompression of images
  • Font subsetting changes
  • Metadata modifications

But if your file got significantly larger after “redaction,” that’s a red flag.

Test 4: The pdftotext Test (Advanced)

For the most thorough check, extract all text using command-line tools.

How to Do It (Mac/Linux)

pdftotext your-document.pdf -

This outputs all text content to your terminal. You can also save to a file:

pdftotext your-document.pdf output.txt

How to Do It (Windows)

Download pdftotext as part of Xpdf tools, then run:

pdftotext.exe your-document.pdf output.txt

What to Look For

Open the output and search for any text that should have been redacted. This extracts text directly from the PDF’s content stream, bypassing all visual layers.

Alternative: Online PDF to Text

If you can’t install tools, several online services convert PDF to text. Be cautious about uploading sensitive documents to third-party services.

Bonus: Visual Inspection

Sometimes fake redactions are visible on close inspection.

What to Look For

  1. Selection highlighting: Can you select text behind the black box? Your cursor shouldn’t be able to select anything if the text is truly gone.

  2. Box edges: If you zoom in, fake redactions sometimes have gaps around the edges where text peeks through.

  3. Inconsistent appearance: True redaction typically looks uniform. Fake redactions might have varying opacity or color based on underlying text.

What To Do If Tests Fail

If your redactions fail any of these tests, the data is still in your document. Here’s what to do:

Don’t Share the Document

Obviously. The sensitive data is exposed.

Use Proper Redaction Tools

Switch to software that performs true redaction:

  • Adobe Acrobat Pro — Use the dedicated Redaction tool (Edit > Redact Text & Images), not drawing tools
  • TaxRedact — AI-powered detection and true redaction
  • Specialized legal/government tools — Many industries have approved redaction software

Start Fresh

Don’t try to “fix” a failed redaction. Start with your original document and use proper redaction tools from the beginning.

Verify Again

After re-redacting with proper tools, run all four tests again. Never assume redaction worked—verify it.

Creating a Verification Checklist

For high-stakes documents, create a formal verification process:

PDF Redaction Verification Checklist

Document: ________________
Date: ________________
Verified by: ________________

[ ] Copy-paste test completed
    [ ] No sensitive text found in paste output

[ ] Search test completed
    [ ] Searched for: ________________
    [ ] No matches found

[ ] File size comparison
    [ ] Original size: ________ KB
    [ ] Redacted size: ________ KB
    [ ] Size decreased or stayed same

[ ] Text extraction test completed
    [ ] Method used: ________________
    [ ] No sensitive text found

[ ] Visual inspection completed
    [ ] No text visible at edges
    [ ] Cannot select text behind boxes

Verification Status: [ ] PASSED  [ ] FAILED

Prevention is Better Than Detection

The best approach is to use proper redaction tools from the start:

  1. Use dedicated redaction features, not drawing tools
  2. Look for “permanent redaction” or “remove text” in tool descriptions
  3. Avoid tools that only offer “annotation” or “markup” options
  4. When in doubt, test on a sample document first

Summary

TestWhat It ChecksTime Required
Copy-PasteText still in content stream30 seconds
SearchSpecific strings exist1 minute
File SizeData removed vs. added30 seconds
pdftotextComplete text extraction2 minutes
VisualEdge visibility, selection1 minute

Run all five tests. They only take a few minutes and could prevent a catastrophic data exposure.


Want automatic verification? TaxRedact performs true PDF redaction that passes all these tests. AI-powered detection finds sensitive data, then permanently removes it from your documents.