Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association · ...

18
A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org www.pdfa.org 2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days Europe 2018 Pdf Redaction Kevin Willems Software developer @ iText Software 1 1

Transcript of Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association · ...

Page 1: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

PDF Days Europe 2018

Pdf Redaction

Kevin Willems Software developer @ iText Software

1 1

Page 2: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

PDF Days Europe 2018

2 2

Page 3: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

What are we going to see

What is redaction

Why should you redact

What should you redact

Methods of redaction

When to redact

Getting redaction right

3

Page 4: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

What is redaction?

Redaction is removing privileged, sensitve or protected information

from a document.

This is typically done before publishing,

distributing or archiving

documents.

4

Page 5: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

What is redaction?

Ms. Cowley was shot by Mr. James on 4:11 pm.

5

Page 6: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Why should you redact?

• Reduce classification level

Secrecy protection

• Guarantee safety and anonymity • Secure sensative data • GDPR Compliance

Privacy protection

6

Page 7: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Why should you redact?

7

Page 8: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Why should you redact?

8

Page 9: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

What should be redacted?

Employee and customer data

• Social Security numbers • Medical information • Names • Drivers license or

professional license numbers

• Individuals’ addresses, dates and months of birth,..

• Other unique identifiers

Company Data

• Financial documents and files

• Proprietary information and trade secrets

9

Page 10: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

What should be redacted?

10

Page 11: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Methods of redaction

Page location/region

Regular Expressions

Artificial Intelligence

Manual

11

Page 12: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Page location/region specific redaction

12

Page 13: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Redaction using regex

Social Security Numbers: xxx-xxx-xxxx Credit Card Numbers: xxxx xxxx xxxx xxxx or 3xxx xxxxxx xxxxx Email addresses: [email protected]/org/net/(other TLDs)

13

Page 14: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Artificial Intelligence

Face recognition

Semantic analysis

14

Page 15: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Manual Redaction

Try to avoid

When unavoidable:

follow a documented

process

15

Page 16: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

When to redact

Upon acquisition

Prior to distribution

Once the ‘work’ is

done

Prior to archival

Prior to disposal

16

Page 17: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

Getting it right!

17

Determine your tools

Decide on what to redact

Decide on the method(s) you will use

Decide when the data should be redacted

Implement your redaction process

Page 18: Redaction · 2019. 3. 3. · A PDF Association Presentation · © 2018 by PDF Association ·   2018-05-14 Kevin Willems, Software developer at iText Software BVBA PDF Days ...

A PDF Association Presentation · © 2018 by PDF Association · www.pdfa.org

www.pdfa.org

2018-05-14

Kevin Willems, Software developer at iText Software BVBA

18

PDF Days Europe 2018

18

https://itextpdf.com/

PDF

iText Questions?

Redaction GDPR

compliance

workflow archive

Risk