Data Pseudonymization & Anonymization Under GDPR

Data pseudonymization, data anonymization, what’s the difference? In this article we compare the most important benefits that come from applying each data processing technique.

Anonymization and pseudonymization are two terms that have been broadly discussed since the introduction of the General Data Protection Regulation. Nevertheless, the topic still gives rise to a host of doubts and questions among people whose businesses are affected by GDPR.

In case you haven’t had the chance to explore the main differences between those methods, here are some great definitions of the terms:

What is data pseudonymization?

According to Article 4(5) GDPR:

pseudonymisation means the processing of personal data in such a manner that the personal data can no longer be attributed to a specific data subject without the use of additional information, provided that such additional information is kept separately and is subject to technical and organisational measures to ensure that the personal data are not attributed to an identified or identifiable natural person.

Also, as we can read in Recital 26:

Personal data which have undergone pseudonymisation, which could be attributed to a natural person by the use of additional information should be considered to be information on an identifiable natural person …

What is data anonymization?

In Opinion 05/2014 on Anonymisation Techniques by The Article 29 Working Party, we can read the following about anonymous data:

[…] the data must be stripped of sufficient elements such that the data subject can no longer be identified. More precisely, that data must be processed in such a way that it can no longer be used to identify a natural person by using ‘all the means likely reasonably to be used’ by either the controller or a third party. An important factor is that the processing must be irreversible.

Age: 30
↓
[Aggregation/K-Anonymity]
↓
Age: 20-35
An example of data anonymization using Aggregation/K-Anonymity

In layman’s terms, the main difference is that while pseudonymous data still allows for some form of re-identification, anonymous data can’t be re-identified. That’s why the former is still considered personal data, while the latter isn’t.

However, it’s important to know that the topics of data anonymization and pseudonymization are a lot more complex.

We’ve dug deep into the characteristics of both methods in a series of blog posts. If you’re still not familiar with them, go ahead and check them out:

Because now it’s time to show the operational impacts of both methods. As you probably know, applying anonymization or data pseudonymization techniques to your data can bring some considerable benefits and liberate you from certain obligations set out in GDPR.

You’re probably wondering just how it does this. We’ve decided to explain it by presenting a simple comparison where we put data pseudonymization and anonymization up against identified personal data.

PII vs personal data

Learn how to recognize PII and personal data to stay away from privacy issues.

Download your copy

If you’re interested in exactly how anonymization and pseudonymisation can liberate you from certain obligations imposed by GDPR, there’s a very helpful table below that sheds some more light on the matter.

The table covers the 15 most important GDPR obligations and their impact on identified, pseudonymous and anonymous data:

GDPR obligation	Identified Data	Pseudonymous Data	Anonymous Data
Notifying data subject about collecting data	Required	Required	Not required*
Obtaining consent	Required	Required	Not required*
Ability to exercise right to erasure	Required	Not required**	Not required
Ability to exercise right to access	Required	Not required**	Not required
Ability to exercise right to data portability	Required	Required***	Not required
Ability to exercise right to data rectification	Required	Not required**	Not required
Ability to exercise right to object	Required	Not required**	Not required
Processing pre-GDPR data	Not allowed	Not allowed	Allowed
Presenting basis for cross-border transfer	Required	Required	Not required
Protection by design	Not met	Partially met	Partially met
Data breach notificatification	Required	Depends on the method****	Not required
Data retention limitation	Required	Required	Not required
Documentation obligation	Required	Required	Not required
Singing a data processing agreement with a vendor	Required	Required	Not required
Appling the data minimization principle	Advisable	Advisable	Advisable

* It’s important to remember that if you want to anonymize new data collected from your website, then you’ll either need to obtain consent to collect personal data (like cookies, IP addresses and device ID) and then apply anonymization techniques, or only collect anonymous data from the start.

** However, to do that, you should demonstrate that due to applying certain pseudonymization techniques you’re not able to identify the data subject.

*** The latest opinion of the Information Commissioner’s Office indicates that pseudonymized data should be included in the scope of portability rules.

**** Only if you’re able to demonstrate to the authorities that identifying a person from the breached data set is nearly impossible.

If you’d like to learn more about the topics covered in this comparison, we invite you to read dedicated articles on our blog:

Data pseudonymization vs. anonymization – some conclusions

As you can clearly see from the comparison above, anonymization is definitely one of the best ways to ensure the safety of data you collect. This extra measure of security lets you freely exploit your data collection in ways that wouldn’t be legally permissible when it comes to non-anonymized data.

Data pseudonymization is also a good way to reduce restrictions involved in handling personal data in the age of GDPR. And it’s a bit easier to perform.

However, there are also some considerable benefits of using personal data in its pure (original) form. That’s why you really need to think through the pros and cons of each option before making a final decision.

We’re here to answer all your burning questions about this subject. So if you’d like to learn more about each method or about how Piwik PRO can help you apply them to your data, don’t hesitate to contact us. Our team will be happy to help!

The most important benefits of data pseudonymization and anonymization under GDPR

What is data pseudonymization?

What is data anonymization?

Data pseudonymization vs. anonymization – some conclusions

Karolina Lubowicka

We’re Launching the Piwik PRO Partner Program!

Predictive Customer Acquisition Platform: 4 Ways it Can Help You Win New Clients

Categories

Similar posts