Anonymized data

Anonymized data is a type of data that has been processed to remove any personally identifiable information (PII) or personal data. Such data is often used in research, analytics, and other data-driven activities, as well as for compliance with privacy regulations.

According to GDPR, anonymized data has been altered in such a way that it can’t be used to identify a specific person. Since anonymized data can’t be restored, it isn’t considered personal data under GDPR. This means it is exempt from GDPR.

Some examples of compliant data anonymization methods include:

Randomization:
- Noise addition – Where personal identifiers are expressed imprecisely, for instance: height: 180 cm → height: 320 cm
- Substitution – Where personal identifiers are shuffled within a table or replaced with random values, for instance: ZIP: 10120 → ZIP: postcode
Generalization:
- Aggregation – Where personal identifiers are generalized into a range or group, for instance: age: 30 → age: 20-35

Removing any identifiable information from a dataset allows for meaningful analysis without compromising the privacy of individuals.

Examples of use cases for anonymized data include:

Measuring the effectiveness of marketing campaigns.
Analyzing the behavior of website or mobile app users.
Analyzing trends and patterns.

Anonymized data

The comparison of 9 HIPAA-compliant web analytics platforms

EU hosting vs. EU sovereignty: Why the difference matters for privacy-first analytics

Other definitions

Recent posts from Piwik PRO blog