Back to Top

Anonymization practices

No international standard defines the methods for anonymizing data, acceptable levels of risk, or recommended measures of information loss. How much and the type of protection required is specific to each dataset, depending on the sensitivity and “commercial value" of the content, and to each specific legal and cultural environment. It is therefore useful to document some practices. This is, however, not an easy task, as agencies that anonymize their datasets do not communicate much on the methods implemented and the levels of risk in the data they disseminate.

This limited access to knowledge combined with a lack of experience in using the tools and methods makes it difficult for many agencies to implement “optimal” solutions. By optimal we mean; meet their obligations towards privacy protection but also their obligation to release data useful for policy monitoring and evaluation. In order to bridge this gap in practical guidelines The World Bank is currently working on a project funded by the Knowledge for Change Program II. The program seeks to build a knowledge base through experimentation on a diverse set of microdata. This knowledge will then be translated into a practice guide for public release. The practice guide will fill this critical gap by documenting research conducted at the World Bank through a large-scale evaluation of anonymization techniques, and (ii) translating these results into practical guidelines. This practice guide is expected to be released at the end of December 2014