Privacy, Data and the Individual. Diferentially Data sets : methods and mitigations series
| dc.contributor.author | Soria Comas, Jordi | |
| dc.contributor.ror | https://ror.org/02jjdwm75 | |
| dc.date.accessioned | 2024-07-02T16:08:59Z | |
| dc.date.available | 2024-07-02T16:08:59Z | |
| dc.date.issued | 2020-03-27 | |
| dc.description.abstract | Data set releases are the most convenient way to make data available for secondary use: in principle, they allow analysts to carry out any data analysis task (e.g., exploratory data analysis). However, data set releases are a great threat to privacy. This is the issue that privacy preserving data publishing (PPDP) aims to address. Among the available sanitization methods, differential privacy (DP) stands out for the strong privacy guarantees it offers. The fact that DP offers protection regardless of the side information available to intruders is very convenient in the current landscape (pervasive data collection and many untrusted data controllers). However, such strong guarantees have a downside: the information loss we incur when using DP is likely to be large. As a result, there is no standard methodology to generate DP data sets and the use of DP for PPDP is rather limited. In this work, we review the main approaches used in the generation of DP data sets (i.e., histograms, and record aggregation and masking), and describe the advantages and the limitations of each of these approaches in terms of computational cost and information loss. Next, we describe some of the strategies that have been proposed to mitigate the previously described limitations. Among these, we highlight two common strategies: to increase the privacy budget, and to use a relaxed version of DP. Using large privacy budgets is common; however, it has an important downside: DP itself becomes meaningless. Using relaxed versions of DP allows us reduce the information loss while keeping reduced but meaningful privacy guarantees. | |
| dc.description.keyword | Data set | |
| dc.description.keyword | Conjuntos de datos | |
| dc.description.keyword | Analysis | |
| dc.description.keyword | Análisis | |
| dc.description.keyword | Personal Data | |
| dc.description.keyword | Datos personales | |
| dc.description.keyword | Privacy | |
| dc.description.keyword | Privacidad | |
| dc.description.keyword | Marketing | |
| dc.description.keyword | Technology | |
| dc.description.keyword | Tecnología | |
| dc.description.keyword | General Data Protection Regulation | |
| dc.description.keyword | GDPR | |
| dc.description.keyword | Regulación General de Protección de Datos | |
| dc.description.keyword | RGPD | |
| dc.format | application/pdf | |
| dc.identifier.citation | Soria-Comas, J. (2020). Privacy, Data and the Individual. Diferentially Data sets : methods and mitigations series. Zenodo. https://doi.org/10.5281/zenodo.3731233 | |
| dc.identifier.doi | https://doi.org/10.5281/zenodo.3731233 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14417/2765 | |
| dc.language.iso | en | |
| dc.license | https://creativecommons.org/licenses/by/4.0/legalcode | |
| dc.publisher | IE University | |
| dc.relation.center | IE Center for The Governance of Change | |
| dc.relation.entity | IE University | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/legalcode | |
| dc.subject.keyword | Data set | |
| dc.subject.keyword | Conjuntos de datos | |
| dc.subject.keyword | Analysis | |
| dc.subject.keyword | Análisis | |
| dc.subject.keyword | Personal Data | |
| dc.subject.keyword | Datos personales | |
| dc.subject.keyword | Privacy | |
| dc.subject.keyword | Privacidad | |
| dc.subject.keyword | Marketing | |
| dc.subject.keyword | Technology | |
| dc.subject.keyword | Tecnología | |
| dc.subject.keyword | General Data Protection Regulation | |
| dc.subject.keyword | GDPR | |
| dc.subject.keyword | Regulación General de Protección de Datos | |
| dc.subject.keyword | RGPD | |
| dc.title | Privacy, Data and the Individual. Diferentially Data sets : methods and mitigations series | |
| dc.type | info:eu-repo/semantics/report | |
| dc.version.type | info:eu-repo/semantics/publishedVersion | |
| dspace.entity.type | Publication |
Bloque original
1 - 1 de 1
