Efficient k-anonymous microaggregation of multivariate numerical data via principal component analysis

Por: Monedero, DR, Mezher, AM, Colome, XC, Forne, J, Soriano, M

Publicada: 1 ene 2019

Web: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85068614484&doi=10.1016%2fj.ins.2019.07.042&partnerID=40&md5=6e5191eb8ab29c7516fbf387f649f99d

Resumen:
k-Anonymous microaggregation is a widespread technique to address the problem of protecting the privacy of the respondents involved beyond the mere suppression of their identifiers, in applications where preserving the utility of the information disclosed is critical. Unfortunately, microaggregation methods with high data utility may impose stringent computational demands when dealing with datasets containing a large number of records and attributes. This work proposes and analyzes various anonymization methods which draw upon the algebraic-statistical technique of principal component analysis (PCA), in order to effective reduce the number of attributes processed, that is, the dimension of the multivariate microaggregation problem at hand. By preserving to a high degree the energy of the numerical dataset and carefully choosing the number of dominant components to process, we manage to achieve remarkable reductions in running time and memory usage with negligible impact in information utility. Our methods are readily applicable to high-utility SDC of large-scale datasets with numerical demographic attributes. © 2019 The Authors. Preprint submitted to Elsevier, Inc. © 2019 Elsevier Inc.

Filiaciones:
Monedero, DR:
Univ Politecn Cataluna, Dept Telemat Engn, E-08034 Barcelona, Spain

Mezher, AM:
Univ Politecn Cataluna, Dept Telemat Engn, E-08034 Barcelona, Spain

Colome, XC:
Univ Politecn Cataluna, Dept Telemat Engn, E-08034 Barcelona, Spain

Forne, J:
Univ Politecn Cataluna, Dept Telemat Engn, E-08034 Barcelona, Spain

Soriano, M:
Univ Politecn Cataluna, Dept Telemat Engn, E-08034 Barcelona, Spain

CTTC, E-08860 Barcelona, Spain

ISSN: 00200255

INFORMATION SCIENCES

Editorial
Elsevier Inc., STE 800, 230 PARK AVE, NEW YORK, NY 10169 USA, Estados Unidos America

Tipo de documento: Article
Volumen: 503 Número:
Páginas: 417-443

DOI: 10.1016/j.ins.2019.07.042

WOS Id: 000483425200024

Efficient k-anonymous microaggregation of multivariate numerical data via principal component analysis

MÉTRICAS