New multi-dimensional sorting based k-anonymity microaggregation for statistical disclosure control
Paper
Paper/Presentation Title | New multi-dimensional sorting based k-anonymity microaggregation for statistical disclosure control |
---|---|
Presentation Type | Paper |
Authors | Mahmood, Abdun Naser (Author), Kabir, Md Enamul (Author) and Mustofa, Abdul K. (Author) |
Journal or Proceedings Title | Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (Book series) |
Journal Citation | 106, pp. 256-272 |
Number of Pages | 17 |
Year | 2013 |
ISBN | 9783642368820 |
9783642368837 | |
Digital Object Identifier (DOI) | https://doi.org/10.1007/978-3-642-36883-7_16 |
Web Address (URL) of Paper | https://link.springer.com/chapter/10.1007/978-3-642-36883-7_16 |
Conference/Event | 8th International ICST Conference, SecureComm 2012: Security and Privacy in Communication Networks |
Event Details | 8th International ICST Conference, SecureComm 2012: Security and Privacy in Communication Networks Event Date 03 to end of 05 Sep 2012 Event Location Padua, Italy |
Abstract | In recent years, there has been an alarming increase of online identity theft and attacks using personally identifiable information. The goal of privacy preservation is to de-associate individuals from sensitive or microdata information. Microaggregation techniques seeks to protect microdata in such a way that can be published and mined without providing any private information that can be linked to specific individuals. Microaggregation works by partitioning the microdata into groups of at least k records and then replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. The challenge is how to minimize the information loss during the microaggregation process. This paper presents a new microaggregation technique for Statistical Disclosure Control (SDC). It consists of two stages. In the first stage, the algorithm sorts all the records in the data set in a particular way to ensure that during microaggregation very dissimilar observations are never entered into the same cluster. In the second stage an optimal microaggregation method is used to create k-anonymous clusters while minimizing the information loss. It works by taking the sorted data and simultaneously creating two distant clusters using the two extreme sorted values as seeds for the clusters. The performance of the proposed technique is compared against the most recent microaggregation methods. Experimental results using benchmark datasets show that the proposed algorithm has the lowest information loss compared with a basket of techniques in the literature. |
Keywords | privacy; microaggregation; microdata protection; k-anonymity |
ANZSRC Field of Research 2020 | 469999. Other information and computing sciences not elsewhere classified |
Public Notes | File reproduced in accordance with the copyright policy of the publisher/author. |
Institution of Origin | University of Southern Queensland |
Byline Affiliations | University of New South Wales |
University of Canberra |
https://research.usq.edu.au/item/q2xy6/new-multi-dimensional-sorting-based-k-anonymity-microaggregation-for-statistical-disclosure-control
Download files
1673
total views169
total downloads6
views this month1
downloads this month