Publishing anonymous survey rating data
Article
Article Title | Publishing anonymous survey rating data |
---|---|
ERA Journal ID | 17829 |
Article Category | Article |
Authors | Sun, Xiaoxun (Author), Wang, Hua (Author), Li, Jiuyong (Author) and Pei, Jian (Author) |
Journal Title | Data Mining and Knowledge Discovery |
Journal Citation | 23 (3), pp. 379-406 |
Number of Pages | 28 |
Year | 2011 |
Place of Publication | United States |
ISSN | 1384-5810 |
1573-756X | |
Digital Object Identifier (DOI) | https://doi.org/10.1007/s10618-010-0208-4 |
Web Address (URL) | https://link.springer.com/article/10.1007/s10618-010-0208-4 |
Abstract | We study the challenges of protecting privacy of individuals in the large public survey rating data in this paper. Recent study shows that personal information in supposedly anonymous movie rating records are de-identified. The survey rating data usually contains both ratings of sensitive and non-sensitive issues. The ratings of sensitive issues involve personal privacy. Even though the survey participants do not reveal any of their ratings, their survey records are potentially identifiable by using information from other public sources. None of the existing anonymisation principles (e.g., k-anonymity, l-diversity, etc.) can effectively prevent such breaches in large survey rating data sets. We tackle the problem by defining a principle called (k, ε)-anonymity model to protect privacy. Intuitively, the principle requires that, for each transaction t in the given survey rating data T , at least (k - 1) other transactions in T must have ratings similar to t, where the similarity is controlled by ε. The (k, ε)-anonymity model is formulated by its graphical representation and a specific graph-anonymisation problem is studied by adopting graph modification with graph theory. Various cases are analyzed and methods are developed to make the updated graph meet (k, ε) requirements. The methods are applied to two real-life data sets to demonstrate their efficiency and practical utility. |
Keywords | graphical representation; survey rating data; anonymity |
ANZSRC Field of Research 2020 | 460401. Cryptography |
460499. Cybersecurity and privacy not elsewhere classified | |
460905. Information systems development methodologies and practice | |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | Office of Research |
Department of Mathematics and Computing | |
University of South Australia | |
Simon Fraser University, Canada | |
Institution of Origin | University of Southern Queensland |
https://research.usq.edu.au/item/q0w3v/publishing-anonymous-survey-rating-data
1949
total views9
total downloads0
views this month0
downloads this month