TY - GEN
T1 - Data privacy protection by kernel subspace projection and generalized eigenvalue decomposition
AU - Diamantaras, Konstantinos
AU - Kung, Sun Yuan
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/11/8
Y1 - 2016/11/8
N2 - Various internet services, including cloud providers and social networks collect large amounts of information that needs to be processed for statistical or other reasons without breaching user privacy. We present a novel approach where privacy protection can be viewed as a data transformation problem. The problem is formulated as a pair of classification tasks, (a) a privacy-insensitive and (b) a privacy-sensitive task. Then privacy protection is the requirement that, given the transformed data, no classification algorithm may perform well on the sensitive task while hurting the performance on the insensitive task as little as possible. To that end, we introduce a novel criterion called Multiclass Discriminant Ratio which is optimized using the generalized eigenvalue decomposition of a pair of between class scatter matrices. We then formulate a nonlinear extension of this approach using the kernel GED method. Our proposed methods are evaluated using the Human Activity Recognition data set. Using the kernel projected data the performance of the User recognition task is reduced by 89% while the Activity recognition task is reduced only by 7.8%.
AB - Various internet services, including cloud providers and social networks collect large amounts of information that needs to be processed for statistical or other reasons without breaching user privacy. We present a novel approach where privacy protection can be viewed as a data transformation problem. The problem is formulated as a pair of classification tasks, (a) a privacy-insensitive and (b) a privacy-sensitive task. Then privacy protection is the requirement that, given the transformed data, no classification algorithm may perform well on the sensitive task while hurting the performance on the insensitive task as little as possible. To that end, we introduce a novel criterion called Multiclass Discriminant Ratio which is optimized using the generalized eigenvalue decomposition of a pair of between class scatter matrices. We then formulate a nonlinear extension of this approach using the kernel GED method. Our proposed methods are evaluated using the Human Activity Recognition data set. Using the kernel projected data the performance of the User recognition task is reduced by 89% while the Activity recognition task is reduced only by 7.8%.
KW - Kernel Generalized Eigenvalue Decomposition
KW - Privacy Protection
KW - Subspace methods
UR - http://www.scopus.com/inward/record.url?scp=85002152577&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85002152577&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2016.7738831
DO - 10.1109/MLSP.2016.7738831
M3 - Conference contribution
AN - SCOPUS:85002152577
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - 2016 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
A2 - Diamantaras, Kostas
A2 - Uncini, Aurelio
A2 - Palmieri, Francesco A. N.
A2 - Larsen, Jan
PB - IEEE Computer Society
T2 - 26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings
Y2 - 13 September 2016 through 16 September 2016
ER -