TY - JOUR
T1 - DIFFERENTIALLY PRIVATE IMAGE CLASSIFICATION BY LEARNING PRIORS FROM RANDOM PROCESSES
AU - Tang, Xinyu
AU - Panda, Ashwinee
AU - Sehwag, Vikash
AU - Mittal, Prateek
N1 - Publisher Copyright:
© Xinyu Tang, Ashwinee Panda, Vikash Sehwag, and Prateek Mittal.
PY - 2025/3/31
Y1 - 2025/3/31
N2 - In privacy-preserving machine learning, differentially private stochastic gradient descent (DP-SGD) performs worse than SGD due to per-sample gradient clipping and noise addition. A recent focus in private learning research is improving the performance of DP-SGD on private data by incorporating priors that are learned from real-world public data. In this work, we explore how we can improve the privacy-utility trade-off of DP-SGD by learning priors from images generated by random processes and transferring these priors to private data. We propose DP-RandP, a three-phase approach, to combine the benefits of linear probing and full fine-tuning for the synthetic prior. We attain new state-of-the-art accuracy when training from scratch on CIFAR10, CIFAR100, MedMNIST, Camelyon17 and ImageNet for a range of privacy budgets ε ∈ [1, 10]. In particular, we improve the previous best reported accuracy on CIFAR10 from 60.6% to 72.3% for ε = 1.
AB - In privacy-preserving machine learning, differentially private stochastic gradient descent (DP-SGD) performs worse than SGD due to per-sample gradient clipping and noise addition. A recent focus in private learning research is improving the performance of DP-SGD on private data by incorporating priors that are learned from real-world public data. In this work, we explore how we can improve the privacy-utility trade-off of DP-SGD by learning priors from images generated by random processes and transferring these priors to private data. We propose DP-RandP, a three-phase approach, to combine the benefits of linear probing and full fine-tuning for the synthetic prior. We attain new state-of-the-art accuracy when training from scratch on CIFAR10, CIFAR100, MedMNIST, Camelyon17 and ImageNet for a range of privacy budgets ε ∈ [1, 10]. In particular, we improve the previous best reported accuracy on CIFAR10 from 60.6% to 72.3% for ε = 1.
KW - Differential Privacy
KW - Image Classification
UR - http://www.scopus.com/inward/record.url?scp=105002463126&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105002463126&partnerID=8YFLogxK
U2 - 10.29012/jpc.910
DO - 10.29012/jpc.910
M3 - Article
AN - SCOPUS:105002463126
SN - 2575-8527
VL - 15
JO - Journal of Privacy and Confidentiality
JF - Journal of Privacy and Confidentiality
IS - 1
ER -