Robustness of Unsupervised Representation Learning without Labels

[PK22] Aleksandar Petrov, Marta Kwiatkowska. Robustness of Unsupervised Representation Learning without Labels. Technical report arXiv:2210.04076, arXiv. 2022. [pdf] [bib] https://doi.org/10.48550/arXiv.2210.04076

Downloads:

pdf (16.48 MB) $bib$ bib

Links: [Google] [Google Scholar] [CiteSeer]

Available from: https://doi.org/10.48550/arXiv.2210.04076

Abstract. Unsupervised representation learning leverages large unlabeled datasets and is competitive with supervised learning. But non-robust encoders may affect downstream task robustness. Recently, robust representation encoders have become of interest. Still, all prior work evaluates robustness using a downstream classification task. Instead, we propose a family of unsupervised robustness measures, which are model- and task-agnostic and label-free. We benchmark state-of-the-art representation encoders and show that none dominates the rest. We offer unsupervised extensions to the FGSM and PGD attacks. When used in adversarial training, they improve most unsupervised robustness measures, including certified robustness. We validate our results against a linear probe and show that, for MOCOv2, adversarial training results in 3 times higher certified accuracy, a 2-fold decrease in impersonation attack success rate and considerable improvements in certified robustness.