AIセキュリティポータル K Program
Cross-Input Certified Training for Universal Perturbations
Share
Abstract
Existing work in trustworthy machine learning primarily focuses on single-input adversarial perturbations. In many real-world attack scenarios, input-agnostic adversarial attacks, e.g. universal adversarial perturbations (UAPs), are much more feasible. Current certified training methods train models robust to single-input perturbations but achieve suboptimal clean and UAP accuracy, thereby limiting their applicability in practical applications. We propose a novel method, CITRUS, for certified training of networks robust against UAP attackers. We show in an extensive evaluation across different datasets, architectures, and perturbation magnitudes that our method outperforms traditional certified training methods on standard accuracy (up to 10.3\%) and achieves SOTA performance on the more practical certified UAP accuracy metric.
Synthesizing robust adversarial examples
Athalye, A., Engstrom, L., Ilyas, A., Kwok, K.
Published: 2018
Adversarial training and provable defenses: Bridging the gap
Balunović, M., Vechev, M.
Published: 2020
Universal adversarial training with class-wise perturbations
Benz, P., Zhang, C., Karjauv, A., Kweon, I.S.
Published: 2021
(certified!!) adversarial robustness for free!
Carlini, N., Tramer, F., Dvijotham, K. D., Rice, L., Sun, ` M., Kolter, J. Z.
Published: 2023
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
Published: 8.17.2016
Certified adversarial robustness via randomized smoothing
J. Cohen, E. Rosenfeld, Z. Kolter
Published: 2019
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
Sumanth Dathathri, Krishnamurthy (Dj) Dvijotham, Alex Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy Bunel, Shreya Shankar, Jacob Steinhardt, Ian Goodfellow, Percy Liang, Pushmeet Kohli
Published: 2020
Scaling the convex barrier with active sets
Alessandro De Palma, Harkirat Singh Behl, Rudy Bunel, Philip H. S. Torr, M. Pawan Kumar
Published: 2021
Share