Designing Adversarially Resilient Classifiers using Resilient Feature Engineering

TOP Literature Database Designing Adversarially Resilient Classifiers using Resilient Feature Engineering

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1812.06626

PDF

https://arxiv.org/pdf/1812.06626

Paper Information

Author: Kevin Eykholt,Atul Prakash
Published: 12-17-2018
Affiliation: University of Michigan
Country: United States of America
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Robustness Certified Robustness Multi-Class Classification

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

We provide a methodology, resilient feature engineering, for creating adversarially resilient classifiers. According to existing work, adversarial attacks identify weakly correlated or non-predictive features learned by the classifier during training and design the adversarial noise to utilize these features. Therefore, highly predictive features should be used first during classification in order to determine the set of possible output labels. Our methodology focuses the problem of designing resilient classifiers into a problem of designing resilient feature extractors for these highly predictive features. We provide two theorems, which support our methodology. The Serial Composition Resilience and Parallel Composition Resilience theorems show that the output of adversarially resilient feature extractors can be combined to create an equally resilient classifier. Based on our theoretical results, we outline the design of an adversarially resilient classifier.