Motivated by safety-critical classification problems, we investigate
adversarial attacks against cost-sensitive classifiers. We use current
state-of-the-art adversarially resistant neural network classifiers [1] as the
underlying models. Cost-sensitive predictions are then achieved via a final
processing step in the feed-forward evaluation of the network. We evaluate the
effectiveness of cost-sensitive classifiers against a variety of attacks, and we
introduce a new cost-sensitive attack that outperforms targeted attacks in some
cases. We also explore the measures a defender can take to limit vulnerability
to these attacks. This attacker/defender scenario is naturally framed as a
two-player zero-sum finite game, which we analyze using game theory.
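For concreteness, a minimal sketch of one standard way to realize a cost-sensitive prediction as a final processing step: given the network's class probabilities and a cost matrix, predict the class with minimum expected cost. The function name and example cost matrix below are illustrative, and the paper's exact procedure may differ.

```python
import numpy as np

def cost_sensitive_predict(probs, cost_matrix):
    """Pick the class that minimizes expected cost.

    probs: (n_classes,) softmax output of the underlying network.
    cost_matrix: (n_classes, n_classes) array where cost_matrix[i, j]
        is the cost of predicting class j when the true class is i.
    """
    expected_costs = probs @ cost_matrix  # expected cost of each candidate prediction
    return int(np.argmin(expected_costs))

# Example: binary task where a false negative (true 1, predicted 0)
# is ten times as costly as a false positive.
C = np.array([[0.0, 1.0],
              [10.0, 0.0]])
print(cost_sensitive_predict(np.array([0.7, 0.3]), C))  # prefers class 1
```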