Massif: Interactive Interpretation of Adversarial Attacks on Deep Learning

TOP 文献データベース Massif: Interactive Interpretation of Adversarial Attacks on Deep Learning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2001.07769

PDF

https://arxiv.org/pdf/2001.07769

文献情報

作者: Nilaksh Das,Haekyu Park,Zijie J. Wang,Fred Hohman,Robert Firstman,Emily Rogers,Duen Horng Chau
公開日: 2020-1-22
更新日: 2020-2-17
所属機関: Georgia Institute of Technology
所属の国: United States of America
会議名: CHI Extended Abstracts

AIにより推定されたラベル

敵対的攻撃検出深層強化学習

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Deep neural networks (DNNs) are increasingly powering high-stakes applications such as autonomous cars and healthcare; however, DNNs are often treated as "black boxes" in such applications. Recent research has also revealed that DNNs are highly vulnerable to adversarial attacks, raising serious concerns over deploying DNNs in the real world. To overcome these deficiencies, we are developing Massif, an interactive tool for deciphering adversarial attacks. Massif identifies and interactively visualizes neurons and their connections inside a DNN that are strongly activated or suppressed by an adversarial attack. Massif provides both a high-level, interpretable overview of the effect of an attack on a DNN, and a low-level, detailed description of the affected neurons. These tightly coupled views in Massif help people better understand which input features are most vulnerable or important for correct predictions.

外部データセット

ImageNet