Adversarial Attacks on Graph Classification via Bayesian Optimisation

arXiv

AI Security Portal bot

The information in this literature database is collected automatically.

Source

https://arxiv.org/abs/2111.02842

PDF

https://arxiv.org/pdf/2111.02842

Publication details

Authors: Xingchen Wan; Henry Kenlay; Binxin Ru; Arno Blaas; Michael A. Osborne; Xiaowen Dong
Published: 2021-11-04
Affiliation: Machine Learning Research Group, University of Oxford
Country of affiliation: United Kingdom
Venue: Computing Research Repository (CoRR)

Labels estimated by AI

Adversarial attack methods; Poisoning; Explainability of graph machine learning

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".

Abstract

Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous real-life applications such as biochemistry and social network analysis. The few existing methods often require unrealistic setups, such as access to internal information of the victim models, or an impractically-large number of queries. We present a novel Bayesian optimisation-based attack method for graph classification models. Our method is black-box, query-efficient and parsimonious with respect to the perturbation applied. We empirically validate the effectiveness and flexibility of the proposed method on a wide range of graph classification tasks involving varying graph properties, constraints and modes of attack. Finally, we analyse common interpretable patterns behind the adversarial samples produced, which may shed further light on the adversarial robustness of graph classification models.

External datasets

IMDB-M

PROTEINS

COLLAB

REDDIT-MULTI-5K

MNIST-75sp

Twitter dataset