Can We Trust Your Explanations? Sanity Checks for Interpreters in Android Malware Analysis

TOP 文献データベース Can We Trust Your Explanations? Sanity Checks for Interpreters in Android Malware Analysis

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2008.05895

PDF

https://arxiv.org/pdf/2008.05895

文献情報

作者: Ming Fan,Wenying Wei,Xiaofei Xie,Yang Liu,Xiaohong Guan,Ting Liu
公開日: 2025-3-25
所属機関: MOEKLINNS Lab, Department of Computer Science and Technology, Xi’an Jiaotong University
所属の国: China
会議名

AIにより推定されたラベル

ポイズニング説明アプローチの評価

Abstract

With the rapid growth of Android malware, many machine learning-based malware analysis approaches are proposed to mitigate the severe phenomenon. However, such classifiers are opaque, non-intuitive, and difficult for analysts to understand the inner decision reason. For this reason, a variety of explanation approaches are proposed to interpret predictions by providing important features. Unfortunately, the explanation results obtained in the malware analysis domain cannot achieve a consensus in general, which makes the analysts confused about whether they can trust such results. In this work, we propose principled guidelines to assess the quality of five explanation approaches by designing three critical quantitative metrics to measure their stability, robustness, and effectiveness. Furthermore, we collect five widely-used malware datasets and apply the explanation approaches on them in two tasks, including malware detection and familial identification. Based on the generated explanation results, we conduct a sanity check of such explanation approaches in terms of the three metrics. The results demonstrate that our metrics can assess the explanation approaches and help us obtain the knowledge of most typical malicious behaviors for malware analysis.