XAI(説明可能なAI)

Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers

Authors: Lam Nguyen Tung, Steven Cho, Xiaoning Du, Neelofar Neelofar, Valerio Terragni, Stefano Ruberto, Aldeida Aleti | Published: 2024-10-30 | Updated: 2025-04-08
XAI(説明可能なAI)
モデル性能評価
信頼性分析

X-CBA: Explainability Aided CatBoosted Anomal-E for Intrusion Detection System

Authors: Kiymet Kaya, Elif Ak, Sumeyye Bas, Berk Canberk, Sule Gunduz Oguducu | Published: 2024-02-01 | Updated: 2024-06-02
GNN
XAI(説明可能なAI)
侵入検知システム

X Hacking: The Threat of Misguided AutoML

Authors: Rahul Sharma, Sergey Redyuk, Sumantrak Mukherjee, Andrea Sipka, Sebastian Vollmer, David Selby | Published: 2024-01-16 | Updated: 2024-02-12
XAI(説明可能なAI)
バイアス
モデルの解釈性

Autonomous Threat Hunting: A Future Paradigm for AI-Driven Threat Intelligence

Authors: Siva Raja Sindiramutty | Published: 2023-12-30
AIと自動化の役割
XAI(説明可能なAI)
サイバーセキュリティ

Classification and Explanation of Distributed Denial-of-Service (DDoS) Attack Detection using Machine Learning and Shapley Additive Explanation (SHAP) Methods

Authors: Yuanyuan Wei, Julian Jang-Jaccard, Amardeep Singh, Fariza Sabrina, Seyit Camtepe | Published: 2023-06-27
XAI(説明可能なAI)
ネットワーク脅威検出
マルウェア分類

A Survey on Explainable Artificial Intelligence for Cybersecurity

Authors: Gaith Rjoub, Jamal Bentahar, Omar Abdel Wahab, Rabeb Mizouni, Alyssa Song, Robin Cohen, Hadi Otrok, Azzam Mourad | Published: 2023-03-07 | Updated: 2023-06-11
XAI(説明可能なAI)
サイバーセキュリティ
説明可能性

“Is your explanation stable?”: A Robustness Evaluation Framework for Feature Attribution

Authors: Yuyou Gan, Yuhao Mao, Xuhong Zhang, Shouling Ji, Yuwen Pu, Meng Han, Jianwei Yin, Ting Wang | Published: 2022-09-05
XAI(説明可能なAI)
ロバストな説明可能性
ロバスト分類

On Robust Prefix-Tuning for Text Classification

Authors: Zonghan Yang, Yang Liu | Published: 2022-03-19
XAI(説明可能なAI)
トレードオフ分析
パラメータ調整

Exploiting Explanations for Model Inversion Attacks

Authors: Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Y. Lim | Published: 2021-04-26 | Updated: 2022-03-14
XAI(説明可能なAI)
プライバシー手法
モデルインバージョン

Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods

Authors: Dylan Slack, Sophie Hilgard, Emily Jia, Sameer Singh, Himabindu Lakkaraju | Published: 2019-11-06 | Updated: 2020-02-03
XAI(説明可能なAI)
敵対的学習
説明可能性に対する攻撃