AIセキュリティポータル K Program
Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning
Share
Abstract
Predictions made by deep learning models are prone to data perturbations, adversarial attacks, and out-of-distribution inputs. To build a trusted AI system, it is therefore critical to accurately quantify the prediction uncertainties. While current efforts focus on improving uncertainty quantification accuracy and efficiency, there is a need to identify uncertainty sources and take actions to mitigate their effects on predictions. Therefore, we propose to develop explainable and actionable Bayesian deep learning methods to not only perform accurate uncertainty quantification but also explain the uncertainties, identify their sources, and propose strategies to mitigate the uncertainty impacts. Specifically, we introduce a gradient-based uncertainty attribution method to identify the most problematic regions of the input that contribute to the prediction uncertainty. Compared to existing methods, the proposed UA-Backprop has competitive accuracy, relaxed assumptions, and high efficiency. Moreover, we propose an uncertainty mitigation strategy that leverages the attribution results as attention to further improve the model performance. Both qualitative and quantitative evaluations are conducted to demonstrate the effectiveness of our proposed methods.
Gradient-based attribution methods
Marco Ancona, Enea Ceolini, Cengiz Oztireli, Markus Gross
Published: 2019
On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation
Sebastian Bach, Alexander Binder, Gregoire Montavon, Frederick Klauschen, Klaus-Robert Muller, Wojciech Samek
Published: 2015
Scalable uncertainty for computer vision with functional variational inference
Eduardo DC Carvalho, Ronald Clark, Andrea Nicastro, Paul HJ Kelly
Published: 2020
Stochastic gradient hamiltonian monte carlo
Tianqi Chen, Emily B. Fox, Carlos Guestrin
Published: 2014
Real time image saliency for black box classifiers
P. Dabkowski, Y. Gal
Published: 2017
The mnist database of handwritten digit images for machine learning research
Li Deng
Published: 2012
Decomposition of uncertainty in bayesian deep learning for efficient and risk-sensitive learning
Stefan Depeweg, Jose-Miguel Hernandez-Lobato, Finale Doshi-Velez, Steffen Udluft
Published: 2018
Understanding deep networks via extremal perturbations and smooth masks
R. Fong, M. Patrick, A. Vedaldi
Published: 2019
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images
Stuart Geman, Donald Geman
Published: 1984
Monte carlo sampling methods using markov chains and their applications
W Keith Hastings
Published: 1970
Guided integrated gradients: An adaptive path method for removing noise
Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi
Published: 2021
What uncertainties do we need in bayesian deep learning for computer vision?
Kendall, A., Gal, Y.
Published: 2017
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton
Published: 2009
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell
Published: 12.6.2016
Diverse, global and amortised counterfactual explanations for uncertainty estimates
Dan Ley, Umang Bhatt, Adrian Weller
Published: 2022
Multiplicative normalizing flows for variational Bayesian neural networks
Christos Louizos, Max Welling
Published: 2017
A Practical Bayesian Framework for Backpropagation Networks
MacKay, D. J. C.
Published: 1992
A simple baseline for bayesian uncertainty in deep learning
Wesley J Maddox, Pavel Izmailov, Timur Garipov, Dmitry P Vetrov, Andrew Gordon Wilson
Published: 2019
Explaining nonlinear classification decisions with deep taylor decomposition
Gregoire Montavon, Sebastian Lapuschkin, Alexander Binder, Wojciech Samek, Klaus-Robert Muller
Published: 2017
Reading digits in natural images with unsupervised feature learning
Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, Andrew Y Ng
Published: 2011
Robust explainability: A tutorial on gradient-based attribution methods for deep neural networks
Ian E Nielsen, Dimah Dera, Ghulam Rasool, Ravi P Ramachandran, Nidhal Carla Bouaynaya
Published: 2022
Attribution of predictive uncertainties in classification models
I. Perez, P. Skalski, A. Barns-Graham, J. Wong, D. Sutton
Published: 2022
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin
Published: 2.16.2016
Grad-cam: Visual explanations from deep networks via gradient-based localization
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, D. Batra
Published: 2017
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
Karen Simonyan, Andrea Vedaldi, Andrew Zisserman
Published: 12.21.2013
Full-gradient representation for neural network visualization
Suraj Srinivas, François Fleuret
Published: 2019
Axiomatic attribution for deep networks
Sundararajan, M., Taly, A., Yan, Q.
Published: 2017
Attribution in scale and space
Shawn Xu, Subhashini Venugopalan, Mukund Sundararajan
Published: 2020
Mfpp: Morphological fragmental perturbation pyramid for black-box model explanations
Qing Yang, Xia Zhu, Jong-Kae Fwu, Yun Ye, Ganmei You, Yuan Zhu
Published: 2021
Visualizing and Understanding Convolutional Networks
Matthew D Zeiler, Rob Fergus
Published: 11.13.2013
Share