Input gradients have a pivotal role in a variety of applications, including
adversarial attack algorithms for evaluating model robustness, explainable AI
techniques for generating Saliency Maps, and counterfactual
explanations. However, Saliency Maps generated by traditional neural networks
are often noisy and provide limited insights. In this paper, we demonstrate
that, on the contrary, the Saliency Maps of 1-Lipschitz neural networks,
learned with the dual loss of an optimal transportation problem, exhibit
desirable XAI properties: they are highly concentrated on the essential parts of
the image with low noise, significantly outperforming state-of-the-art
explanation approaches across various models and metrics. We also prove that
these maps align unprecedentedly well with human explanations on ImageNet. To
explain the particularly beneficial properties of the Saliency Map for such
models, we prove this gradient encodes both the direction of the transportation
plan and the direction towards the nearest adversarial attack. Following the
gradient down to the decision boundary is no longer considered an adversarial
attack, but rather a counterfactual explanation that explicitly transports the
input from one class to another. Thus, learning with such a loss jointly
optimizes the classification objective and the alignment of the gradient, i.e.,
the Saliency Map, with the transportation plan direction. These networks were
previously known to be certifiably robust by design, and we demonstrate that
they scale well to large problems and models, and that their explanations can be
obtained with a fast and straightforward method.
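The link between the input gradient, the decision boundary, and the counterfactual can be illustrated with a minimal linear toy model (a purely illustrative sketch, not the paper's deep 1-Lipschitz architecture): for f(x) = w·x with ‖w‖ = 1, the saliency map is the constant gradient w, and stepping along −f(x)·w lands exactly on the decision boundary, at a distance equal to the logit magnitude.

```python
import numpy as np

# Toy 1-Lipschitz "classifier": f(x) = w @ x with ||w|| = 1.
# (Hypothetical illustration; the paper's models are deep 1-Lipschitz networks.)
rng = np.random.default_rng(0)
w = rng.normal(size=8)
w /= np.linalg.norm(w)       # unit norm makes f exactly 1-Lipschitz

def f(x):
    return w @ x             # signed logit; sign(f) is the predicted class

def saliency(x):
    return w                 # input gradient of f, constant for a linear model

x = rng.normal(size=8)
g = saliency(x)

# Following the gradient down by |f(x)| reaches the decision boundary:
x_cf = x - f(x) * g          # counterfactual: nearest point of the other class
assert abs(f(x_cf)) < 1e-12

# For a 1-Lipschitz f, the distance travelled equals the logit magnitude,
# i.e. the certified robustness radius:
assert np.isclose(np.linalg.norm(x - x_cf), abs(f(x)))
```

Here the gradient direction doubles as the transport direction between the two classes, which is the geometric property the abstract attributes to networks trained with the dual optimal-transport loss.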
External datasets
ClickMe
FashionMNIST
CelebA
Cat vs Dog
ImageNet