The lack of well-calibrated confidence estimates makes neural networks
inadequate in safety-critical domains such as autonomous driving or healthcare.
In these settings, having the ability to abstain from making a prediction on
out-of-distribution (OOD) data can be as important as correctly classifying
in-distribution data. We introduce $p$-DkNN, a novel inference procedure that
takes a trained deep neural network and analyzes the similarity structures of
its intermediate hidden representations to compute $p$-values associated with
the end-to-end model prediction. The intuition is that statistical tests
performed on latent representations can not only serve as a classifier but
also offer a statistically well-founded estimate of uncertainty. $p$-DkNN is
scalable and leverages the compositional structure of representations learned by hidden
layers, the very property that makes deep representation learning successful. Our theoretical
analysis builds on Neyman-Pearson classification and connects it to recent
advances in selective classification (classification with a reject option). We demonstrate
advantageous trade-offs between abstaining from predicting on OOD inputs and
maintaining high accuracy on in-distribution inputs. We find that $p$-DkNN
forces adaptive attackers crafting adversarial examples, a form of worst-case
OOD inputs, to introduce semantically meaningful changes to the inputs.
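
For concreteness, a minimal sketch of how such layer-wise $p$-values can be obtained, in the spirit of conformal prediction over nearest neighbors in representation space (the particular nonconformity score, the choice of $k$, and the layer aggregation below are illustrative assumptions, not necessarily the exact procedure of $p$-DkNN): given a held-out calibration set $\mathcal{C}$, define the nonconformity of assigning candidate label $j$ to an input $x$ at layer $\ell$ as the number of its $k$ nearest training representations whose label disagrees with $j$,
$$\alpha_\ell(x, j) \;=\; \big|\{\, i \in \mathrm{kNN}_\ell(x) \;:\; y_i \neq j \,\}\big|,$$
and the associated empirical $p$-value as the fraction of calibration points whose own nonconformity (computed with their true labels) is at least as large,
$$p_\ell(x, j) \;=\; \frac{\big|\{\, (x', y') \in \mathcal{C} \;:\; \alpha_\ell(x', y') \,\geq\, \alpha_\ell(x, j) \,\}\big|}{|\mathcal{C}|}.$$
Aggregating these $p$-values across layers, predicting the label with the largest value, and abstaining whenever that value falls below a chosen significance level would then yield a classifier with a statistically grounded reject option, matching the intuition stated above.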