Abstract
When quantitative models are used to support decision-making on complex and
important topics, understanding a model's ``reasoning'' can increase trust in
its predictions, expose hidden biases, or reduce vulnerability to adversarial
attacks. However, the concept of interpretability remains loosely defined and
application-specific. In this paper, we introduce a mathematical framework in
which machine learning models are constructed in a sequence of interpretable
steps. We show that for a variety of models, a natural choice of interpretable
steps recovers standard interpretability proxies (e.g., sparsity in linear
models). We then generalize these proxies to yield a parametrized family of
consistent measures of model interpretability. This formal definition allows us
to quantify the ``price'' of interpretability, i.e., the tradeoff with
predictive accuracy. We demonstrate practical algorithms for applying our
framework to real and synthetic datasets.
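As a rough illustration of the sparsity proxy and the accuracy tradeoff the abstract describes, the sketch below sweeps a Lasso penalty on synthetic data and reports model sparsity against held-out accuracy. This is an assumed setup for illustration only, not the paper's framework of interpretable steps; all parameter choices (sample sizes, penalty grid) are arbitrary.

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.model_selection import train_test_split

# Illustrative sketch only: sparsity in a linear model is the standard
# interpretability proxy the abstract mentions; the Lasso sweep below is
# a stand-in for the paper's framework, not its actual method.

rng = np.random.default_rng(0)
n, p, k = 200, 50, 5  # samples, features, true nonzeros (arbitrary choices)

X = rng.normal(size=(n, p))
beta = np.zeros(p)
beta[:k] = rng.normal(size=k)            # sparse ground-truth coefficients
y = X @ beta + 0.5 * rng.normal(size=n)  # noisy linear response

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Sweep the regularization strength: stronger penalties yield sparser
# (more "interpretable") models, typically at some cost in accuracy --
# a crude picture of the "price" of interpretability.
for alpha in [0.01, 0.05, 0.1, 0.5, 1.0]:
    model = Lasso(alpha=alpha).fit(X_tr, y_tr)
    nnz = int(np.sum(model.coef_ != 0))  # sparsity proxy for interpretability
    r2 = model.score(X_te, y_te)         # held-out predictive accuracy
    print(f"alpha={alpha:<5} nonzeros={nnz:<3} test R^2={r2:.3f}")
```

Plotting held-out accuracy against the sparsity level traced out by such a sweep gives one concrete (if simplistic) view of the interpretability-accuracy frontier the abstract refers to.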