ExpProof : Operationalizing Explanations for Confidential Models with ZKPs

Source

https://arxiv.org/abs/2502.03773

PDF

https://arxiv.org/pdf/2502.03773

Paper Information

Authors: Chhavi Yadav, Evan Monroe Laufer, Dan Boneh, Kamalika Chaudhuri
Published: 2025-02-06
Updated: 2025-05-28
Affiliation: UC San Diego
Country: United States of America
Conference: International Conference on Machine Learning (ICML)

Labels Estimated by AI

XAI (Explainable AI), Interpretability, Model evaluation methods

These labels were automatically added by AI and may be inaccurate.

Abstract

In principle, explanations are intended as a way to increase trust in machine learning models and are often mandated by regulations. However, many circumstances in which explanations are demanded are adversarial in nature: the parties involved have misaligned interests and are incentivized to manipulate explanations for their own purposes. As a result, explainability methods fail to be operational in such settings despite the demand (Bordt et al., 2022). In this paper, we take a step towards operationalizing explanations in adversarial scenarios with Zero-Knowledge Proofs (ZKPs), a cryptographic primitive. Specifically, we explore ZKP-amenable versions of the popular explainability algorithm LIME and evaluate their performance on Neural Networks and Random Forests. Our code is publicly available at https://github.com/emlaufer/ExpProof.

External Datasets

Adult

Credit

German Credit