Model extraction attacks have become a serious issue for service providers
offering machine learning services. We consider an adversarial setting for
preventing model extraction under the assumption that an attacker constructs
a best-effort approximation of the service provider's model through query
access, and propose building a surrogate model that keeps the predictions of
the attacker's model far from those of the true model. We formulate the problem as a non-convex
constrained bilevel optimization problem and show that, for kernel models, it
can be transformed into a non-convex 1-quadratically constrained quadratic
program that admits a polynomial-time algorithm for finding the global
optimum. Moreover, we give a tractable transformation and an algorithm for
more complex models trained with stochastic gradient descent-based algorithms.
Numerical experiments show that the surrogate model performs well compared
with existing defense models when the gap between the attacker's and the
service provider's data distributions is large. We also empirically confirm the
generalization ability of the surrogate model.
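The bilevel structure described above can be sketched generically as follows; the symbols (f for the true model, g for the deployed surrogate, h*(g) for the attacker's best response, d for a prediction-distance measure, Q for the attacker's query distribution, and ε for the allowed accuracy degradation) are illustrative assumptions, not notation taken from the paper:

```latex
% Illustrative sketch only -- symbols are assumptions, not the paper's notation.
% Outer problem: push the attacker's extracted model h^*(g) away from the
% true model f, while the surrogate g stays nearly as accurate as f.
% Inner problem: the attacker fits h to the surrogate's outputs on queries.
\begin{align*}
\max_{g}\quad & d\bigl(h^{*}(g),\, f\bigr) \\
\text{s.t.}\quad & \mathrm{acc}(g) \ge \mathrm{acc}(f) - \epsilon, \\
& h^{*}(g) \in \operatorname*{arg\,min}_{h}\;
    \mathbb{E}_{x \sim Q}\,\ell\bigl(h(x),\, g(x)\bigr).
\end{align*}
```

Under this reading, the kernel-model case corresponds to the setting where the inner problem has a closed-form minimizer, which is what allows the reduction to a 1-quadratically constrained quadratic program.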