Reinforcement Learning-Based Black-Box Model Inversion Attacks

Proceedings of the 29th Network and Distributed System Security Symposium

Mirror: Model inversion for deep learning network with high fidelity

S. An, G. Tao, Q. Xu, Y. Liu, G. Shen, Y. Yao, J. Xu, X. Zhang

Published: 2022

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

Rethinking the truly unsupervised image-to-image translation

Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim

Published: 2021

Journal of Artificial Intelligence Research

The arcade learning environment: An evaluation platform for general agents

Marc G Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling

Published: 2013

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

Knowledge-enriched distributional model inversion attacks

Si Chen, Mostafa Kahla, Ruoxi Jia, Guo-Jun Qi

Published: 2021

Proceedings of the IEEE International Conference on Computer Vision Workshops

Know you at one glance: A compact vector representation for low-shot learning

Yu Cheng, Jian Zhao, Zhecan Wang, Yan Xu, Karlekar Jayashree, Shengmei Shen, Jiashi Feng

Published: 2017

Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security

Model inversion attacks that exploit confidence information and basic countermeasures

Matt Fredrikson, Somesh Jha, Thomas Ristenpart

Published: 2015

USENIX Conference on Security Symposium

Privacy in pharmacogenetics: An end-to-end case study of personalized warfarin dosing

Matthew Fredrikson, Eric Lantz, Somesh Jha, Simon Lin, David Page, Thomas Ristenpart

Published: 2014

Proceedings of the 35th International Conference on Machine Learning

Addressing function approximation error in actor-critic methods

Scott Fujimoto, Herke van Hoof, David Meger

Published: 2018

Advances in Neural Information Processing Systems

Generative adversarial nets

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio

Published: 2014

European Conference on Computer Vision

Ms-celeb-1m: A dataset and benchmark for large-scale face recognition

Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao

Published: 2016

Proceedings of the 35th International Conference on Machine Learning

Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor

Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine

Published: 2018

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep residual learning for image recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun

Published: 2016

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Label-only model inversion attacks via boundary repulsion

Mostafa Kahla, Si Chen, Hoang Anh Just, Ruoxi Jia

Published: 2022

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

A style-based generator architecture for generative adversarial networks

T. Karras, S. Laine, T. Aila

Published: 2019

International Conference on Learning Representations (ICLR)

Continuous control with deep reinforcement learning

Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra

Published: 2016

Proceedings of International Conference on Computer Vision (ICCV)

Deep Learning Face Attributes in the Wild

Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang

Published: 2015

Nature

Human-level control through deep reinforcement learning

V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski

Published: 2015

2014 IEEE International Conference on Image Processing (ICIP)

A data-driven approach to cleaning large face datasets

Hong-Wei Ng, Stefan Winkler

Published: 2014

CVPR 2011 Workshops

Scaling up biologically-inspired computer vision: A case study in unconstrained face recognition on facebook

Nicolas Pinto, Zak Stone, Todd Zickler, David Cox

Published: 2011

A survey of privacy attacks in machine learning

Maria Rigaki, Sebastian Garcia

Published: 2020

arXiv

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning

Ahmed Salem, Apratim Bhattacharya, Michael Backes, Mario Fritz, Yang Zhang

Published: April 2, 2019

Machine learning (ML) has progressed rapidly during the past decade and the major factor that drives such development is the unprecedented large-scale data. As data generation is a continuous process, this leads to ML model owners updating their models frequently with newly-collected data in an online learning scenario. In consequence, if an ML model is queried with the same set of data samples at two different points in time, it will provide different results. In this paper, we investigate whether the change in the output of a black-box ML model before and after being updated can leak information of the dataset used to perform the update, namely the updating set. This constitutes a new attack surface against black-box ML models and such information leakage may compromise the intellectual property and data privacy of the ML model owner. We propose four attacks following an encoder-decoder formulation, which allows inferring diverse information of the updating set. Our new attacks are facilitated by state-of-the-art deep learning techniques. In particular, we propose a hybrid generative model (CBM-GAN) that is based on generative adversarial networks (GANs) but includes a reconstructive loss that allows reconstructing accurate samples. Our experiments show that the proposed attacks achieve strong performance.

Proceedings of the 31st International Conference on Machine Learning

Deterministic policy gradient algorithms

David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra, Martin Riedmiller

Published: 2014

3rd International Conference on Learning Representations

Very deep convolutional networks for large-scale image recognition

K. Simonyan, A. Zisserman

Published: 2015

Advances in Neural Information Processing Systems

Variational model inversion attacks

Kuan-Chieh Wang, Yan Fu, Ke Li, Ashish Khisti, Richard Zemel, Alireza Makhzani

Published: 2021

Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security

Neural network inversion in adversarial setting via background knowledge alignment

Z. Yang, J. Zhang, E.-C. Chang, Z. Liang

Published: 2019

arXiv

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

Yuheng Zhang, Ruoxi Jia, Hengzhi Pei, Wenxiao Wang, Bo Li, Dawn Song

Published: November 17, 2019

This paper studies model-inversion attacks, in which the access to a model is abused to infer information about the training data. Since its first introduction, such attacks have raised serious concerns given that training data usually contain privacy-sensitive information. Thus far, successful model-inversion attacks have only been demonstrated on simple models, such as linear regression and logistic regression. Previous attempts to invert neural networks, even the ones with simple architectures, have failed to produce convincing results. We present a novel attack method, termed the generative model-inversion attack, which can invert deep neural networks with high success rates. Rather than reconstructing private training data from scratch, we leverage partial public information, which can be very generic, to learn a distributional prior via generative adversarial networks (GANs) and use it to guide the inversion process. Moreover, we theoretically prove that a model's predictive power and its vulnerability to inversion attacks are indeed two sides of the same coin: highly predictive models are able to establish a strong correlation between features and labels, which coincides exactly with what an adversary exploits to mount the attacks. Our extensive experiments demonstrate that the proposed attack improves identification accuracy over the existing work by about 75% for reconstructing face images from a state-of-the-art face recognition classifier. We also show that differential privacy, in its canonical form, is of little avail to defend against our attacks.
