Understanding Deep Image Representations by Inverting Them

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1412.0035

PDF

https://arxiv.org/pdf/1412.0035

Paper Information

Authors: Aravindh Mahendran, Andrea Vedaldi
Published: 11-27-2014
Affiliation: University of Oxford
Country: United Kingdom
Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Labels Estimated by AI

Deep Learning Method, XAI (Explainable AI), Model Inversion

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Image representations, from SIFT and Bag of Visual Words to Convolutional Neural Networks (CNNs), are a crucial component of almost any image understanding system. Nevertheless, our understanding of them remains limited. In this paper we conduct a direct analysis of the visual information contained in representations by asking the following question: given an encoding of an image, to which extent is it possible to reconstruct the image itself? To answer this question we contribute a general framework to invert representations. We show that this method can invert representations such as HOG and SIFT more accurately than recent alternatives while being applicable to CNNs too. We then use this technique to study the inverse of recent state-of-the-art CNN image representations for the first time. Among our findings, we show that several layers in CNNs retain photographically accurate information about the image, with different degrees of geometric and photometric invariance.
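
The abstract does not spell out the inversion procedure, so the following is only a minimal sketch of the general idea, assuming inversion is posed as regularized reconstruction: search for an input x whose encoding matches a given target code, by gradient descent on a squared reconstruction loss plus a simple penalty. The toy representation (`represent`, a random linear filter bank followed by ReLU), the L2 penalty, and all parameter values (`lam`, `lr`, `steps`) are illustrative assumptions, not the authors' implementation, which targets real representations such as HOG, SIFT, and CNN layers.

```python
import numpy as np

def represent(x, W):
    # Hypothetical toy representation: linear filter bank followed by ReLU.
    # It stands in for a real encoder (HOG, SIFT, a CNN layer).
    return np.maximum(W @ x, 0.0)

def invert(target, W, lam=1e-3, lr=0.05, steps=500, seed=0):
    # Gradient descent on ||represent(x) - target||^2 + lam * ||x||^2.
    rng = np.random.default_rng(seed)
    # Random start: an all-zero start would give a zero gradient through the ReLU.
    x = rng.normal(scale=0.1, size=W.shape[1])
    losses = []
    for _ in range(steps):
        pre = W @ x
        residual = np.maximum(pre, 0.0) - target
        # Record the objective value before taking the step.
        losses.append(float(residual @ residual + lam * (x @ x)))
        # Chain rule through the ReLU: only active units pass gradient.
        grad = 2.0 * W.T @ (residual * (pre > 0)) + 2.0 * lam * x
        x = x - lr * grad
    return x, losses

# Encode a random "image" vector, then try to recover an input from its code alone.
rng = np.random.default_rng(1)
W = rng.normal(size=(32, 16)) / np.sqrt(32)
x_true = rng.normal(size=16)
target = represent(x_true, W)
x_hat, losses = invert(target, W)
print("objective at start:", losses[0], "near end:", losses[-1])
```

Comparing `represent(x_hat, W)` with `target` indicates how much of the code the reconstruction explains. Comparing `x_hat` with `x_true` indicates how much of the input the representation retains, which is the kind of question the abstract poses.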

External Datasets

ImageNet Large Scale Visual Recognition Challenge