The robust generalization of models to rare, in-distribution (ID) samples
drawn from the long tail of the training distribution and to
out-of-training-distribution (OOD) samples is one of the major challenges of
current deep learning methods. For image classification, this manifests in the
existence of adversarial attacks, the performance drops on distorted images,
and a lack of generalization to concepts such as sketches. The current
understanding of generalization in neural networks is very limited, but some
biases that differentiate models from human vision have been identified and
might be causing these limitations. Consequently, several attempts with varying
success have been made to reduce these biases during training to improve
generalization. We take a step back and sanity-check these attempts. Fixing the
architecture to the well-established ResNet-50, we perform a large-scale study
on 48 ImageNet models obtained via different training methods to understand how
and if these biases - including shape bias, spectral biases, and critical bands
- interact with generalization. Our extensive study results reveal that
contrary to previous findings, these biases are insufficient to accurately
predict the generalization of a model holistically. We provide access to all
checkpoints and evaluation code at
https://github.com/paulgavrikov/biases_vs_generalization
外部データセット
cue-conflict dataset
ImageNet
ImageNet v2
ImageNet-ReaL
ImageNet-C
ImageNet-¯C
ImageNet-A
ImageNet-Renditions
ImageNet-Sketch
Stylized ImageNet
参考文献
IEEE Conference on Computer Vision and Pattern Recognition Workshops
Dissecting the high-frequency bias in convolutional neural networks
Antonio A. Abello, Roberto Hirata, Zhangyang Wang
Published: 2021
Are we done with imagenet?
Lucas Beyer, Olivier J. Henaff, Alexander Kolesnikov, Xiaohua Zhai, Aaron van den Oord
Published: 2020
Machine Learning and Knowledge Discovery in Databases
Evasion attacks against machine learning at test time
Battista Biggio, Igino Corona, Davide Maiorca, Blaine Nelson, Nedim Srndic, Pavel Laskov, Giorgio Giacinto, Fabio Roli
Proceedings of the International Conference on Computer Vision (ICCV)
Emerging properties in self-supervised vision transformers
Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin
Published: 2021
Advances in Neural Information Processing Systems
Big self-supervised models are strong semi-supervised learners
Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, Geoffrey E Hinton
Published: 2020
Proceedings of the IEEE/CVF International Conference on Computer Vision
An empirical study of training self-supervised vision transformers
Chen, X., Xie, S., He, K.
Published: 2021
International Conference on Learning Representations
When vision transformers outperform resnets without pre-training or strong data augmentations
Xiangning Chen, Cho-Jui Hsieh, Boqing Gong
Published: 2022
International Conference on Machine Learning
Scaling vision transformers to 22 billion parameters
Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Peter Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin