An Adaptive Empirical Bayesian Method for Sparse Deep Learning

TOP 文献データベース An Adaptive Empirical Bayesian Method for Sparse Deep Learning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1910.10791

PDF

https://arxiv.org/pdf/1910.10791

文献情報

作者: Wei Deng,Xiao Zhang,Faming Liang,Guang Lin
公開日: 2019-10-24
更新日: 2020-4-14
所属機関: Department of Mathematics, Purdue University
所属の国: United States of America
会議名

AIにより推定されたラベル

最適化戦略収束保証深層学習技術

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

We propose a novel adaptive empirical Bayesian method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternatively sampling from an adaptive hierarchical posterior distribution using stochastic gradient Markov Chain Monte Carlo (MCMC) and smoothly optimizing the hyperparameters using stochastic approximation (SA). We further prove the convergence of the proposed method to the asymptotically correct distribution under mild conditions. Empirical applications of the proposed method lead to the state-of-the-art performance on MNIST and Fashion MNIST with shallow convolutional neural networks and the state-of-the-art compression performance on CIFAR10 with Residual Networks. The proposed method also improves resistance to adversarial attacks.