Abstract
Bayesian neural networks (BNNs) have drawn extensive interest due to the unique probabilistic representation framework. However, Bayesian neural networks have limited publicized deployments because of the relatively poor model performance in real-world applications. In this paper, we argue that the randomness of sampling in Bayesian neural networks causes errors in the updating of model parameters during training and poor performance of some sampled models in testing. To solve this, we propose to train Bayesian neural networks with Adversarial Distribution as a theoretical solution. To avoid the difficulty of calculating Adversarial Distribution analytically, we further present the Adversarial Sampling method as an approximation in practice. We conduct extensive experiments with multiple network structures on different datasets, e.g., CIFAR-10 and CIFAR-100. Experimental results validate the correctness of the theoretical analysis and the effectiveness of the Adversarial Sampling on improving model performance. Additionally, models trained with Adversarial Sampling still keep their ability to model uncertainties and perform better when predictions are retained according to the uncertainties, which further verifies the generality of the Adversarial Sampling approach.
Original language | English |
---|---|
Title of host publication | Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence: AAAI-22 Technical Tracks 9 |
Publisher | Association for the Advancement of Artificial Intelligence (AAAI) |
Pages | 10110-10117 |
Number of pages | 8 |
ISBN (Electronic) | 1577358767, 9781577358763 |
DOIs | |
Publication status | Published - 28 Jun 2022 |
Event | 36th AAAI Conference on Artificial Intelligence, AAAI 2022 - Virtual, Online Duration: 22 Feb 2022 → 01 Mar 2022 |
Publication series
Name | Proceedings of the 36th AAAI Conference on Artificial Intelligence |
---|---|
Number | 9 |
Volume | 36 |
ISSN (Print) | 2159-5399 |
ISSN (Electronic) | 2374-3468 |
Conference
Conference | 36th AAAI Conference on Artificial Intelligence, AAAI 2022 |
---|---|
City | Virtual, Online |
Period | 22/02/2022 → 01/03/2022 |
Bibliographical note
Funding Information:This work was supported in part by National NSF of China (NO. 61872234, 61732010), Shanghai Key Laboratory of Scalable Computing and Systems. Ruhui Ma is the corresponding author.
Publisher Copyright:
Copyright © 2022, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
ASJC Scopus subject areas
- Artificial Intelligence