Improved Generalization Risk Bounds for Meta-Learning with PAC-Bayes-kl Analysis

29 Sept 2021 (modified: 13 Feb 2023) · ICLR 2022 Conference Withdrawn Submission · Readers: Everyone
Keywords: PAC-Bayes bounds, meta-learning, localized PAC-Bayes analysis
Abstract: By incorporating knowledge from observed tasks, PAC-Bayes meta-learning algorithms aim to construct a hyperposterior from which an informative prior is sampled for fast adaptation to novel tasks. The goal of PAC-Bayes meta-learning theory is thus to establish an upper bound on the generalization risk of the learned hyperposterior on a novel task. In this work, we first generalize the tight PAC-Bayes-kl bound from the independent and identically distributed (i.i.d.) setting to the non-i.i.d. meta-learning setting. Based on the extended PAC-Bayes-kl bound, we further provide three improved PAC-Bayes generalization bounds for meta-learning, leading to better asymptotic behaviour than existing results. By minimizing objective functions derived from the improved bounds, we develop three PAC-Bayes meta-learning algorithms for classification. Moreover, we employ localized PAC-Bayes analysis for meta-learning to yield insights into the role of the hyperposterior in learning a novel task. In particular, we identify that when the number of training tasks is large, a prior generated from an informative hyperposterior achieves a PAC-Bayes-kl bound of the same order as that obtained by setting a localized, distribution-dependent prior for a novel task. Experiments with deep neural networks show that minimizing our bounds achieves competitive performance on novel tasks relative to previous PAC-Bayes meta-learning methods as well as PAC-Bayes single-task learning methods with localized priors.
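For context, a minimal statement of the classical single-task PAC-Bayes-kl bound (Seeger, 2002; Maurer, 2004) that the abstract refers to is sketched below; this is background only, and the paper's non-i.i.d. meta-learning extension is not reproduced here. With probability at least $1-\delta$ over an i.i.d. sample $S$ of size $n$, simultaneously for every posterior $Q$ over hypotheses,

$$\mathrm{kl}\big(\hat{R}_S(Q)\,\big\|\,R(Q)\big) \;\le\; \frac{\mathrm{KL}(Q\,\|\,P) + \ln\!\big(2\sqrt{n}/\delta\big)}{n},$$

where $\hat{R}_S(Q)$ and $R(Q)$ are the empirical and true risks of the Gibbs classifier, $P$ is a data-free prior, and $\mathrm{kl}(q\,\|\,p) = q\ln\frac{q}{p} + (1-q)\ln\frac{1-q}{1-p}$ is the binary KL divergence. Inverting this bound yields tighter risk estimates than the more common $\sqrt{\cdot}$-style relaxations, which is what motivates extending it to the meta-learning setting.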
One-sentence Summary: We provide three improved PAC-Bayes bounds for meta-learning, and demonstrate the ability of meta-learning to adapt to unseen tasks via localized PAC-Bayes analysis.