Optimistic Rates for Multi-Task Representation Learning

Austin Watkins; Enayat Ullah; Thanh Nguyen-Tang; Raman Arora

Published: 21 Sept 2023, Last Modified: 15 Jan 2024, Venue: NeurIPS 2023 poster

Keywords: Learning Theory, Multi-task and Transfer Learning, Classification

TL;DR: We show excess risk bounds for multi-task representation learning that are fast under near-realizability.

Abstract: We study the problem of transfer learning via Multi-Task Representation Learning (MTRL), wherein multiple source tasks are used to learn a good common representation, and a predictor is trained on top of it for the target task. Under standard regularity assumptions on the loss function and task diversity, we provide new statistical rates on the excess risk of the target task, which demonstrate the benefit of representation learning. Importantly, our rates are optimistic, i.e., they interpolate between the standard $O(m^{-1/2})$ rate and the fast $O(m^{-1})$ rate, depending on the difficulty of the learning task, where $m$ is the number of samples for the target task. Besides the main result, we make several new contributions, including giving optimistic rates for excess risk of source tasks (multi-task learning (MTL)), a local Rademacher complexity theorem for MTRL and MTL, as well as a chain rule for local Rademacher complexity for composite predictor classes.
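
As a schematic illustration of what "optimistic" means in the abstract, bounds of this kind typically take the shape below. This is a generic sketch, not the theorem stated in the paper: $L^*$ (the best achievable risk on the target task) and $C$ (a placeholder for the complexity terms) are assumed symbols introduced only for illustration, and constants and logarithmic factors are omitted.

```latex
% Generic shape of an optimistic excess-risk bound (illustrative only).
% L^*: best achievable target risk (assumed symbol)
% C:   placeholder for complexity terms (assumed symbol)
% m:   number of target-task samples (as in the abstract)
\[
  \text{excess risk} \;\lesssim\; \sqrt{\frac{L^{*}\, C}{m}} \;+\; \frac{C}{m}
\]
```

When $L^*$ is bounded away from zero, the first term dominates and the bound decays as $O(m^{-1/2})$; when $L^* \approx 0$ (the near-realizable case), the first term vanishes and the bound decays as $O(m^{-1})$.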

Supplementary Material: pdf

Submission Number: 9057
