Generalization error for portable rewards in transfer imitation learning

Published: 01 Jan 2024, Last Modified: 10 Aug 2024Knowl. Based Syst. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Present generalization error bound for the reward transfer paradigm in TIL.•Evaluate transfer effects and propose alternative reward transfer plans.•Equate minimizing optimizable training error to maximizing RL objective in target.•Apply our main results to evaluate diverse possible transfer effects.
Loading