Cost effective transfer of reinforcement learning policies

Orel Lavie, Asaf Shabtai, Gilad Katz

Published: 2024, Last Modified: 08 Aug 2024. Expert Syst. Appl. 2024. CC BY-SA 4.0

Abstract: Highlights
• A novel approach for configuring the reward function of DRL-based models.
• Our approach enables us to define desired values for various metrics (TPR, FPR, etc.).
• The DRL model automatically adapts its behavior to reach the desired metric values.
• A process for “transferring” effective policies from one domain to another.
• Our proposed approach is highly robust against adaptive adversarial attacks.