Abstract: Highlights•We propose a policy gradient-based meta-learned parametric metric learning scheme (LM-Metric).•We consider the inter-sample relationship during training to solve the non-decomposability problem of AP optimization.•We validate the effectiveness of our method, by applying it to variouis (DML) loss, e.g., Triplet, Margin, and Proxy-anchor losses.
Loading