RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning

Xiyao Ma; Fan Lu; Xiajun Amy Pan; Yanlin Zhou; Xiaolin Andy Li

RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning

Xiyao Ma, Fan Lu, Xiajun Amy Pan, Yanlin Zhou, Xiaolin Andy Li

04 May 2019 (modified: 13 Jul 2022)RL4RealLife 2019Readers: Everyone

Abstract: We propose RetailNet, an end-to-end reinforcement learning (RL)-based neural network, to achieve efficient selling strategies for perishable products in order to maximize retailers’ long-term profit. We design Pair-wise Multi-Q network for Q value estimation to model each state-action pair and to capture the interdependence between actions. Generalized Advantage Estimation (GAE)and Entropy are incorporated into the loss function for balancing the tradeoff between exploitation and exploration. Experiments show that Re-tailNet efficiently produces the near-optimal solution, providing practitioners valuable guidance on their inventory replenishment, pricing, and products display strategies in the retailing industry.

0 Replies

Loading