Learning to Minimize Cost to Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce

Pranavi Pathakota, Kunwar Zaid, Anulekha Dhara, Hardik Meisheri, Shaun D'Souza, Dheeraj Shah, Harshad Khadilkar

Published: 01 Jan 2023, Last Modified: 12 May 2023COMAD/CODS 2023Readers: Everyone

Abstract: In the retail industry, electronic commerce (e-commerce) has grown quickly in the last decade and has further accelerated as a result of movement restrictions during the pandemic. While working with logistics and retail industry business collaborators, we found that the cost of delivery of products from the most opportune node in the supply chain (a quantity called the cost-to-serve or CTS) is a key challenge. In this paper, we formally define CTS as a decision-making problem. We then focus on the specific subproblem of delivering multiple products in arbitrary quantities from any warehouse to the customer doorstep. We find that a reinforcement learning (RL) formulation is able to exceed the performance of the state of the art rule based policies, while being significantly faster than traditional optimisation approaches such as mixed-integer linear programming. We hypothesise that scaling up the RL based methodology will have a significant impact on the operating margins of retailers in the ‘new normal’.

0 Replies