Abstract: This work examines the application of deep reinforcement learning to a chemical production scheduling process to account for uncertainty and achieve online, dynamic scheduling, and benchmarks the results against a mixed-integer linear programming (MILP) model that schedules each time interval on a receding horizon basis. An industrial example serves as a case study for comparing the differing approaches. Results show that the reinforcement learning method outperforms the naive MILP approaches and is competitive with a shrinking horizon MILP approach in terms of profitability, inventory levels, and customer service. The speed and flexibility of the reinforcement learning system are promising for achieving real-time optimization of a scheduling system, but there is reason to pursue the integration of data-driven deep reinforcement learning methods with model-based mathematical optimization approaches.