REINFORCE with Bound-guided Gradient Estimator for the traveling salesman problem toward scale generalization

Haopeng Duan, Kaiming Xiao, Lihua Liu, Haiwen Chen, Hongbin Huang

Published: 2025, Last Modified: 26 Jul 2025Eng. Appl. Artif. Intell. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•RIDGE introduced: knowledge-inspired REINFORCE for TSP with BHH Theorem.•RIDGE uses sliding average shortest path as adaptive baseline for stability.•RIDGE tops small-scale TSP accuracy, boosts large-scale generalization.•Smaller training sets enhance model generalization.