Network Convergence Indicator for Efficient Robot Grasping Pose Detection Under Limited Computation Resource

Published: 01 Jan 2024, Last Modified: 16 May 2025IEEE Trans. Instrum. Meas. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Industrial edge computing devices typically have limited computational performance. Current high-precision grasping pose detection models are often over-parameterized, resulting in computational inefficiency and making it difficult to deploy on industrial robots. Existing efficient network design methods rely on constructing, training, and validating a large number of new networks to obtain the final target network, which also requires significant computing resources. To address this issue, the network convergence indicator (NCI) is proposed to realize the prediction of network accuracy before training. NCI can evaluate convergence based on the evasion of flat regions and saddle points theory. It is used to control the width and depth of the network model and optimize its inner layers. Based on these principles, NCIGrasp is proposed, which is a high-precision lightweight grasping network achieving outstanding performance on the Cornell and Jacquard datasets with only 10% of the parameters of previous advanced models. NCIGrasp has a small size of 0.69 MB and fast inference speed on the server (~8 ms), making it suitable for edge computing. Experimental results show that NCIGrasp has real-time grasping pose detection ability (~21 ms) for unknown objects on an ordinary laptop with limited computation capability.
Loading