DEFCON: Deformable Convolutions Leveraging Interval Search and GPU Texture Hardware

Malith Jayaweera; Yanyu Li; Yanzhi Wang; Bin Ren; David R. Kaeli

DEFCON: Deformable Convolutions Leveraging Interval Search and GPU Texture Hardware

Malith Jayaweera, Yanyu Li, Yanzhi Wang, Bin Ren, David R. Kaeli

Published: 01 Jan 2024, Last Modified: 30 Sept 2024IPDPS 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Deformable convolutions can improve detection accuracy in Convolution Neural Networks (CNNs) by leveraging flexible spatial sampling in augmenting kernels with learnable offsets. However, the resulting irregular memory access patterns and additional pixel lookup overhead introduced by deformable layers pose inherent challenges when executed on high-throughput devices such as GPUs. To address these challenges, we introduce DEFCON, a systematic approach to optimizing deformable convolutions. DEFCON is designed to provide: (1) better placement of operators in the neural architecture using interval search, (2) reduced computational demands by leveraging lightweight operators, and (3) optimized inference by using GPU texture hardware. By performing an interval search, we reduce the number of deformable layers in our architecture. By leveraging the GPU’s texture hardware, we are able to use lightweight operators to improve the execution performance of layers, without sacrificing prediction accuracy. By combining these approaches, DEFCON increases the inference performance by 2.8× over YOLACT++ implementation, when run on an NVIDIA Jetson AGX Xavier GPU. Our work enables faster and more accurate predictions when performing deformable convolutions.

Loading