2022 (modified: 03 Feb 2023)ICML 2022Readers: Everyone
Abstract:Training large neural network (NN) models requires extensive memory resources, and Activation Compression Training (ACT) is a promising approach to reduce training memory footprint. This paper pres...