Reducing Activation Recomputation in Large Transformer Models

Published: 2023, Last Modified: 13 Nov 2024 | MLSys 2023 | CC BY-SA 4.0