Thorough Characterization and Analysis of Large Transformer Model Training At-Scale

Scott Cheng, Jun-Liang Lin, Murali Emani, Siddhisanket Raskar, Sam Foreman, Zhen Xie, Venkatram Vishwanath, Mahmut T. Kandemir

Published: 11 Jun 2024, Last Modified: 16 Oct 2025CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading