Large Batch Optimization for Deep Learning: Training BERT in 76 minutes

Yang You, Jing Li, Sashank Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, James Demmel, Kurt Keutzer, Cho-Jui Hsieh

23 Sept 2020 (modified: 05 May 2023), ICLR 2020, Readers: Everyone