Large Batch Optimization for Deep Learning: Training BERT in 76 minutesDownload PDF

25 Sept 2019, 19:13 (edited 11 Mar 2020, 07:34)ICLR 2020 Conference Blind SubmissionReaders: Everyone
Original Pdf: pdf
Code:
TL;DR:
Abstract:
Keywords:
10 Replies

Loading