How Many Layers and Why? An Analysis of the Model Depth in Transformers

Published: 01 Jan 2021; Last Modified: 13 Jun 2023; ACL (student) 2021; Readers: Everyone
Citation: Antoine Simoulin, Benoit Crabbé. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop. 2021.