On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in TransformersDownload PDFOpen Website

2021 (modified: 04 Feb 2022)ACL/IJCNLP (Findings) 2021Readers: Everyone
Abstract: Tianchu Ji, Shraddhan Jain, Michael Ferdman, Peter Milder, H. Andrew Schwartz, Niranjan Balasubramanian. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 2021.
0 Replies

Loading