Published: 2021, Last Modified: 11 May 2023ICML 2021Readers: Everyone
Abstract:In this paper, we introduce a two-level attention schema, Poolingformer, for long document modeling. Its first level uses a smaller sliding window pattern to aggregate information from neighbors. I...