Published: 01 Jan 2023, Last Modified: 09 Feb 2024ICML 2023Readers: Everyone
Abstract:Decoding methods for large language models often trade-off between diversity of outputs and parallelism of computation. Methods such as beam search and Gumbel top-k sampling can guarantee a differe...