Mitigating Quantization Errors Due to Activation Spikes in Gated Linear Unit-Based Large Language Models.

Jaewoo Yang, Hayun Kim, Junyung Ji, Younghoon Kim

04 Aug 2025, Future Internet 2025, CC BY-SA 4.0