Accelerating language giants: A survey of optimization strategies for LLM inference on hardware platforms

Young Chan Kim, Seok Kyu Yoon, Sung Soo Han, Chae Won Park, Jun Oh Park, Jun Ha Ko, Hyun Kim

Published: 01 Mar 2026, Last Modified: 26 Jan 2026Journal of Systems ArchitectureEveryoneRevisionsCC BY-SA 4.0
Loading