Accelerating language giants: A survey of optimization strategies for LLM inference on hardware platforms

Young Chan Kim, Seok Kyu Yoon, Sung Soo Han, Chae Won Park, Jun Oh Park, Jun Ha Ko, Hyun Kim

Published: 01 Mar 2026, Last Modified: 26 Jan 2026, Journal of Systems Architecture, CC BY-SA 4.0

External IDs: doi:10.1016/j.sysarc.2026.103690