Abstract: Large language models (LLMs) deliver impressive performance across diverse fields, but their growing complexity raises both design costs and the need for specialized expertise. These challenges are even more pronounced for Neural Architecture Search (NAS) methods that rely on weight-sharing techniques. This paper introduces GNAS, a new NAS method that leverages LLMs to guide the search process toward efficient model discovery. By drawing on insights from existing architectures, GNAS swiftly identifies superior models that can adapt to changing resource constraints. We also provide a mathematical framework that facilitates the transfer of knowledge across different model sizes, further improving search efficiency. Experiments on ImageNet, NAS-Bench-Macro, and ChannelBench-Macro confirm the effectiveness of GNAS on both CNN and Transformer architectures.