Human-likeness of LLMs in the Mental Lexicon

Bei Xiao; Xufeng Duan; David A. Haslett; Zhenguang Cai

Human-likeness of LLMs in the Mental Lexicon

Bei Xiao, Xufeng Duan, David A. Haslett, Zhenguang Cai

Published: 24 May 2025, Last Modified: 18 Jun 2025CoNLL 2025EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Large language models; mental lexicon; word association; machine behavior; psycholinguistics

TL;DR: The study examines whether LLMs mirrors human lexical organization by comparing models and humans through a word association task, a widely used psycholinguistics method for probing mental lexical structures.

Abstract: Recent research has increasingly focused on the extent to which large language models (LLMs) exhibit human-like behavior. In this study, we investigate whether the mental lexicon in LLMs resembles that of humans in terms of lexical organization. Using a word association task—a direct and widely used method for probing word meaning and relationships in the human mind—we evaluated the lexical representations of GPT-4 and Llama-3.1. Our findings reveal that LLMs closely emulate human mental lexicons in capturing semantic relatedness but exhibit notable differences in other properties, such as association frequency and dominant lexical patterns (e.g., top associates). Specifically, LLM lexicons demonstrate greater clustering and reduced diversity compared to the human lexicon, with KL divergence analysis confirming significant deviations in word association patterns. Additionally, LLMs fail to fully capture word association response patterns in different demographic human groups. Among the models, GPT-4 consistently exhibited a slightly higher degree of human-likeness than Llama-3.1. This study highlights both the potential and limitations of LLMs in replicating human mental lexicons, offering valuable insights for applications in natural language processing and cognitive science research involving LLMs.

Supplementary Material: zip

Copyright Agreement: pdf

Submission Number: 213

Loading