Imitaion versus communication: testing for language model

05 Nov 2023 (modified: 26 Jan 2024)PKU 2023 Fall CoRe SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: language model, GPT, Turing Test
Abstract: This essay delves into the limitations of the Turing Test, primarily the anthropocentric flaw that renders it a measure of AI's ability to imitate human behavior rather than its intelligence. It introduces a solution through communication games and tests, offering a more effective evaluation method. Two examples, the AUT test and a Guessing Game with a language model, illustrate this approach. These games evaluate AI on specific criteria, overcoming the anthropocentric bias. By shifting the focus from imitation to effective communication, we move closer to assessing AI's true capabilities. The essay emphasizes the need for dynamic testing methods as AI technology evolves.
Submission Number: 113
Loading