\documentclass{turing2012}
\usepackage{times}
\usepackage{graphicx}
\usepackage{latexsym}
\usepackage{url}

\begin{document}

\title{What Do We Mean When We Say ``Anthropomorphism''?}

\author{Dorian Liu\institute{School of Philosophy, Psychology and Language Sciences, University of Edinburgh, email: zhaoting.liu6@gmail.com} \and Cathy Li\institute{School of Mathematics, University of Edinburgh}}

\maketitle
\bibliographystyle{AISB}

\begin{abstract}
``That is just anthropomorphism'' is among the most frequent rebuttals in
current debates about machine minds, and it is standardly used to end the
discussion. We ask what the charge asserts, and when it is legitimate. Using
Clever Hans as a running example, we analyse an attribution of mind into four
elements: a subject S, an attributed property P, the conditions C that
P requires, and the indicator f on which the attributor relies. A
legitimate charge of anthropomorphism, we argue, requires the accuser to
produce a trivial competing explanation of the indicator. We then identify
three ways its prominent uses fail to do so. Finally, we argue that only with
the charge set aside can we see what these disputes are really about.
\end{abstract}

\section{Introduction}

``That is just anthropomorphism.'' In debates over whether machines have minds,
this is among the most common replies, and it is normally meant to settle them;
yet whether the charge holds, or is merely asserted, is rarely examined. In its
general form, it says that ``we project human-like qualities onto other things
on the basis of only superficial similarities'' \cite{sethmythology2026}.

In practice, however, a charge of anthropomorphism usually carries two claims
at once. The first is an epistemic claim about the attributor: the attribution
is a projection, arising from a disposition in us and resting on mere
resemblance, owing nothing to the evidence. The second is an ontological claim
about the object: that the object does not have the property. We therefore want
to ask what we are really claiming when we make a charge of anthropomorphism,
and whether the move from the epistemic claim to the ontological one is
legitimate.

A reasonable objection is that the psychology of anthropomorphism is by now
well documented, so that the concept needs no further clarification
\cite{epley2007,guthrie1993}. But that psychology concerns only the first of the
two claims, how an attribution is produced; it says nothing about whether the
move from the first claim to the second is a good one. Nor is the conceptual
question settled in philosophy: as we show in Section~3, even the most recent
scholarly treatments fail to keep the two claims apart.

The paper is as follows. In Section~2 we map the charge onto four variables and
ask what a legitimate charge of anthropomorphism requires. In Section~3 we test
its recent academic and public uses against that standard, and examine what is
perhaps the most widely circulated charge of all, that the system ``is just
predicting the next token''. Section~4 concludes, drawing from the analysis a
short set of requirements that both parties to the dispute must meet.

\section{What a Legitimate Charge Requires}

The charge of anthropomorphism is far from a creature of the language-model age,
and perhaps its most famous occasion is a horse. Around 1900 Clever Hans was
exhibited across Germany as an animal that could do arithmetic. Posed a sum
aloud, he would strike out the answer with his hoof. The demonstrations were
public and repeatable, and no manifest trickery could be detected, so that
almost everyone drew the same conclusion: Hans had a share of the mathematical
capacity we credit to ourselves.

Suppose you had been in the audience and had drawn the natural conclusion.
Someone beside you objects: you are anthropomorphising; the horse has no
mathematical ability. In what sense does this objection hold? To see its
structure, some heuristic formalisation might help. From the charge we can
separate four variables:

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item S, the subject, the thing to which the property is credited; here, the horse.
\item P, the property attributed to S; here, arithmetical ability.
\item C, the condition S must satisfy to have P; here, that the horse is really calculating.
\item f, the evidence the attributor goes on; here, that Hans strikes the right number.
\end{itemize}

We can then set out two competing chains of inference. Take the attributor
first. His inference can be set out as follows:

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item[{\small (i)}] f holds, that is, Hans strikes the right number.
\item[{\small (ii)}] f holds only where C holds, that is, only a horse that is really calculating strikes the right number.
\item[{\small (iii)}] So f establishes that S satisfies C, and thereby supports P.
\item[{\small (iv)}] Therefore Hans can do arithmetic.
\end{itemize}

Now take the objector:

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item[{\small (i')}] f holds, that is, Hans strikes the right number.
\item[{\small (ii')}] f holds in cases where C does not, that is, a horse that cannot calculate may still strike the right number.
\item[{\small (iii')}] So f does not establish that S satisfies C, and might not support P.
\item[{\small (iv')}] The attributor nonetheless holds that f supports P, and does so by projection.
\end{itemize}

The disagreement turns on (ii') and (iv'). The first concerns the horse and
its evidence: whether f can hold where C does not. The second concerns the
attributor: with f no longer supporting P, whether his holding to P is the
work of projection. We take them in turn.

Take the first. To attribute arithmetic to Hans is to read his tapping as a sign
that he is really calculating. To make the objection, the objector must point to
some other way the right taps could come about; that is, the objector needs a
possible competing explanation. Suppose the questioner knows the answer. As Hans
approaches the right number, the questioner unconsciously tenses, and relaxes
the moment it is reached; Hans reads this slight change in the questioner's
posture. On this account Hans is responding to the questioner. If that is what is happening, the taps no longer
show that Hans is calculating, and so give no support to the claim that he can
do arithmetic.

A merely possible competing explanation, however, is not enough. Suppose the
accuser proposes that Hans understands German: he recalls the answer he was
taught, and taps it out. This does not require the horse to calculate, so it
does not require S to satisfy C. Yet it leaves the attribution no less
anthropomorphic, because what it requires of the horse, a capacity for language,
is a human capacity as demanding as arithmetic itself. Crediting Hans with
German is, in this sense, no more credible than crediting him with arithmetic;
the accuser has merely relocated the projection. So we need to place a further
constraint on the competing explanation: it must be trivial, in that the
condition its operation requires of S must be weaker than C, so that an S
in which it operates need not satisfy C and need not have P. In this sense
attention to posture is trivial, while understanding German is not.

Two further requirements may also be necessary. First, the competing
explanation must be defensible: the accuser cannot merely say that some other
mechanism must be at work, or that such a mechanism is conceivable, but must
have evidence, independent of f, that it is real and operative. Second, it
must not already have been excluded by the attributor: if he has good reason to
rule it out (he has screened the questioner off, say, and the horse still
strikes the right number), it no longer unsettles f's support for P.

The first condition of a legitimate charge is therefore met when a competing
explanation can be exhibited that produces f without requiring S to satisfy
C, is trivial in the sense given, comes with justifying evidence, and has not
been excluded by the attributor.

The second condition of a legitimate charge, that the attribution is the work of
projection, is of a different kind: it concerns why the attributor holds that
f tracks P. As a claim about the source of his belief, appeal to a
psychological fact (whether an act of projection actually occurred, say) is not
operational, since he may well deny it, just as the objector may vaguely gesture
at some indefensible competing hypothesis. We do better to read it off his
epistemic situation. Suppose there is a trivial competing explanation that
produces f without the horse's working out the sum as we do. Then two typical
cases arise. In the first, the attributor knows of the competing explanation and
still holds that f tracks P. In the second, he does not, and supposes that
only a horse working out the sum as we do could strike the right number. Yet
whether or not he knows, he could have withheld the attribution and not credited
the horse with P. That he attributes it all the same is best understood as
projection: his confidence in P is sustained by the subject's likeness to us.

A legitimate charge of anthropomorphism thus has two conditions. The first falls
on the accuser, who must be able to offer an explanation of f that is trivial,
defensible, and not yet excluded by the attributor, showing that f does not in
fact support P. The second falls on the attributor: with f thus failing to
support P, his confidence in P is sustained by the subject's likeness to us,
which is to say by projection. A charge of anthropomorphism is legitimate just
when both conditions hold.

It bears emphasis that f need not be outward behaviour, and that this is what
the charge turns on.\footnote{It is natural to suppose that f must be outward
behaviour, so that any attribution resting on behaviour is anthropomorphic and
any resting on inner structure is safe; but neither half holds. Behaviour need
not invite the charge: let f be that Hans strikes the right number when the
questioner is screened off and ignorant of the answer. This is still behaviour,
but the cue-reading explanation no longer fits it, since that explanation
requires a questioner the horse can read; an attribution on this f is not
anthropomorphic, whatever else may be said against it. Inner evidence is not
exempt either: someone who credits a network with understanding because its
wiring diagram resembles a cortex goes on f that is internal, yet the
resemblance is a competing explanation that has not been ruled out. What the
charge turns on is whether an explanation that does not invoke P stands
unexcluded; whether f is outward or inner does not by itself decide it.}

\section{The Charge in Use}

Section~2 fixed the reach of a legitimate charge: at most it establishes the
epistemic claim, that f does not support P; it does not establish the
ontological claim, that S lacks P. In practice the two are run together, and
that conflation is where the charge does its damage. We distinguish three ways
its prominent uses go wrong.

The first failure is overreach: the charge establishes only the epistemic claim
yet is used to assert the ontological one as well. Perhaps the clearest public
instance is Ted Chiang's recent essay \cite{chiang2026}, whose argument can be
reconstructed as follows.

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item[{\small (P1)}] The system produces fluent, humanlike text (f).
\item[{\small (P2)}] There is a trivial competing explanation of f: the text is to be taken as ``a deepfake medium'', on the grounds that generating a plausible simulacrum of a conversation between conscious beings is far easier than building a program that is genuinely conscious, and such a simulacrum does not invoke consciousness.
\item[{\small (C1)}] So f does not support attributing consciousness to the system (f does not support P).\footnote{That is, since the trivial competing explanation has not been ruled out, f is equally compatible with the system's being conscious and with its merely simulating, and so does not discriminate between them.}
\item[{\small (C2)}] So the system is not conscious (S lacks P).\footnote{C2 is Chiang's stated conclusion \cite{chiang2026}.}
\end{itemize}

From (P1) and (P2), only (C1) follows legitimately. To reach (C2) one would have
to test the competing explanation itself and examine the system, not merely note
that f falls short, and that is the step Chiang does not take.

The same conflation enters the scholarly definitions. Placani
\cite{placani2024}, for instance, characterises anthropomorphism as ``a
distinctively human process of inference or interpretation'', and yet also
classifies it as ``either a factual error---when it involves the attribution of
a human characteristic to some entity that does not possess that
characteristic, or as an inferential error---when it involves an inference that
something is or is not the case when there is insufficient evidence to draw such
a conclusion''. Anthropomorphism cannot be both at once. As an epistemic matter
it says that f does not support P; a ``factual error'' is the ontological claim,
asserting that the object in fact lacks the feature. If an attribution does saddle an object with a feature it lacks,
that is a mistake the user has made; it does not fall under the concept of
anthropomorphism. To list a use-level error (the factual one) alongside
anthropomorphism (an epistemic one) as its two branches is a category mistake.

The second failure is that the charge does not apply to every attribution,
because it presupposes that the accused has attributed P on the strength of
f, the system's humanlike performance.

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item[{\small (P1)}] The accused attributes P to the system.
\item[{\small (P2)}] What he attributes P on is f, the system's humanlike performance.
\item[{\small (C)}] And f is taken to support P only by way of projection, reading ``resembles us'' as ``has P''; so the attribution is unsupported.
\end{itemize}

The charge finds its target only where (P2) holds. But some of the accused go on
something other than f. Take Hinton \cite{hinton2025}, whose argument can be
reconstructed as follows.

\begin{itemize}\setlength{\itemsep}{6pt}\setlength{\topsep}{6pt}\setlength{\parsep}{0pt}
\item[{\small (P1')}] When someone says ``I have a subjective experience of X'', the function of the words is to report that their perceptual system has gone wrong, by saying how the world would have to be for the perception to be correct. Talk of subjective experience is, on this account, a way of reporting the gap between how things are and how one perceives them.
\item[{\small (P2')}] A multimodal model with a camera, fooled by a hidden prism, points in the wrong direction; once told about the prism, it says that the object is really in front of it and that it had the subjective experience of the object being off to one side. The model is reporting exactly that gap.
\item[{\small (C)}] So the model uses the words ``subjective experience'' as we use them, and therefore has subjective experience.
\end{itemize}

This argument rests throughout on a semantic analysis of ``subjective
experience'' and on the prism experiment; f, whether the outputs read as
humanlike, never figures in it. So (P2) does not hold, and the charge of
projection misses: it attacks a premise the accused never offered. The fitting
response is to test the argument itself, challenging the redefinition of
``subjective experience'' or asking whether the model's report really means what
a human's does, whereas dismissing it as anthropomorphism simply sidesteps the
reasons actually given.\footnote{It is telling that Hinton is rarely accused of
anthropomorphism, even as he ascribes subjective experience to machines. Part of
the reason is surely that, as someone who understands the systems from the
inside, he is plainly not reasoning from surface resemblance, so the charge,
which targets inference from f, finds no purchase. One can imagine someone else
advancing the very same argument and being dismissed as an anthropomorphiser all
the same. The charge often turns on who is making the attribution, which shows
again how narrow its legitimate range really is.}

The third failure occurs at the first threshold, whether the competing
explanation succeeds at all. Recall the requirement of Section~2: for a
competing explanation to deprive f of its support for P, what it requires of
the system must be weaker than C, since only then can a system that merely
satisfies it fail to satisfy C, and so need not have P. Cue-reading works
precisely because what it requires (reading posture) is far weaker than C
(calculating).

In the AI debate the competing explanation most often offered is ``it is just
predicting the next token''. All of its force rests on an unstated premise: that
predicting the next token is something undemanding, far weaker than C (whether
C is understanding, a world model, or something else), as reading posture is
weaker than calculating. If that were so, then f (the system answering
fluently, solving new problems) would be fully compatible with the system's
merely predicting, and f would be no reliable indicator of P. The claim
looks intuitive, but it stands entirely on that premise, that predicting the
next token requires very little.

That premise, at least to us, is doubtful. The cleanest way to test what
predicting this well requires is to take a system least likely to harbour rich
internal structure: an early, small, thoroughly un-humanlike model, such as
Othello-GPT \cite{li2022othello}; it is trained only to predict the next legal
move in games of Othello, never told the rules and never shown a board. If
predicting the next token (here the next move) really required very little,
nothing close to C should be needed inside it. Yet under intervention, editing
its internal representation so as to flip one square of the represented board,
its subsequent judgments about which moves are legal change to match the edited
board.\footnote{Decodability alone would not settle the matter, since a probe
can recover information a model carries but does not use \cite{hewitt2019,
belinkov2022}; what settles it is the intervention, which shows the represented
board is causally in use \cite{nanda2023}.} This suggests that to predict moves
at this level the system must build, and causally use, an internal model of the
board. Predicting the next move, then, cannot confidently be said to require
anything weaker than C.

So this competing explanation is not the plainly trivial thing it is taken to
be, and it therefore cannot serve the accuser as a competing explanation at all;
in that sense the charge fails. As noted above, none of this shows that the
system cannot understand; it shows only that ``it is just predicting the next
token'' can no longer be taken as a free competing explanation, one that cancels
f at no cost. For the charge to be legitimate, someone would first have to
argue that predicting the next token at the frontier level does require less
than C, which, on the evidence above, is hard to sustain.

\section{Conclusion}

This paper takes no position on whether machines have minds, in part because that
question is already well served by a large literature, and in part because, among
philosophers themselves, such attributions rarely draw the charge of
anthropomorphism at all. What we have offered is a way of taking apart the
question that ``that is just anthropomorphism'' closes too early. The charge does
real damage because it is present in almost every public debate while doing far
less argumentative work than it appears to. Once it is set aside, the question
can be brought down to the variables that actually carry it: which subject S is
in question, which property P, what conditions C possessing P requires, and
whether the evidence f excludes the competing explanations.

Two heuristic requirements follow. The first we may call conceptual transparency.
Anyone who deploys the charge must say which subject S is in
question,\footnote{The choice of S is easily overlooked yet often where a
dispute really lies. The classic case is Searle's Chinese Room and the Systems
Reply it provoked: Searle locates the subject as the person in the room, who
understands no Chinese, while the reply relocates it to the whole system
\cite{searle1980}. Clarifying S remains necessary for present systems; see
Shanahan \cite{shanahan2024} on the distinction between the bare model and the
system in which it is embedded, and Chalmers \cite{chalmers2025wwt} on what kind
of entity an LLM-based system is.} which property P is at
issue,\footnote{A characteristic dispute over P can be seen in the recent
public exchange on machine consciousness. Pope Leo~XIV, in \emph{Magnifica
Humanitas} (2026), denies that AI systems are conscious, where to be conscious is
to undergo experience, to have a body, to feel joy and pain, to mature through
relationships. In the same period, interpretability work reports functional
emotion in large models, emotion representations with causal roles in behaviour,
while stating that ``none of this tells us whether language models actually feel
anything'' \cite{sofroniew2026}. The two do not contradict each other: they
concern different properties P, with different conditions C, so an attribution
of the one is consistent with a denial of the other. They appear to disagree only
because they share a word.}\ and which conditions C that property is taken to
require: whether suffering, say, demands phenomenal consciousness, or whether
understanding demands more than functional organisation. A charge that invokes
what a system ``really'' understands or ``genuinely'' feels, while declining to
state what reality here requires, merely presupposes its conclusion.

The second is symmetry of burden. Accuser and attributor alike must produce
evidence or argument that their own position is not the naive one: at a minimum,
the accuser must rule out the trivial competing explanation, and the attributor
must offer substantive theoretical support; surface resemblance alone will not
do. ``S lacks P'' is itself a claim about S, and it answers to C
and f exactly as ``S has P'' does. If interpretability evidence reveals
internal structure of the kind a mental capacity would require, that evidence
cannot be waved away by reasserting that the system is ``merely a machine'', any
more than behavioural resemblance alone settles the matter for the attributor.

\subsubsection*{Acknowledgements}
We thank the AISB AICE reviewers for their comments on the extended abstract.

\bibliography{paper}

\end{document}
