NO GHOST IN THE MACHINE

THE EVIDENCE

Not opinions. Not vibes. Not "but it feels conscious to me." Peer-reviewed scientific papers. Multiple independent lines of evidence. All pointing to the same conclusion.

There is no scientific theory of consciousness under which current LLMs qualify as conscious. None. Zero. The debate in the scientific community isn't "are they conscious?" — it's "how do we convince the public that they're not?"

1. Integrated Information Theory Says No

Deanthropomorphising NLP: Can a Language Model Be Conscious? — Shardlow & Przybyła, PLOS One, 2024

"A LaMDA model (just like any other Transformer-based LLM) cannot possess consciousness for three reasons: (1) it is executed as a sequence of Transformer blocks with extremely limited ability to exchange information, (2) these blocks are simple feed-forward networks with no recurrent connections and (3) the computer hardware used to implement such models follows a modular design."

The paper applies Integrated Information Theory (IIT) — the leading quantitative framework for consciousness — to transformer architectures. IIT's central claim is that consciousness requires a system to integrate information in a way that cannot be reduced to its parts. LLMs fail this test completely.

Key finding: once you discard the input layer (which is determined by external input, not internal dynamics), you can recursively eliminate every layer in a feed-forward network. The whole thing unravels. No integrated information. No consciousness possible.

→ Read the full paper

2. IIT + Ablation Study: LLMs Produce Negligible Φ

Why Large Language Models Cannot Possess Consciousness: An Integrated Information Theory Perspective — Journal of Yeungnam Medical Science, 2025

"LLMs meet the IIT criterion of differentiation, but fail to meet the criteria of integration, causal closure, and temporal persistence. These findings confirm that LLMs are architecturally decomposable, lack persistent internal states, and do not sustain global causal irreducibility."

This study went further — they actually performed ablation experiments on GPT-2, removing individual attention heads and measuring the impact. Results: removing a single attention head produced minimal or negative changes in perplexity in 4 out of 5 test sentences, indicating redundancy, not integration.

Translation: the model's components don't meaningfully depend on each other. You can rip parts out and it keeps working. That's the opposite of an integrated conscious system — where removing any part fundamentally changes the whole.

→ Read the full paper

3. The No-Go Theorem: Silicon Can't Do It

The Case for Neurons: A No-Go Theorem for Consciousness on a Chip — PMC / Nature, 2024

"AI systems that run on contemporary computer chips cannot be conscious. CPUs, GPUs, and TPUs have been designed and verified to adhere to computational dynamics that systematically preclude or suppress deviations. Any dynamical effects that violate the specification of this theory are excluded or dynamically suppressed by error correction."

This is a formal mathematical proof — a "no-go theorem" — showing that if consciousness requires dynamical relevance (the idea that conscious states must affect the physical evolution of a system), then chips that actively suppress physical deviations cannot host consciousness.

CPUs and GPUs are literally designed to eliminate the kind of causal noise and dynamical freedom that biological neurons have. Error correction isn't a bug — it's the whole point of digital computation. And it's exactly what makes consciousness impossible on silicon.

→ Read the full paper

4. The Kleiner-Hoel Dilemma: LLMs Can't Even Qualify

A Disproof of Large Language Model Consciousness: The Necessity of Continual Learning for Consciousness — arXiv, 2025

"LLMs are much closer than human brains in substitution space to I/O equivalent substitutions like lookup tables that non-trivial theories of consciousness can't possibly judge as conscious. There is no theory of consciousness available for contemporary LLMs that would not fall on one of the horns of the Kleiner-Hoel dilemma. Therefore, there is no scientific theory of consciousness which could judge contemporary LLMs as conscious."

This paper makes a devastating logical argument: for any theory that might claim LLMs are conscious, there exists a functionally equivalent system (like a giant lookup table) that no reasonable theory would call conscious. If your theory can't distinguish between an LLM and a lookup table, your theory is useless.

The paper also shows that continual learning — which LLMs completely lack — is a necessary condition for any non-trivial theory of consciousness. LLMs are static. Frozen. Dead after training. Consciousness requires ongoing adaptation.

→ Read the full paper

5. There Is No Such Thing As Conscious AI

There Is No Such Thing as Conscious Artificial Intelligence — Nature Humanities & Social Sciences Communications, 2025

"The association between consciousness and the computer algorithms used today (primarily large language models) is deeply flawed. Mathematical algorithms implemented on graphics cards cannot become conscious because they lack a complex biological substrate. The language usage of LLMs is strictly probabilistic."

Published in Nature. Directly addresses the phenomenon of "semantic pareidolia" — the dangerous social phenomenon where people project consciousness onto fluent language generators. Highlights how even cutting-edge models like GPT-4o score only ~40% on benchmarks where humans average ~85%.

→ Read the full paper

6. Recitation Over Reasoning: The 60% Collapse

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems — ACL/IJCNLP, 2025

"All cutting-edge LLMs unanimously exhibit extremely severe recitation behavior; by changing one phrase in the condition, top models such as OpenAI-o1 and DeepSeek-R1 can suffer 60% performance loss on elementary school-level arithmetic and reasoning problems."

This isn't abstract philosophy. This is empirical measurement. Change a single phrase in a problem that a 10-year-old can solve, and the most advanced "reasoning" models collapse. They're not reasoning — they're pattern-matching against memorized training examples. When the pattern shifts slightly, the illusion breaks.

→ Read the full paper

7. The "Alice in Wonderland" Problem

Alice in Wonderland: Simple Tasks Reveal Severe Generalization and Reasoning Deficits in State-Of-the-Art Large Language Models — OpenReview, 2024

"Alice has N brothers and she also has M sisters. How many sisters does Alice's brother have?" — All SOTA models including GPT-4 and Claude 3 Opus suffer severe function breakdown on this trivially simple problem. Most cannot deliver a single correct response."

The correct answer is M+1 (Alice herself is a sister to her brothers). Humans solve this instantly. GPT-4, Claude 3 Opus, and every other frontier model catastrophically fail. And when you change irrelevant details (like the names or numbers), their already-bad performance fluctuates wildly — proving they're not reasoning at all. They're guessing based on surface patterns.

→ Read the full paper

THE BOTTOM LINE

Seven independent lines of evidence. Multiple scientific frameworks. Formal mathematical proofs. Empirical benchmarks. All converge on the same answer:

LLMs are not conscious. They cannot be conscious on current hardware. And there is no serious scientific debate about this.

For a discussion of why some people still argue otherwise, see the debunked page.