What AI's Disagreements Reveal About How It Works

What Nine AI Answers to One Obscure History Question Reveal About Language Models, Hallucination and How to Stay in Control

A small school on a remote Japanese island -- 青海島小学校, now preserved as a cultural site called 青海島共和国 on Omi Island in Yamaguchi Prefecture -- holds a striking mystery in its archive. Every year's sixth-grade graduation photo includes both boys and girls. Every year except one: 1946. In that year's photo, only girls appear.

The same question about that photo was submitted to nine AI systems in August 2025 -- Claude, ChatGPT, Perplexity, Grok, Gemini, Copilot, DeepSeek, Qwen and Kimi. What came back was not nine versions of the same answer. It was nine distinct historical theories, weighted differently, landing on different conclusions, with different tones and different levels of confidence. Some were right. Some were plausible. Some were confidently wrong.

That divergence is not a bug. It is the most direct window into how large language models are actually built.