We handed two of today's most capable models the exact same one-paragraph prompt: build a working planetarium from scratch — no libraries, no lookups, just raw orbital math — and show tonight's sky. Both nailed it. Both spun up a gorgeous, fully interactive star map with the Sun, the Moon, and all five naked-eye planets in mathematically correct positions. Then we put them side by side and caught something strange: the two skies were mirror images, turning in opposite directions.
|
JOIN 46,000+ PROS Get the AI edge, every morning. Was this forwarded to you? Get one short, useful AI briefing — free, 5–6 mornings a week.
Free forever · unsubscribe anytime |
Go from AI overwhelmed to AI savvy professional
AI will eliminate 300 million jobs in the next 5 years.
Yours doesn't have to be one of them.
Here's how to future-proof your career:
Join the Superhuman AI newsletter - read by 1M+ professionals
Learn AI skills in 3 mins a day
Become the AI expert on your team
TLDR: We gave Claude Opus 4.8 and Claude Fable 5 the same prompt and got two beautiful, correct-looking planetariums that render the sky in opposite orientations — and only one matched what we actually asked for. Below: what split them, why it matters for every answer you trust to AI, and a prompt that turns the lesson into a habit.
Quick note: Fable 5 is temporarily paused as we publish — which is exactly why the habit beats the tool. The gut-check below works with whatever two models you've got.
Both Got the Stars Right. Only One Got the Vantage Right.
Every measurable thing agreed. Both models drew a new moon lit at roughly 3%. Both placed Mercury, Venus, and Jupiter east of the Sun, and Mars and Saturn to the west. Both put all seven bodies above the horizon, exactly where they belong tonight. The astronomy was identical — and correct. The split was orientation. Our prompt asked them to "show the sky chart with horizon and azimuth" — and a chart bounded by a horizon is, by definition, the view of someone standing on the ground looking up at the dome. Claude Fable 5 drew exactly that: east on the left, the way a real star chart reads when you tip your head back. Claude Opus 4.8 rendered the same correct sky as if you were hovering above it looking down at a map: east on the right. One character in the projection code, and the entire sky flipped.
This Is Why You Run It Twice
Neither model is broken. Both are stunning — a year ago, writing a planetarium from first principles in a single file was a research project, not a prompt you fire off before coffee. The takeaway isn't fear. As models get more polished, their misses get more confident and better-looking. Polish is converging fast. Correctness isn't — quite. Which makes the cheapest quality check in all of AI almost embarrassingly simple: ask two models the same thing and look at where they disagree. Agreement is your green light. Disagreement is a flashing arrow pointing straight at the one spot you need to check.
How to Catch It in 30 Seconds
Run anything that actually matters — a claim, a calculation, a plan — through two different models, not one. Skim for where they diverge; ignore the 90% they agree on and zero in on the 10% they don't. At each split, ask which one matched what you actually asked for, not which one sounds more sure. The planetarium is a perfect teacher because the mistake is visible. Most of the time it isn't. The habit is the same either way.
The Prompt — The Two-Model Gut Check
We turned the lesson into a reusable verification partner you can keep:
Prompt Proof Table
Two AIs, one sky, opposite views — and the model that looked gorgeous wasn't the one that looked up. The next flawless answer you get? Ask a second model which way it's facing.
## What kind of content do you want to see more of
If you enjoyed this issue and want more like it, subscribe to the newsletter.
Brought to you by Stoneyard.com • Subscribe • Forward • Archive


