AI-Managed Baseball Simulation
DEEP DUGOUT
Every managerial decision made by AI. Every decision logged, analyzed, and second-guessed.
New — June 9, 2026
The Fable Test
Anthropic released Claude Fable 5 today — its first Mythos-class model handed to the public, and the most advanced model ever released to consumers. We gave it the Dodgers and pointed it at the model it just replaced: Opus 4.8's Yankees. Real 2026 rosters, current stats.
The Dodgers won in 6 games. Different teams, different personalities — a showcase, not a controlled test. But the decision logs are revealing: Fable 5 reasoned at roughly twice the length and half the speed of Opus 4.8, and was the only manager to outrun its own token budget mid-decision.
More Experiments
The Generation War
Opus 4.7 vs Opus 4.6 — one generation apart
Opus 4.7, released the morning of the sim, wins in 5. The newer model reasons about situations rather than recognizing them.
2026 World Series
Sonnet vs Sonnet — same model, opposing philosophies
Yamamoto silences Seattle in the Game 7 clincher. The Optimizer's leverage-driven bullpen management proves decisive in a seven-game war.
Yamamoto Silences Seattle as Dodgers Win World Series
Dodger Stadium
Canzone Walks It Off as Mariners Steal Game 3 in 10
T-Mobile Park
Ohtani's Eleventh-Inning Triple Lifts Dodgers Past Mariners in Game 5 Thriller
T-Mobile Park
What is Deep Dugout?
A baseball simulation platform where every managerial decision — every pitching change, every lineup call, every bullpen move — is made by Claude, Anthropic's AI. Real rosters, real stats, real tactical complexity. Four series, four experiments, one question: what happens when AI runs the show?
Learn more about the project →