The Measure of an Agent's Power
How long can it work autonomously?
Claude Opus 4.5
~5 hours
GPT-5.1 Codex Max
~3 hours
Claude Sonnet 4.5
~2 hours
Gemini 3 Pro
~2 hours
@ 50% success rate
~7 months
Doubling time
6+ years
Exponential trend
Minutes
→
Hours (NOW)
→
Days
→
Weeks
Source:
METR Time Horizons 1.1
(Jan 2026)