The Token Problem
Loading all MCP tool schemas upfront burns context window
❌ Today: Upfront Loading
~50
tool schemas in system prompt
Unused schemas
Actually called
Every turn pays for schemas the agent never uses. Most conversations touch 2-3 tools max.
✅ Better: Lazy Discovery
0
schemas loaded at start
Tokens saved
On-demand load
Agent searches → discovers matching tool → loads only what it needs. Context stays clean.