Academic evidence
MCPToolBench++ (2025) — LLM context windows limit available tools per run; tool descriptions consume significant tokens
ScaleMCP (2025) — Static tool loading doesn't scale; proposes dynamic lazy loading
CaveAgent (2026) — Advocates metadata-driven skill activation over static tool definitions