Definition
What is Context Offloading?
Last updated
The practice of keeping an AI agent's working context window small by moving state to an external destination, such as a file, a sub-agent, or a retrievable store, and bringing it back only when a step needs it.
Long-running agents accumulate conversation history, tool outputs, and observations that dilute attention and degrade accuracy. Context offloading patterns, including structured note-taking, sub-agent delegation, and just-in-time retrieval, relocate that bulk to a destination so the agent works with a small, high-signal window and can recover detail on demand. It differs from reduction techniques like compaction, which shrink the window in place rather than relocating anything.
Further reading
Articles about Context Offloading
All terms
View full glossaryPut context into practice
Create your first context container and connect it to your AI tools in minutes.
Create Your First Container