ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
Abstract
Budget-Aware Context Management (BACM) addresses context limitations in LLM agents through sequential decision-making and reinforcement learning-based compression strategies.
LLM-based agents show strong potential for long-horizon reasoning, yet their context size is limited by deployment factors (e.g., memory, latency, and cost), yielding a constrained context budget. As interaction histories grow, this induces a trade-off between retaining past information and staying within the context limit. To address this challenge, we propose Budget-Aware Context Management (BACM), which formulates context management as a sequential decision problem with a context budget constraint. It enables agents to assess the available budget before incorporating new observations and decide when and how much of the interaction history to compress. We further develop BACM-RL, an end-to-end curriculum-based reinforcement learning approach that learns compression strategies under varying context budgets. Experiments on compositional multi-objective QA and long-horizon web browsing benchmarks show that BACM-RL consistently outperforms prior methods across model scales and task complexities, achieving over 1.6times gains over strong baselines in high-complexity settings, while maintaining strong advantages as budgets shrink, where most methods exhibit a downward performance trend.
Community
- We propose Budget-Aware Context Management} a framework that formulates budget-aware context compression for LLM agents as a sequential decision problem under explicit context-window constraints, enabling adaptive compression throughout long-horizon trajectories.
- We develop a budget-constrained RL method that extends GRPO with a progressively tightened budget curriculum and overflow-sensitive regularization for robust context management under strict budgets.
- We conduct extensive empirical studies on compositional multi-objective QA and long-horizon web browsing benchmarks, and further provide comparative analyses of compression behavior and efficiency under context constraints.
Get this paper in your agent:
hf papers read 2604.01664 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper