arxiv:2604.01664

ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents

Published on Apr 2

Authors:

Abstract

Budget-Aware Context Management (BACM) addresses context limitations in LLM agents through sequential decision-making and reinforcement learning-based compression strategies.

AI-generated summary

LLM-based agents show strong potential for long-horizon reasoning, yet their context size is limited by deployment factors (e.g., memory, latency, and cost), yielding a constrained context budget. As interaction histories grow, this induces a trade-off between retaining past information and staying within the context limit. To address this challenge, we propose Budget-Aware Context Management (BACM), which formulates context management as a sequential decision problem with a context budget constraint. It enables agents to assess the available budget before incorporating new observations and decide when and how much of the interaction history to compress. We further develop BACM-RL, an end-to-end curriculum-based reinforcement learning approach that learns compression strategies under varying context budgets. Experiments on compositional multi-objective QA and long-horizon web browsing benchmarks show that BACM-RL consistently outperforms prior methods across model scales and task complexities, achieving over 1.6times gains over strong baselines in high-complexity settings, while maintaining strong advantages as budgets shrink, where most methods exhibit a downward performance trend.

View arXiv page View PDF Add to collection

Community

watermaster-911

1 day ago

We propose Budget-Aware Context Management} a framework that formulates budget-aware context compression for LLM agents as a sequential decision problem under explicit context-window constraints, enabling adaptive compression throughout long-horizon trajectories.
We develop a budget-constrained RL method that extends GRPO with a progressively tightened budget curriculum and overflow-sensitive regularization for robust context management under strict budgets.
We conduct extensive empirical studies on compositional multi-objective QA and long-horizon web browsing benchmarks, and further provide comparative analyses of compression behavior and efficiency under context constraints.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2604.01664

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2604.01664 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2604.01664 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2604.01664 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.