feat(condenser): Token-aware condensation in LLMSummarizingCondenser #1380
Conversation
Looks like there are a few issues preventing this PR from being merged! If you'd like me to help, just leave a comment. Feel free to include any additional details that might help me get this PR into a better state.
@OpenHands please fix the failing actions on PR #1380 at branch
I'm on it! csmith49 can track my progress at all-hands.dev
The test file was importing from 'resolve_model_configs' (plural) but the actual file is 'resolve_model_config.py' (singular). Also updated the test functions to match the actual function signature, which takes only model_ids and uses the global MODELS dictionary.

Co-authored-by: openhands <openhands@all-hands.dev>
I have successfully fixed the failing GitHub Actions on PR #1380.

Issues Fixed: the test file imported from 'resolve_model_configs' (plural) instead of 'resolve_model_config.py' (singular), and the test functions did not match the actual function signature.

Verification Results: the failing "Agent Server" GitHub Action should now pass. The changes are minimal and focused only on fixing the test issues without affecting the core token-aware condensation functionality.
[Automatic Post]: I have assigned @simonrosenberg as a reviewer based on git blame information. Thanks in advance for the help!
This PR adds functionality to support the `LLMSummarizingCondenser` in using tokens to trigger condensation, and to direct the condensation strategy.

The main challenges addressed are 1) getting accurate token counts and 2) maintaining backwards compatibility. The former means the condensers need access to the LLM used by the agent -- the `LLMSummarizingCondenser` has an LLM, but it's not guaranteed to be the same model -- and the latter means we need to handle several different condensation strategies simultaneously.

That last point required a bit of a rework of the internal logic. Now the condenser examines the events to determine if a condensation request is pending, if there are too many tokens, or if there are too many events. Any one of those is a reason to condense, and based on which holds we slightly modify the events we forget. If several reasons hold at once, we pick the one that causes the most aggressive condensation, as sketched below.
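As a rough illustration of that selection logic, here is a minimal, self-contained sketch. The `Event` stand-in, the `forgotten_prefix_length` helper, its parameters, and the halving policy for explicit requests are all assumptions for illustration, not the PR's actual names or arithmetic:

```python
from dataclasses import dataclass


@dataclass
class Event:
    """Stand-in for an SDK event; only a token estimate and a request flag matter here."""
    tokens: int
    is_condensation_request: bool = False


def forgotten_prefix_length(
    events: list[Event],
    max_tokens: int | None = None,
    max_events: int | None = None,
    keep_first: int = 1,
) -> int:
    """Compute how many events (after the kept head) each trigger would forget,
    then take the most aggressive proposal when several triggers hold at once."""
    proposals = [0]
    n = len(events)
    # Reason 1: an explicit condensation request is pending.
    if any(e.is_condensation_request for e in events):
        proposals.append(max(0, n - keep_first) // 2)  # assumed policy: drop half the history
    # Reason 2: the history exceeds the token budget.
    if max_tokens is not None:
        total, cut = sum(e.tokens for e in events), 0
        while total > max_tokens and cut < n - keep_first:
            total -= events[keep_first + cut].tokens  # drop events just after the kept head
            cut += 1
        proposals.append(cut)
    # Reason 3: the history exceeds the event budget.
    if max_events is not None and n > max_events:
        proposals.append(n - max_events)
    return max(proposals)
```

Taking the maximum over the per-trigger proposals matches the PR's note that, when several reasons hold at once, the most aggressive condensation wins.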
One large benefit of this change is that it enables us to set condensation limits dynamically based on the model used by the agent -- just set `max_tokens` equal to a fraction of the context window of the chosen model. I don't yet know what that fraction should be, so none of that logic is implemented in this PR (see the sketch after this paragraph).

This PR is partially based on #912 and addresses many of the same problems.
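As a sketch of that dynamic-limit idea (the helper name and the 0.75 fraction are placeholders, not values from this PR):

```python
def max_tokens_for(context_window: int, fraction: float = 0.75) -> int:
    """Derive a condensation threshold from a model's context window.

    The fraction is deliberately a placeholder; the PR leaves the right
    value undetermined.
    """
    return int(context_window * fraction)


# e.g. a 200k-token context window would trigger condensation past 150k tokens
assert max_tokens_for(200_000) == 150_000
```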
Changes
- Updated the `Condenser.condense(...)` interface to ensure the condenser has access to the same LLM used by the agent (needed for accurate token counts); see the interface sketch after this list.
- Added a `utils.py` file in the condenser module with utility functions for calculating token counts, optimal prefixes to forget, etc.
- Added the `LLMSummarizingCondenser.max_tokens` parameter for setting token limits.
- Reworked the `LLMSummarizingCondenser` to handle multiple condensation reasons simultaneously.
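To make the interface change concrete, here is a minimal sketch of what the reworked `condense(...)` signature might look like; the parameter names and the `LLM` stand-in are assumptions based on the description above, not the SDK's actual definitions:

```python
from abc import ABC, abstractmethod
from typing import Any


class LLM:
    """Stand-in for the SDK's LLM class; only token counting matters here."""

    def count_tokens(self, events: list[Any]) -> int:
        # Crude placeholder: a real implementation defers to the model's tokenizer.
        return sum(len(str(e)) // 4 for e in events)


class Condenser(ABC):
    """Sketch: condense() now also receives the agent's LLM, so token counts
    reflect the model actually in use rather than the condenser's own LLM."""

    @abstractmethod
    def condense(self, events: list[Any], llm: LLM) -> list[Any]:
        """Return a (possibly condensed) view of the event history."""
```

Passing the agent's LLM explicitly is what keeps the counts accurate: a summarizing condenser can still use its own model for summaries while measuring token pressure against the agent's model.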
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images

| Variant | Base Image |
| --- | --- |
| java | eclipse-temurin:17-jdk |
| python | nikolaik/python-nodejs:python3.12-nodejs22 |
| golang | golang:1.21-bookworm |

Pull (multi-arch manifest)
```
# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:b999f86-python
```

Run
All tags pushed for this build
About Multi-Architecture Support
- The variant tag (e.g. b999f86-python) is a multi-arch manifest supporting both amd64 and arm64
- Architecture-specific tags (e.g. b999f86-python-amd64) are also available if needed