Context token limits now provider-aware
When multiple providers expose the same model ID with different configured context windows, OpenClaw looked up session context limits by bare model identifier. The persisted context-token count could therefore belong to a different provider than the one actually handling the conversation, causing incorrect /status values, premature compaction, and off-target memory-flush thresholds.
The fix threads the active provider through all hot paths that resolve context window limits: session usage persistence in the auto-reply agent, inline directive handling, and memory-flush / preflight compaction sizing. Each path now calls the provider-qualified resolution function, which scans the configuration per provider rather than by model ID alone.
Users with multi-provider configurations will now see accurate /status output, correct compaction thresholds, and properly sized memory flush decisions. The session store reflects the actual active provider's context window rather than whichever provider was encountered first or cached last.
This issue lived in the @openclaw/auto-reply package, specifically in the reply state and memory management logic that writes to the session store.
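The collision described above can be sketched as follows. The config shape, field names, and function bodies here are illustrative stand-ins, not the real OpenClaw types; only the function names come from the PR itself.

```typescript
// Hypothetical config shape: two providers expose the same model id
// with different context windows (field names are illustrative).
type ProviderConfig = { models: Record<string, { contextWindow: number }> };
type Config = { models: { providers: Record<string, ProviderConfig> } };

const cfg: Config = {
  models: {
    providers: {
      "provider-a": { models: { "gpt-x": { contextWindow: 128_000 } } },
      "provider-b": { models: { "gpt-x": { contextWindow: 32_000 } } },
    },
  },
};

// Buggy pattern: whichever provider lists the model id first wins,
// regardless of which provider handles the session.
function lookupContextTokensByModelOnly(cfg: Config, model: string): number | undefined {
  for (const provider of Object.values(cfg.models.providers)) {
    const entry = provider.models[model];
    if (entry) return entry.contextWindow;
  }
  return undefined;
}

// Fixed pattern: resolution is qualified by the active provider.
function resolveContextTokensForModel(opts: { cfg: Config; provider: string; model: string }): number | undefined {
  return opts.cfg.models.providers[opts.provider]?.models[opts.model]?.contextWindow;
}

// A session handled by provider-b previously got provider-a's window:
console.log(lookupContextTokensByModelOnly(cfg, "gpt-x"));
console.log(resolveContextTokensForModel({ cfg, provider: "provider-b", model: "gpt-x" }));
```

With the bare lookup, a provider-b session persists 128,000 tokens; provider-qualified resolution returns provider-b's configured 32,000.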
Original GitHub Description
Summary
- Problem: Session usage persistence, inline directive handling, and memory-flush / preflight compaction sizing used bare model ids (`lookupContextTokens`/`lookupCachedContextTokens`) for context limits. When several providers expose the same model id with different configured context windows, the wrong limit could be written to `sessionEntry.contextTokens` and affect `/status`, compaction, and flush behavior (#62472).
- Why it matters: Persisted `contextTokens` drives usage display, compaction thresholds, and related safeguards; a stale or collided value can make the runtime overly conservative or inconsistent with the active provider.
- What changed: Use `resolveContextTokensForModel({ cfg, provider, model, allowAsyncLoad: false })` in the auto-reply agent and follow-up runners, in the `persistInlineDirectives` return value, and in `resolveMemoryFlushContextWindowTokens` (threading `cfg` + `provider` from the follow-up run). Follow-up runs now resolve `providerUsed` from agent meta when present, matching the main reply path.
- What did NOT change: CLI `updateSessionStoreAfterAgentRun` already used provider-qualified resolution; gateway session row building and status logic were left as-is. The global context cache structure is unchanged; resolution goes through the existing `resolveContextTokensForModel` contract.
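The "follow-up runs now resolve `providerUsed` from agent meta" change can be sketched as a simple fallback. The meta and session shapes below are assumptions for illustration; only the `providerUsed` field name comes from the PR.

```typescript
// Hypothetical shapes: agent meta may record which provider actually
// served the run; the session carries the configured default provider.
type AgentMeta = { providerUsed?: string };
type Session = { provider: string; contextTokens?: number };

// Prefer the provider recorded in agent meta when present, matching the
// main reply path; otherwise fall back to the session's provider.
function providerForRun(meta: AgentMeta | undefined, session: Session): string {
  return meta?.providerUsed ?? session.provider;
}

const session: Session = { provider: "provider-a" };
console.log(providerForRun({ providerUsed: "provider-b" }, session)); // "provider-b"
console.log(providerForRun(undefined, session)); // "provider-a"
```

The resolved provider is then what gets threaded into `resolveContextTokensForModel`, so follow-up runs persist the same window as the main reply path.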
Change Type (select all)
- Bug fix
- Feature
- Refactor required for the fix
- Docs
- Security hardening
- Chore/infra
Scope (select all touched areas)
- Gateway / orchestration
- Skills / tool execution
- Auth / tokens
- Memory / storage
- Integrations
- API / contracts
- UI / DX
- CI/CD / infra
Linked Issue/PR
- Closes #62472
- Related #
- This PR fixes a bug or regression
Root Cause (if applicable)
- Root cause: Mixed strategies: `resolveContextTokensForModel` correctly scans `cfg.models.providers` per provider, but several hot paths still used bare-model cache lookups or cached tokens keyed only by model id, so duplicate ids across providers could leak the wrong window into persisted session metadata.
- Missing detection / guardrail: Unit coverage did not assert provider-qualified resolution for `resolveMemoryFlushContextWindowTokens` when two providers list the same model id with different limits.
- Contributing context (if known): N/A
Regression Test Plan (if applicable)
- Coverage level that should have caught this:
- Unit test
- Seam / integration test
- End-to-end test
- Existing coverage already sufficient
- Target test or file: `src/auto-reply/reply/reply-state.test.ts`
- Scenario the test should lock in: Two providers in config share one model id with different `contextWindow` values; `resolveMemoryFlushContextWindowTokens` returns the limit for the requested `provider`.
- Why this is the smallest reliable guardrail: Exercises the same `resolveContextTokensForModel` path used by reply usage persistence and memory sizing without standing up a full channel run.
- Existing test that already covers this (if any): Partial — `resolveContextTokensForModel` tests in `src/agents/context.lookup.test.ts`; this adds coverage at the memory-flush entry point.
- If no new test is added, why not: N/A — test added.
User-visible / Behavior Changes
- Session `contextTokens` and related usage / flush thresholds should match the active provider's configured window when multiple providers reuse the same model id.
Diagram (if applicable)
N/A
Security Impact (required)
- New permissions/capabilities? No
- New network endpoints or trust boundaries? No
Testing
- `pnpm exec oxlint --type-aware` on touched `src/auto-reply/reply/*.ts` files
- `pnpm test src/auto-reply/reply/reply-state.test.ts -t resolveMemoryFlushContextWindowTokens`