
fix: support HMA for vLLM OffloadingConnector #351

Open: yu-zh47 wants to merge 2 commits into ovg-project:main from yu-zh47:fix/offloading-connector-hma

yu-zh47 commented Jun 1, 2026

Fixes #267.

Summary

This PR backports vLLM 0.21.0's HMA support for OffloadingConnector to older vLLM versions.

In vLLM 0.16.0 through 0.20.x, OffloadingConnector exists but does not inherit SupportsHMA. Because vLLM 0.16 enables the hybrid memory allocator (HMA) path by default, vllm serve ... --kv-offloading-size N fails during connector creation with:

ValueError: Connector OffloadingConnector does not support HMA but HMA is enabled.
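
The failure mode described above can be illustrated with a minimal sketch. This is not vLLM's actual code: the classes below are simplified stand-ins, `PatchedOffloadingConnector` and `create_connector` are hypothetical names, and `SupportsHMA` is treated as a plain marker class even though the real interface may require additional methods. The sketch only assumes that connector creation rejects any connector class that does not inherit `SupportsHMA` while HMA is enabled.

```python
# Simplified stand-ins for illustration only; these are NOT vLLM's real classes.


class SupportsHMA:
    """Marker: a connector that declares hybrid memory allocator support."""


class OffloadingConnector:
    """Stand-in for the older connector, which lacks the SupportsHMA marker."""


class PatchedOffloadingConnector(OffloadingConnector, SupportsHMA):
    """Hypothetical backport: the same connector, now declaring HMA support."""


def create_connector(connector_cls, hma_enabled: bool):
    """Hypothetical factory that mirrors the check described in this PR."""
    if hma_enabled and not issubclass(connector_cls, SupportsHMA):
        raise ValueError(
            f"Connector {connector_cls.__name__} does not support HMA "
            "but HMA is enabled."
        )
    return connector_cls()


if __name__ == "__main__":
    try:
        create_connector(OffloadingConnector, hma_enabled=True)
    except ValueError as err:
        print(f"ValueError: {err}")

    # The patched class passes the same check.
    connector = create_connector(PatchedOffloadingConnector, hma_enabled=True)
    print(type(connector).__name__)
```

Under these assumptions, the unpatched class is rejected as soon as HMA is on, while a subclass that also inherits the marker is accepted without any change to the caller.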

Development

Successfully merging this pull request may close these issues.

Error when vllm-v0.16.0 uses --kv-offloading-size
