[Stability fix] turn off HMA allocator when connector is set #27592

KuntaiDu · 2025-10-27T16:30:41Z

Due to #25712 , currently vLLM will hard fail if the user simply set a connector because vLM enables HMA by default.

To avoid hard fail, before discussing with @njhill to figure out a better solution, let's just turn off HMA when a connector is being set.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: KuntaiDu <[email protected]>

mergify · 2025-10-27T16:31:20Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @KuntaiDu.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

gemini-code-assist

Code Review

This pull request aims to fix a stability issue by disabling the Hybrid Memory Allocator (HMA) when a connector is configured. While the change is functionally correct in its intent, it introduces a redundant code block. The check for kv_transfer_config is now performed twice consecutively. This should be consolidated into a single check to improve code clarity and maintainability.

Signed-off-by: Kuntai Du <[email protected]>

KuntaiDu · 2025-10-27T17:03:02Z

Code Review

This pull request aims to fix a stability issue by disabling the Hybrid Memory Allocator (HMA) when a connector is configured. While the change is functionally correct in its intent, it introduces a redundant code block. The check for kv_transfer_config is now performed twice consecutively. This should be consolidated into a single check to improve code clarity and maintainability.

It's now fixed. Thanks for the reminder.

NickLucche

Fixes

(EngineCore_DP0 pid=3595930) ValueError: Connector NixlConnector does not support HMA but HMA is enabled. Please set `--disable-hybrid-kv-cache-manager`.

I think we should just change logic to disabling it if it does not support hma with the interface you defined, given it's on by default

njhill

Thanks @KuntaiDu

vllm/config/vllm.py

Signed-off-by: KuntaiDu <[email protected]>

…thub.com/KuntaiDu/vllm into kuntai-disable-HMA-for-connector-for-now

Signed-off-by: KuntaiDu <[email protected]>

KuntaiDu · 2025-10-27T18:26:53Z

Fixes
(EngineCore_DP0 pid=3595930) ValueError: Connector NixlConnector does not support HMA but HMA is enabled. Please set `--disable-hybrid-kv-cache-manager`.
I think we should just change logic to disabling it if it does not support hma with the interface you defined, given it's on by default

Agree. For now let's temporarily disable it for all connectors to avoid blocking the release. Will figure out a way to only turn off HMA when connector does not support it.

njhill

Thanks @KuntaiDu

ApostaC

LGTM!

simon-mo · 2025-10-28T01:26:51Z

Ready to be force merged?

KuntaiDu · 2025-10-29T04:31:35Z

Ready to be force merged?

Yes and ty for the force merge.

…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>

turn off HMA connector when connector is set

d65eca0

Signed-off-by: KuntaiDu <[email protected]>

KuntaiDu requested review from ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, robertgshaw2-redhat, simon-mo, tlrmchlsmth, yewentao256 and youkaichao as code owners October 27, 2025 16:30

mergify bot added the needs-rebase label Oct 27, 2025

gemini-code-assist bot reviewed Oct 27, 2025

View reviewed changes

Merge branch 'main' into kuntai-disable-HMA-for-connector-for-now

16a2bb5

Signed-off-by: Kuntai Du <[email protected]>

mergify bot removed the needs-rebase label Oct 27, 2025

NickLucche approved these changes Oct 27, 2025

View reviewed changes

njhill reviewed Oct 27, 2025

View reviewed changes

vllm/config/vllm.py Show resolved Hide resolved

njhill added this to the v0.11.1 milestone Oct 27, 2025

add logger warning

bb82eb6

Signed-off-by: KuntaiDu <[email protected]>

KuntaiDu requested a review from njhill October 27, 2025 18:21

KuntaiDu added 2 commits October 27, 2025 11:21

Merge branch 'kuntai-disable-HMA-for-connector-for-now' of https://gi…

b7b6f3e

…thub.com/KuntaiDu/vllm into kuntai-disable-HMA-for-connector-for-now

adjust logger warning to make it more informative

c21bb17

Signed-off-by: KuntaiDu <[email protected]>

njhill approved these changes Oct 27, 2025

View reviewed changes

njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 27, 2025

njhill enabled auto-merge (squash) October 27, 2025 18:47

ApostaC approved these changes Oct 27, 2025

View reviewed changes

Merge branch 'main' into kuntai-disable-HMA-for-connector-for-now

d00bb77

simon-mo disabled auto-merge October 28, 2025 01:32

simon-mo merged commit 255e34c into vllm-project:main Oct 28, 2025
42 of 45 checks passed

KuntaiDu deleted the kuntai-disable-HMA-for-connector-for-now branch October 29, 2025 04:31

KuntaiDu mentioned this pull request Nov 4, 2025

[Hybrid allocator + kv connector] revert connector test changes related to hybrid allocator #28011

Merged

5 tasks

ilmarkov pushed a commit to neuralmagic/vllm that referenced this pull request Nov 7, 2025

[Stability fix] turn off HMA allocator when connector is set (vllm-pr…

992bfa3

…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>

ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025

[Stability fix] turn off HMA allocator when connector is set (vllm-pr…

d52d89e

…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>

rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025

[Stability fix] turn off HMA allocator when connector is set (vllm-pr…

12eb1be

…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

[Stability fix] turn off HMA allocator when connector is set (vllm-pr…

d4c4db1

…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>

Uh oh!

[Stability fix] turn off HMA allocator when connector is set #27592

[Stability fix] turn off HMA allocator when connector is set #27592

Uh oh!

Conversation

KuntaiDu commented Oct 27, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mergify bot commented Oct 27, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

KuntaiDu commented Oct 27, 2025

Code Review

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

njhill left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

KuntaiDu commented Oct 27, 2025

Uh oh!

njhill left a comment

Choose a reason for hiding this comment

Uh oh!

ApostaC left a comment

Choose a reason for hiding this comment

Uh oh!

simon-mo commented Oct 28, 2025

Uh oh!

Uh oh!

KuntaiDu commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

KuntaiDu commented Oct 27, 2025 •

edited by github-actions bot

Loading