-
-
Notifications
You must be signed in to change notification settings - Fork 11.7k
[Stability fix] turn off HMA allocator when connector is set #27592
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Stability fix] turn off HMA allocator when connector is set #27592
Conversation
Signed-off-by: KuntaiDu <[email protected]>
|
This pull request has merge conflicts that must be resolved before it can be |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request aims to fix a stability issue by disabling the Hybrid Memory Allocator (HMA) when a connector is configured. While the change is functionally correct in its intent, it introduces a redundant code block. The check for kv_transfer_config is now performed twice consecutively. This should be consolidated into a single check to improve code clarity and maintainability.
Signed-off-by: Kuntai Du <[email protected]>
It's now fixed. Thanks for the reminder. |
NickLucche
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixes
(EngineCore_DP0 pid=3595930) ValueError: Connector NixlConnector does not support HMA but HMA is enabled. Please set `--disable-hybrid-kv-cache-manager`.
I think we should just change logic to disabling it if it does not support hma with the interface you defined, given it's on by default
njhill
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @KuntaiDu
Signed-off-by: KuntaiDu <[email protected]>
…thub.com/KuntaiDu/vllm into kuntai-disable-HMA-for-connector-for-now
Signed-off-by: KuntaiDu <[email protected]>
Agree. For now let's temporarily disable it for all connectors to avoid blocking the release. Will figure out a way to only turn off HMA when connector does not support it. |
njhill
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @KuntaiDu
ApostaC
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
|
Ready to be force merged? |
Yes and ty for the force merge. |
…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>
…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>
…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>
…oject#27592) Signed-off-by: KuntaiDu <[email protected]> Signed-off-by: Kuntai Du <[email protected]>
Due to #25712 , currently vLLM will hard fail if the user simply set a connector because vLM enables HMA by default.
To avoid hard fail, before discussing with @njhill to figure out a better solution, let's just turn off HMA when a connector is being set.
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.