[Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry #29103
Signed-off-by: Matthew Bonanni <[email protected]>
Code Review
This pull request registers a new attention backend, ROCM_AITER_MLA_SPARSE, and refactors the code to use the new enum member instead of a hardcoded string. The changes are a good improvement for maintainability. However, I've identified a critical issue in the implementation of the new backend that will prevent it from being used. The backend class is missing necessary method overrides, which will cause validation to fail during backend selection.
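To illustrate the kind of failure described above, here is a minimal sketch. It does not reproduce the project's actual classes or method names: `AttentionBackendBase`, `is_mla`, `is_sparse`, `validate`, `select_backend`, and both backend classes are hypothetical stand-ins. It assumes a base class whose capability queries default to conservative values, so a backend that forgets to override them is rejected when it is selected through an enum-keyed registry.

```python
from enum import Enum


class AttentionBackendBase:
    """Hypothetical base class; capability defaults are conservative."""

    @classmethod
    def is_mla(cls) -> bool:
        return False

    @classmethod
    def is_sparse(cls) -> bool:
        return False

    @classmethod
    def validate(cls, use_mla: bool, use_sparse: bool) -> list:
        # Collect every reason this backend cannot serve the request.
        reasons = []
        if use_mla != cls.is_mla():
            reasons.append("MLA mismatch")
        if use_sparse != cls.is_sparse():
            reasons.append("sparse mismatch")
        return reasons


class SparseMLABackendMissingOverrides(AttentionBackendBase):
    """Inherits the defaults, so it never validates for sparse MLA."""


class SparseMLABackend(AttentionBackendBase):
    """Overrides the capability queries to declare what it supports."""

    @classmethod
    def is_mla(cls) -> bool:
        return True

    @classmethod
    def is_sparse(cls) -> bool:
        return True


class Backend(Enum):
    # An enum member replaces a hardcoded string; a typo in the name
    # raises at lookup time instead of silently matching nothing.
    ROCM_AITER_MLA_SPARSE = "ROCM_AITER_MLA_SPARSE"


# Hypothetical registry mapping enum members to backend classes.
REGISTRY = {Backend.ROCM_AITER_MLA_SPARSE: SparseMLABackend}


def select_backend(backend: Backend, use_mla: bool, use_sparse: bool):
    cls = REGISTRY[backend]
    reasons = cls.validate(use_mla, use_sparse)
    if reasons:
        raise ValueError(f"{backend.name} rejected: {', '.join(reasons)}")
    return cls


if __name__ == "__main__":
    # The class without overrides reports both mismatches.
    print(SparseMLABackendMissingOverrides.validate(True, True))
    # The class with overrides is selected without error.
    print(select_backend(Backend.ROCM_AITER_MLA_SPARSE, True, True).__name__)
```

Under these assumptions, registering the enum member alone is not enough: the class it points to also has to declare the capabilities that the selection logic checks.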
…llm-project#29103) Signed-off-by: Matthew Bonanni <[email protected]> Signed-off-by: LuminolT <[email protected]>
…llm-project#29103) Signed-off-by: Matthew Bonanni <[email protected]>
…llm-project#29103) Signed-off-by: Matthew Bonanni <[email protected]> Signed-off-by: Runkai Tao <[email protected]>
…llm-project#29103) Signed-off-by: Matthew Bonanni <[email protected]> Signed-off-by: Xingyu Liu <[email protected]>
Purpose
Adds the new ROCM_AITER_MLA_SPARSE backend introduced by #26670 to the attention backend registry.
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.