Skip to content

Conversation

@zejunchen-zejun
Copy link

@zejunchen-zejun zejunchen-zejun commented Nov 20, 2025

Port the PR: vllm-project#27380

B, self.num_heads, self.kv_lora_rank, dtype=q.dtype, device=q.device
)
B, self.num_heads, self.kv_lora_rank, dtype=torch.bfloat16, device=q.device
).fill_(-1)
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ganyi1996ppo do we need this fill?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think we do

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

@zejunchen-zejun zejunchen-zejun force-pushed the zejun/port_ganyi_mla_to_dev_perf branch from b76c385 to 331643e Compare November 26, 2025 05:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants