[Bugfix][DeepSeek Fix config used for DeepseekV2 Eagle #25953

tlrmchlsmth · 2025-09-30T13:09:47Z

We are using the verifier model's config instead of the draft model's config when using Eagle for Deepseek.

Introduced in #24134. Similar issue for llama3 was fixed in #25883.

Signed-off-by: Tyler Michael Smith <[email protected]>

gemini-code-assist

Code Review

This pull request correctly fixes a bug in the DeepseekV2 Eagle speculative decoding implementation. Previously, the draft model's decoder layers were incorrectly using the verifier model's configuration. The changes introduce an optional config parameter to the DeepseekV2DecoderLayer initializer, allowing the correct draft model configuration to be passed. The implementation maintains backward compatibility by falling back to the verifier model's configuration when the new parameter is not provided. The fix is well-implemented and aligns with similar corrections in the codebase.

benchislett · 2025-09-30T14:47:45Z

vllm/model_executor/models/deepseek_eagle.py

            DeepseekV2DecoderLayer(
                vllm_config,
                prefix=maybe_prefix(prefix, f"layers.{i + start_layer_id}"),
+                config=self.config,


Should this also be applied in deepseek_mtp.py?

I didn't touch deepseek_mtp.py because even prior to #24134, it passed in the verifier model config rather than the draft model... I guess it was always broken?

benchislett

config needs to be added at the end of DeepseekV2DecoderLayer's constructor, because deepseek_mtp.py passes the arguments positionally

vllm/vllm/model_executor/models/deepseek_mtp.py

Line 68 in ef28354

self.mtp_block = DeepseekV2DecoderLayer(vllm_config, prefix,

tlrmchlsmth · 2025-09-30T21:00:05Z

closing in favor of #25987

Fix config passed to deepseek_eagle

9ffb182

Signed-off-by: Tyler Michael Smith <[email protected]>

mergify bot added deepseek Related to DeepSeek models speculative-decoding labels Sep 30, 2025

gemini-code-assist bot reviewed Sep 30, 2025

View reviewed changes

benchislett reviewed Sep 30, 2025

View reviewed changes

benchislett requested changes Sep 30, 2025

View reviewed changes

benchislett mentioned this pull request Sep 30, 2025

[Bugfix] Allow skipping MoE in NVFP4 (fix for MTP) #25987

Merged

tlrmchlsmth closed this Sep 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix][DeepSeek Fix config used for DeepseekV2 Eagle #25953

[Bugfix][DeepSeek Fix config used for DeepseekV2 Eagle #25953

Uh oh!

tlrmchlsmth commented Sep 30, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

benchislett Sep 30, 2025

Uh oh!

tlrmchlsmth Sep 30, 2025

Uh oh!

benchislett left a comment

Uh oh!

tlrmchlsmth commented Sep 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Bugfix][DeepSeek Fix config used for DeepseekV2 Eagle #25953

[Bugfix][DeepSeek Fix config used for DeepseekV2 Eagle #25953

Uh oh!

Conversation

tlrmchlsmth commented Sep 30, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

benchislett Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

tlrmchlsmth Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

benchislett left a comment

Choose a reason for hiding this comment

Uh oh!

tlrmchlsmth commented Sep 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tlrmchlsmth commented Sep 30, 2025 •

edited by github-actions bot

Loading