[Frontend][torch.compile] CompilationConfig Overhaul (#20283): rename compilation level to compilation mode, deprecate compilation level #26355
Merged
ProExpertProg merged 31 commits into vllm-project:main from morrison-turnansky:issue-20283-level on Oct 15, 2025.
Conversation
Contributor: This pull request has merge conflicts that must be resolved before it can be merged.
Force-pushed from 4753011 to e347d4d.
Contributor (Author): Issue created: #26911
bbartels pushed a commit to bbartels/vllm that referenced this pull request on Oct 16, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>; Signed-off-by: bbartels <[email protected]>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request on Oct 20, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>
hmellor added a commit to hmellor/vllm that referenced this pull request on Oct 21, 2025. Signed-off-by: Harry Mellor <[email protected]>
hmellor added a commit that referenced this pull request on Oct 22, 2025. Signed-off-by: Harry Mellor <[email protected]>
usberkeley pushed a commit to usberkeley/vllm that referenced this pull request on Oct 23, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>
albertoperdomo2 pushed a commit to albertoperdomo2/vllm that referenced this pull request on Oct 23, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>; Signed-off-by: Alberto Perdomo <[email protected]>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request on Oct 24, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>
wangxiyuan pushed a commit to vllm-project/vllm-ascend that referenced this pull request on Oct 24, 2025:

### What this PR does / why we need it?
This is step 1 of refactoring the code to adapt to vLLM main; this PR is aligned with vllm-project/vllm@17c540a.
1. Refactor deepseek to the latest code architecture as of vllm-project/vllm@17c540a.
2. A batch of fixes due to vLLM changes:
- Fix `AscendScheduler` `__post_init__`, caused by vllm-project/vllm#25075
- Fix `AscendScheduler` init getting an unexpected arg `block_size`, caused by vllm-project/vllm#26296
- Fix the `KVCacheManager` `get_num_common_prefix_blocks` arg, caused by vllm-project/vllm#23485
- Fix the `MLAAttention` import, caused by vllm-project/vllm#25103
- Fix the `SharedFusedMoE` import, caused by vllm-project/vllm#26145
- Fix the `LazyLoader` import, caused by vllm-project/vllm#27022
- Fix the `vllm.utils.swap_dict_values` import, caused by vllm-project/vllm#26990
- Fix the `Backend` enum import, caused by vllm-project/vllm#25893
- Fix the `CompilationLevel` to `CompilationMode` renaming issue introduced by vllm-project/vllm#26355
- Fix fused_moe ops, caused by vllm-project/vllm#24097
- Fix the bert model because of `inputs_embeds`, caused by vllm-project/vllm#25922
- Fix MRope because of the `get_input_positions_tensor` to `get_mrope_input_positions` rename, caused by vllm-project/vllm#24172
- Fix `splitting_ops` changes introduced by vllm-project/vllm#25845
- Fix multi-modality changes introduced by vllm-project/vllm#16229
- Fix the lora bias dropping issue introduced by vllm-project/vllm#25807
- Fix the structured output break introduced by vllm-project/vllm#26737

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
CI passed with existing tests.
- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: MengqingCao <[email protected]>; Signed-off-by: Icey <[email protected]>; Co-authored-by: Icey <[email protected]>
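Downstream plugins hit by the `CompilationLevel` to `CompilationMode` rename commonly bridge both names with an import fallback so one codebase works before and after the change. A minimal sketch, assuming the `vllm.config` module path; the final stub branch exists only to keep the example runnable without vLLM installed:

```python
# Hypothetical version-compatibility shim a downstream plugin (such as
# vllm-ascend) might use across the rename. `vllm.config` is the assumed
# upstream module path; this is an illustrative sketch, not vllm-ascend's code.
try:
    from vllm.config import CompilationMode  # new name (post-#26355 vLLM)
except ImportError:
    try:
        from vllm.config import CompilationLevel as CompilationMode  # old name
    except ImportError:
        # Fallback stub so this sketch runs without vLLM; member names assumed.
        import enum

        class CompilationMode(enum.IntEnum):
            NONE = 0
            VLLM_COMPILE = 3

# Downstream code can now reference CompilationMode regardless of vLLM version.
```

The shim imports the new name first so the deprecation path is only exercised on older vLLM versions.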
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request on Oct 24, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>; Signed-off-by: xuebwang-amd <[email protected]>
xuebwang-amd pushed a second commit to xuebwang-amd/vllm that referenced this pull request on Oct 24, 2025, with the same message and sign-offs.
kingsmad pushed a commit to kingsmad/vllm that referenced this pull request on Oct 25, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request on Oct 26, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>; Signed-off-by: 0xrushi <[email protected]>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request on Oct 26, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>; Signed-off-by: 0xrushi <[email protected]>
0xrushi pushed two further commits to 0xrushi/vllm that referenced this pull request on Oct 26, 2025, with the same two messages and sign-offs.
ilmarkov pushed a commit to neuralmagic/vllm that referenced this pull request on Nov 7, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request on Nov 10, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request on Nov 10, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>
Zhathw pushed a commit to Zhathw/vllm that referenced this pull request on Nov 12, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>
luolun pushed a commit to luolun/vllm-ascend that referenced this pull request on Nov 19, 2025, with the same vllm-ascend refactor message as above; additionally Signed-off-by: luolun <[email protected]>
hwhaokun pushed a commit to hwhaokun/vllm-ascend that referenced this pull request on Nov 19, 2025, with the same message; additionally Signed-off-by: hwhaokun <[email protected]>
NSDie pushed a commit to NSDie/vllm-ascend that referenced this pull request on Nov 24, 2025, with the same message; additionally Signed-off-by: nsdie <[email protected]>
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request on Nov 29, 2025: "…): name change compilation level to compilation mode, deprecation compilation level (vllm-project#26355)". Signed-off-by: morrison-turnansky <[email protected]>; Signed-off-by: Morrison Turnansky <[email protected]>; Co-authored-by: Luka Govedič <[email protected]>
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request on Nov 29, 2025: "…m-project#27260)". Signed-off-by: Harry Mellor <[email protected]>
Labels
documentation (Improvements or additions to documentation), frontend, llama (Related to Llama models), ready (ONLY add when PR is ready to merge/full CI is needed), speculative-decoding, tpu (Related to Google TPUs), v1
Purpose
See #20283 (comment)
This PR renames "compilation level" to "compilation mode" and emits deprecation warnings for the old name. Enum value names are also changed. There are no behavioral or breaking changes.
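The rename-with-deprecation pattern the PR describes can be sketched as follows. The enum members and the resolver function below are illustrative assumptions, not vLLM's actual implementation:

```python
import enum
import warnings


class CompilationMode(enum.IntEnum):
    """New name for the old CompilationLevel enum; member names assumed."""
    NONE = 0
    STOCK_TORCH_COMPILE = 1
    DYNAMO_TRACE_ONCE = 2
    VLLM_COMPILE = 3


def resolve_compilation_mode(mode=None, level=None):
    """Accept the new `mode` field, or the deprecated `level` with a warning.

    Hypothetical helper illustrating the deprecation path; old integer
    levels map 1:1 onto the new enum, so behavior does not change.
    """
    if level is not None:
        warnings.warn(
            "`compilation level` is deprecated; use `compilation mode` instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        if mode is None:
            mode = level
    return CompilationMode(mode if mode is not None else 0)
```

A caller still passing the old field gets the same configuration plus a `DeprecationWarning`, e.g. `resolve_compilation_mode(level=3)` returns `CompilationMode.VLLM_COMPILE`.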
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.