[Bug]: vLLM does not support deploying Qwen3-series models on V100-SXM2-32GB

### Your current environment

# Versions
vLLM 0.9.0.1
CUDA 12.8

### 🐛 Describe the bug

# Problem
Deployment succeeds, but it crashes as soon as the model is called.

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.