Your current environment
The output of python collect_env.py
Your output of `python collect_env.py` here
Your code version
The commit id or version of vllm
The commit id or version of vllm-omni
🐛 Describe the bug
The tokens output by the talker model of qwen2.5-omni are incorrect when enforce eager (in qwen2_5_omni.yaml) is set to false (running compile on NPU), but it's correct for thinker model under the same parameters
Before submitting a new issue...