Commit 3e50d7f
committed
[Core] Force PIECEWISE CUDAGraph mode for encoder-decoder
Whisper does not work with full cudagraphs. That is being worked on in
PR vllm-project#25208.
The failure can be reproduced reliably via
`tests/models/multimodal/generation/test_whisper.py`, at least in my
H100 development environment. The tests passed on the PR and I'm not
sure why.
Regardless, this seems like the right change to make until vllm-project#25208
sorts out exactly what changes are needed.
Signed-off-by: Russell Bryant <[email protected]>1 parent 3468f17 commit 3e50d7f
1 file changed
+4
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
364 | 364 | | |
365 | 365 | | |
366 | 366 | | |
367 | | - | |
| 367 | + | |
| 368 | + | |
368 | 369 | | |
369 | | - | |
| 370 | + | |
| 371 | + | |
370 | 372 | | |
371 | 373 | | |
372 | 374 | | |
| |||
0 commit comments