
Conversation

@tjohnson31415 (Collaborator) commented Dec 4, 2025

Description

In our internal CI we run unit tests on Spyre devices. This PR fixes some of the new Chunked Prefill tests so that they pass on Spyre.

The two issues causing tests to fail:

  • the chunk size exceeding the max model length (because the vLLM default is 2048, or granite TP4 detection sets it to 4096)
  • an edge case I don't fully understand in test_single_cp_prefill that causes inference to fail with `DtException: No matching compiler iter found`
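The first issue amounts to a simple clamp. A minimal sketch, assuming a hypothetical helper (`choose_chunk_size` is illustrative, not code from this PR):

```python
def choose_chunk_size(default_chunk: int, max_model_len: int) -> int:
    """Hypothetical helper illustrating the first fix: the prefill chunk
    size must never exceed the model's maximum sequence length, so the
    configured default is clamped down to max_model_len."""
    return min(default_chunk, max_model_len)

# With vLLM's default chunk of 2048 and a test max_model_len of 512,
# the effective chunk size drops to 512.
print(choose_chunk_size(2048, 512))
```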

I also found that applying @pytest.mark.cpu to every test in test_spyre_cp_scheduler_steps.py is incorrect: the mark is already applied automatically by the backend parameterization.
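The mark-from-parameterization pattern can be sketched as follows; the backend names and mark wiring here are assumptions for illustration, not the repo's actual fixture code:

```python
import pytest

# When the backend parameterization itself carries the "cpu" mark,
# an extra @pytest.mark.cpu decorator on each test is redundant.
BACKENDS = [
    pytest.param("eager", marks=pytest.mark.cpu),  # mark attached here
]

@pytest.mark.parametrize("backend", BACKENDS)
def test_scheduler_step(backend):
    # The test picks up the "cpu" mark from its parameter, not from a
    # decorator on the function itself.
    assert backend == "eager"
```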

@github-actions bot commented Dec 4, 2025

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure that your code passes all the linting checks, otherwise your PR won't be mergeable. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

"use_cb": False,
"warmup_shapes": warmup_shapes,
})
patch_environment(
Nice simplification!

* number of prompts: 1
* 0: len = 512, max tokens = 1, step joining = 0
"""
# max_model_len=514 tests an edge case in the scheduler, but does not work
Nice catch. Maybe in sendnn the max model len has to be a multiple of 64

@tjohnson31415 (Author) replied:
I was wondering something like that as well, but values like 576, 640, and 768 also didn't work in my testing.

@maxdebayser (Collaborator) left a comment:
LGTM, thanks for the fixes!

monkeypatch.setenv("VLLM_SPYRE_USE_CHUNKED_PREFILL",
"1" if use_chunked_prefill else "0")
# NB: setting this env var explicitly is needed to set the desired value for
# the chunk size in the case that granite 8b TP4 is detected
In that case wouldn't we also need to re-override the internal config for max_num_batched_tokens as well?

@tjohnson31415 (Author) replied Dec 4, 2025:
VLLM_DT_CHUNK_LEN currently takes precedence over the user- or vLLM-set max_num_batched_tokens after the changes in #571.
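The precedence described here can be sketched roughly as below; `effective_chunk_len` is an assumed name for illustration, not the actual function in the codebase:

```python
import os

def effective_chunk_len(max_num_batched_tokens: int) -> int:
    """Sketch of the described precedence: an explicitly set
    VLLM_DT_CHUNK_LEN env var overrides whatever max_num_batched_tokens
    the user or the vLLM config computed."""
    env_val = os.environ.get("VLLM_DT_CHUNK_LEN")
    if env_val is not None:
        return int(env_val)
    return max_num_batched_tokens

os.environ["VLLM_DT_CHUNK_LEN"] = "1024"
print(effective_chunk_len(4096))  # env var wins: 1024
```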

very hot 🌶️

@tjohnson31415 tjohnson31415 merged commit ef47741 into main Dec 4, 2025
18 of 19 checks passed
@tjohnson31415 tjohnson31415 deleted the test-cp-spyre branch December 4, 2025 20:42