✨ add debug perf logger #515

joerunde · 2025-10-09T23:08:55Z

Description

This PR adds a debug-mode performance logger that will print the timing stats for each individual request. These are the stats collected by the engine which are aggregated into prometheus metrics. This splits out the timing info into e2e time, queue time, prefill time, and decode time for a better understanding of how time is spent inside of vllm.

Additionally, for each request this will attempt to calculate

the amount of time that that request was spent interrupted waiting on a new request to prefill to enter the batch
the mean itl from decode passes only, excluding the time that the request was interrupted

These are included as the prefill_interrupt and decode_only_itl fields.

This uses the existing VLLM_SPYRE_PERF_METRIC_LOGGING_ENABLED and VLLM_SPYRE_PERF_METRIC_LOGGING_DIR configs, and writes the results to a .jsonl file with the following fields

{"timestamp": x, "prefill_interrupt_seconds": x, "decode_only_itl_seconds": x, "finish_reason": x, "num_prompt_tokens": x, "num_generation_tokens": x, "max_tokens_param": x, "e2e_latency_seconds": x, "queued_time_seconds": x, "prefill_time_seconds":x, "inference_time_seconds": x, "decode_time_seconds": x, "mean_time_per_output_token_seconds": x}

Extending the vLLM StatLoggers could allow us to create custom prometheus metrics as well, if any of the extra info about prefill interrupt time would be helpful on a dashboard.

Signed-off-by: Joe Runde <[email protected]>

github-actions · 2025-10-09T23:09:03Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

Signed-off-by: Joe Runde <[email protected]>

docs/contributing/README.md

vllm_spyre/v1/metrics/stats_logger.py

vllm_spyre/v1/stats_logger.py

Signed-off-by: Joe Runde <[email protected]>

joerunde · 2025-10-10T20:59:23Z

bot:test
MARKERS="spyre and cb and not multi and not quantized"

tjohnson31415

Couple of tiny nits, but can merge as is.

vllm_spyre/v1/metrics/stats_logger.py

Co-authored-by: Travis Johnson <[email protected]> Signed-off-by: Joe Runde <[email protected]>

Signed-off-by: Joe Runde <[email protected]>

✨ add debug perf logger

4c3c4e3

Signed-off-by: Joe Runde <[email protected]>

joerunde requested review from nikolaospapandreou, prashantgupta24, rafvasq, sducouedic, tdoublep and yannicks1 as code owners October 9, 2025 23:08

joerunde added 7 commits October 9, 2025 17:14

🐛 0.10.2 backwards compat

da3aa0c

Signed-off-by: Joe Runde <[email protected]>

🐛 move test to make GHA happy

33c3b00

Signed-off-by: Joe Runde <[email protected]>

🎨 fmt

da72bd3

Signed-off-by: Joe Runde <[email protected]>

📝 add docs

b89dd9b

Signed-off-by: Joe Runde <[email protected]>

📝 more docs

94c0e46

Signed-off-by: Joe Runde <[email protected]>

🎨 typos

6943840

Signed-off-by: Joe Runde <[email protected]>

🎨 typos

09d8682

Signed-off-by: Joe Runde <[email protected]>

tjohnson31415 reviewed Oct 10, 2025

View reviewed changes

docs/contributing/README.md Outdated Show resolved Hide resolved

vllm_spyre/v1/metrics/stats_logger.py Show resolved Hide resolved

vllm_spyre/v1/stats_logger.py Outdated Show resolved Hide resolved

vllm_spyre/v1/stats_logger.py Outdated Show resolved Hide resolved

joerunde added 6 commits October 10, 2025 12:06

♻️ move to v1.metrics

99c1eeb

Signed-off-by: Joe Runde <[email protected]>

🎨 flatten json, add units

6c4a5e2

Signed-off-by: Joe Runde <[email protected]>

📝 update docs

4dd407a

Signed-off-by: Joe Runde <[email protected]>

🔥 remove restrictio on CB only

7f5271e

Signed-off-by: Joe Runde <[email protected]>

🎨 fmt

cddcea9

Signed-off-by: Joe Runde <[email protected]>

🐛 guard against missing keys, update tests

1b5113c

Signed-off-by: Joe Runde <[email protected]>

tjohnson31415 approved these changes Oct 10, 2025

View reviewed changes

vllm_spyre/v1/metrics/stats_logger.py Outdated Show resolved Hide resolved

vllm_spyre/v1/metrics/stats_logger.py Outdated Show resolved Hide resolved

maxdebayser reviewed Oct 10, 2025

View reviewed changes

vllm_spyre/v1/metrics/stats_logger.py Outdated Show resolved Hide resolved

Apply suggestions from code review

2b005a6

Co-authored-by: Travis Johnson <[email protected]> Signed-off-by: Joe Runde <[email protected]>

joerunde force-pushed the perf-logger branch from d156183 to 2b005a6 Compare October 10, 2025 22:43

♻️ open file once

a06f6c7

Signed-off-by: Joe Runde <[email protected]>

joerunde merged commit dff277b into main Oct 10, 2025
19 checks passed

joerunde deleted the perf-logger branch October 10, 2025 23:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

✨ add debug perf logger #515

✨ add debug perf logger #515

Uh oh!

joerunde commented Oct 9, 2025 •

edited by yannicks1

Loading

Uh oh!

github-actions bot commented Oct 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

joerunde commented Oct 10, 2025

Uh oh!

tjohnson31415 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

✨ add debug perf logger #515

✨ add debug perf logger #515

Uh oh!

Conversation

joerunde commented Oct 9, 2025 • edited by yannicks1 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

github-actions bot commented Oct 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

joerunde commented Oct 10, 2025

Uh oh!

tjohnson31415 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

joerunde commented Oct 9, 2025 •

edited by yannicks1

Loading