[Doc] Format profiling doc #993
Conversation
Signed-off-by: lishunyang <[email protected]>
Pull request overview
Formats and updates the profiling documentation to provide clearer guidance for profiling omni-modality and diffusion workflows in vLLM-Omni.
Changes:
- Updates terminology and section headings for omni-modality profiling.
- Renames model examples (Qwen2.5-Omni / Qwen3-Omni) and restructures diffusion profiling into its own section.
- Removes the async/online profiling section and updates the external vLLM profiling guide link.
Comments suppressed due to low confidence (1)
docs/contributing/profiling.md:90
- In this diffusion profiling section, the heading uses sentence case and the CLI example is fenced as `python` even though it's a shell command block. Please switch the fence to `bash` (and consider using Title Case for the heading to match the rest of the document).
### 3. Profiling diffusion models
Diffusion profiling is End-to-End, capturing encoding, denoising loops, and decoding.
**CLI Usage:**
```python
```
Co-authored-by: Copilot <[email protected]> Signed-off-by: Hongsheng Liu <[email protected]>
Signed-off-by: lishunyang <[email protected]>
> As of now, asynchronous (online) profiling is not fully supported in vLLM-Omni. While `start_profile()` and `stop_profile()` methods exist, they are only reliable in offline inference scripts (e.g., the provided `end2end.py` examples). Do not use them in server-mode or streaming scenarios; traces may be incomplete or fail to flush.
>
> **Online Inference (Async)**
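The offline-only caveat above can be illustrated with a minimal sketch. Only the method names `start_profile()` and `stop_profile()` come from the doc text; the engine class, its other methods, and the wrapper function are hypothetical stubs, not vLLM-Omni's actual API:

```python
# Sketch of the offline profiling pattern described above.
# OfflineEngine is a placeholder stub, not the real vLLM-Omni engine;
# only start_profile()/stop_profile() mirror names from the doc.

class OfflineEngine:
    """Placeholder engine that records profiler state transitions."""

    def __init__(self):
        self.trace = []

    def start_profile(self):
        # In the real engine this would begin a profiler capture.
        self.trace.append("start")

    def stop_profile(self):
        # In the real engine this would flush the trace to disk.
        self.trace.append("stop")

    def generate(self, prompt):
        self.trace.append(f"generate:{prompt}")
        return f"output for {prompt}"


def run_offline_with_profiling(engine, prompts):
    # Wrap the whole offline batch so the trace is flushed exactly once,
    # mirroring the end2end.py-style offline scripts the doc recommends.
    engine.start_profile()
    try:
        return [engine.generate(p) for p in prompts]
    finally:
        engine.stop_profile()


engine = OfflineEngine()
outputs = run_offline_with_profiling(engine, ["a", "b"])
```

Because the profiler is stopped in a `finally` block after the full batch, the trace is flushed even if generation raises, which is exactly the guarantee the doc says server-mode and streaming scenarios currently lack.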
@gcanlin
I recall that the Omni pipeline supports profiling in AsyncOmni.
AsyncOmni's methods support profiling, but it has not been validated in examples. We will update it in a separate PR, given that online serving profiling is less common than offline profiling.
> online serving profiling is less common than offline one.
I don't think so.
Signed-off-by: lishunyang <[email protected]>
Signed-off-by: Hongsheng Liu <[email protected]>
Co-authored-by: Hongsheng Liu <[email protected]>
Co-authored-by: Copilot <[email protected]>
Purpose
This PR formats the profiling page to provide better guidance and clearer instructions.
@hsliuustc0106
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)