
[CI]: Bagel E2E Smoked Test #1074

Merged
hsliuustc0106 merged 15 commits into vllm-project:main from princepride:bagel-e2e-pytest
Jan 31, 2026

Conversation

@princepride
Collaborator

@princepride princepride commented Jan 29, 2026

Purpose

Add the E2E Bagel test discussed in #400.

Add end-to-end unit tests for Bagel text-to-image generation to validate the model's image generation pipeline. This PR includes:

  • Shared Memory Connector Test: Tests Bagel text2img generation using the default shared memory connector for inter-stage communication
  • Mooncake Connector Test: Tests Bagel text2img generation using the Mooncake connector for distributed inter-stage communication
  • Buildkite CI Integration: Added a new CI step to run the Bagel text2img tests on H100 GPU

The tests validate that generated images match expected reference pixel values within a ±5 tolerance, ensuring consistent and reproducible image generation output.
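The ±5 pixel-tolerance check can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual test code; the function name and arrays are hypothetical:

```python
# Hypothetical sketch of a per-pixel tolerance check like the one described
# above; not the actual test implementation.
import numpy as np

def pixels_match(generated: np.ndarray, reference: np.ndarray, tol: int = 5) -> bool:
    """Return True if every pixel differs from the reference by at most `tol`."""
    # Cast to a signed type first so uint8 subtraction does not wrap around.
    diff = np.abs(generated.astype(np.int16) - reference.astype(np.int16))
    return bool((diff <= tol).all())

gen = np.array([[10, 20], [30, 40]], dtype=np.uint8)
ref = np.array([[12, 18], [30, 44]], dtype=np.uint8)
print(pixels_match(gen, ref))  # True: the largest per-pixel difference is 4
```

An exact-equality check would make the test flaky across driver and kernel versions; a small tolerance keeps it deterministic enough to catch real regressions.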

Test Plan

pytest -s -v tests/e2e/offline_inference/test_bagel_text2img.py

Result

root@deepseek-v3-2-vllm-77f648c586-w95x2:/proj-tango-pvc/users/zhipeng.wang/workspace/vllm-omni# pytest tests/model_executor/models/bagel/test_bagel_text2img.py -v
========================================================== test session starts ==========================================================
platform linux -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0 -- /usr/bin/python3.12
cachedir: .pytest_cache
rootdir: /proj-tango-pvc/users/zhipeng.wang/workspace/vllm-omni
configfile: pyproject.toml
plugins: shard-0.1.2, timeout-2.4.0, cov-7.0.0, forked-1.6.0, asyncio-1.3.0, rerunfailures-16.1, anyio-4.12.0
asyncio: mode=Mode.STRICT, debug=False, asyncio_default_fixture_loop_scope=None, asyncio_default_test_loop_scope=function
collected 2 items                                                                                                                       
Running 2 items in this shard: tests/model_executor/models/bagel/test_bagel_text2img.py::test_bagel_text2img_shared_memory_connector, tests/model_executor/models/bagel/test_bagel_text2img.py::test_bagel_text2img_mooncake_connector

tests/model_executor/models/bagel/test_bagel_text2img.py::test_bagel_text2img_shared_memory_connector PASSED                      [ 50%]
tests/model_executor/models/bagel/test_bagel_text2img.py::test_bagel_text2img_mooncake_connector PASSED                           [100%]
E0129 01:14:04.789839 2952173 master_client.cpp:208] RPC call failed: End of file

Signed-off-by: princepride <wangzhipeng628@gmail.com>
@princepride
Collaborator Author

@codex review

@princepride
Collaborator Author

@congw729 PTAL


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7915a89633


@princepride princepride mentioned this pull request Jan 29, 2026
Collaborator

@Gaohan123 Gaohan123 left a comment


I don't think this is a unit test; unit tests are just for API checks and protection. Your PR is more like an E2E smoke test (ST), which is better suited to the e2e folder.

@princepride
Collaborator Author

I don't think this is a unit test; unit tests are just for API checks and protection. Your PR is more like an E2E smoke test (ST), which is better suited to the e2e folder.

Thank you for your feedback; I will change it.

@princepride
Collaborator Author

@Gaohan123 @congw729 PTAL. I hope this test can be merged soon to prevent new PRs from unintentionally modifying the relevant code.

@princepride princepride changed the title [CI]: Bagel E2E Unit Test [CI]: Bagel E2E Smoked Test Jan 30, 2026
@david6666666 david6666666 added the ready label to trigger buildkite CI Jan 30, 2026
@hsliuustc0106 hsliuustc0106 requested a review from Copilot January 30, 2026 03:25
@hsliuustc0106
Collaborator

@yenuo26 PTAL

@hsliuustc0106 hsliuustc0106 removed the ready label to trigger buildkite CI Jan 30, 2026
Contributor

Copilot AI left a comment


Pull request overview

This PR adds end-to-end tests for Bagel text-to-image generation to strengthen CI coverage for diffusion-based models. The tests validate image generation output using pixel-level comparisons and support both SharedMemory and Mooncake connectors for inter-stage communication.

Changes:

  • Added comprehensive E2E tests for Bagel text2img generation with two connector types (SharedMemory and Mooncake)
  • Updated GPU memory utilization in Bagel stage configurations to support single-GPU testing (redistributed from 0.4/0.4 to 0.35/0.55)
  • Added Buildkite CI step for automated testing on H100 GPUs
  • Added mooncake-transfer-engine dependency to development requirements
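The 0.35/0.55 memory split mentioned above can be sanity-checked with a quick sketch. The stage names and field layout here are assumptions for illustration, not the actual bagel.yaml schema:

```python
# Hypothetical illustration of the single-GPU memory split described above;
# stage names and fields are assumptions, not the actual bagel.yaml schema.
stages = {
    "autoregressive_stage": {"gpu_memory_utilization": 0.35},
    "dit_stage": {"gpu_memory_utilization": 0.55},
}

total = sum(cfg["gpu_memory_utilization"] for cfg in stages.values())
# Both stages must fit on one GPU, leaving headroom for CUDA context, etc.
print(total <= 1.0)  # True
```

The earlier 0.4/0.4 split also summed to less than 1.0; redistributing toward the DiT stage reflects its larger activation footprint during image generation.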

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 17 comments.

Show a summary per file
File Description
tests/e2e/offline_inference/test_bagel_text2img.py New test file with two test functions validating Bagel text2img generation using reference pixel comparisons
vllm_omni/model_executor/stage_configs/bagel.yaml Adjusted GPU memory utilization to accommodate both stages on single GPU (0.35/0.55 split)
vllm_omni/model_executor/stage_configs/bagel_multiconnector.yaml Updated device allocation to use single GPU and adjusted memory utilization; now both stages run on device 0
.buildkite/pipeline.yml Added new CI step for Bagel text2img tests on H100 GPU with 30-minute timeout
pyproject.toml Added mooncake-transfer-engine==0.3.8.post1 to dev dependencies


@hsliuustc0106
Collaborator

@princepride could you please add the benchmark results for diffusion models under the benchmark folder, maybe for the DiT-only model?

Signed-off-by: princepride <wangzhipeng628@gmail.com>
@princepride
Collaborator Author

DiT-only Model: BAGEL-7B-MoT (Text-to-Image)

Hardware: NVIDIA H200 141GB

Server command:

vllm serve ByteDance-Seed/BAGEL-7B-MoT --omni --port 8099

Benchmark command:

python3 benchmarks/diffusion/diffusion_benchmark_serving.py \
    --base-url http://localhost:8099 \
    --model ByteDance-Seed/BAGEL-7B-MoT \
    --task t2i \
    --dataset vbench \
    --num-prompts 5

Results:

================= Serving Benchmark Result =================
Model:                                   ByteDance-Seed/BAGEL-7B-MoT
Dataset:                                 vbench         
Task:                                    t2i            
--------------------------------------------------
Benchmark duration (s):                  53.55          
Request rate:                            inf            
Max request concurrency:                 1              
Successful requests:                     5/5              
--------------------------------------------------
Request throughput (req/s):              0.09           
Latency Mean (s):                        10.7092        
Latency Median (s):                      10.0302        
Latency P99 (s):                         12.6974        

============================================================
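The reported figures are internally consistent; a quick check, with the values copied from the table above:

```python
# Sanity-check the reported benchmark figures (values copied from the result above).
duration_s = 53.55
successful_requests = 5
latency_mean_s = 10.7092

# Throughput = successful requests / wall-clock duration.
print(round(successful_requests / duration_s, 2))  # 0.09 req/s

# With max concurrency 1, requests run back-to-back, so the total duration
# should be close to the sum of per-request latencies.
print(round(latency_mean_s * successful_requests, 2))  # ≈ 53.55 s
```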

@princepride
Collaborator Author

@hsliuustc0106 PTAL

Signed-off-by: princepride <wangzhipeng628@gmail.com>
@princepride princepride requested a review from Gaohan123 January 30, 2026 11:08
@princepride
Collaborator Author

@hsliuustc0106


@Gaohan123 Gaohan123 added the ready label to trigger buildkite CI Jan 30, 2026
@Gaohan123
Collaborator

Please resolve CI failures

@hsliuustc0106
Collaborator

Please resolve CI failures

It looks like an HF downloading problem.

Signed-off-by: princepride <wangzhipeng628@gmail.com>
@princepride
Collaborator Author

@hsliuustc0106 Ready to merge. 😊

@hsliuustc0106 hsliuustc0106 enabled auto-merge (squash) January 31, 2026 09:07
@hsliuustc0106 hsliuustc0106 merged commit de0a8b0 into vllm-project:main Jan 31, 2026
7 checks passed
dongbo910220 pushed a commit to dongbo910220/vllm-omni that referenced this pull request Feb 1, 2026
Signed-off-by: princepride <wangzhipeng628@gmail.com>
Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>