
[feat]: adapt batch request for flux #1028

Merged
david6666666 merged 1 commit into vllm-project:main from nuclearwu:request on Jan 29, 2026

Conversation

@nuclearwu
Contributor

@nuclearwu nuclearwu commented Jan 28, 2026

Signed-off-by: wuzhongjian [email protected]


Purpose

Adapt batch requests for FLUX (#853 and #797).

Test Plan

vLLM-Omni:

  1. Offline inference
python examples/offline_inference/text_to_image/text_to_image.py \
  --model black-forest-labs/FLUX.1-dev \
  --prompt "Beautiful illustration of The ocean. in a serene landscape, magic realism, narrative realism, beautiful matte painting, heavenly lighting, retrowave, 4 k hd wallpaper" \
  --seed 42 \
  --cfg_scale 4.0 \
  --tensor_parallel_size 4 \
  --num_images_per_prompt 1 \
  --num_inference_steps 50 \
  --guidance_scale 4.0 \
  --height 1024 \
  --width 1024 \
  --output outputs/ocean.png
  2. Online inference
export VLLM_WORKER_MULTIPROC_METHOD=spawn
MODEL_NAME_OR_PATH=/workspace/cache/ymttest/johnjan/models/black-forest-labs/FLUX___1-dev/

vllm serve ${MODEL_NAME_OR_PATH} \
   --omni \
   --port 8092 \
   --vae_use_slicing \
   --vae_use_tiling
curl -s http://localhost:8092/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Beautiful illustration of The ocean. in a serene landscape, magic realism, narrative realism, beautiful matte painting, heavenly lighting, retrowave, 4 k hd wallpaper"}
    ],
    "extra_body": {
      "height": 1024,
      "width": 1024,
      "num_inference_steps": 50,
      "guidance_scale": 4.0,
      "seed": 42
    }
  }' | jq -r '.choices[0].message.content[0].image_url.url' | cut -d',' -f2- | base64 -d > ocean-serve.png
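Since this PR is about batch-request handling, it can also help to put several requests in flight at once so the server has a chance to batch them. Below is a minimal client-side sketch that mirrors the curl body above and sends a few prompts concurrently; the endpoint and `extra_body` fields are taken from the commands in this test plan, while `build_payload`, `send`, and the prompt strings are hypothetical names for illustration.

```python
# Hypothetical sketch: send several text-to-image prompts concurrently to the
# OpenAI-compatible endpoint started by `vllm serve` above, so the server can
# batch them. Only the standard library is used.
import json
from concurrent.futures import ThreadPoolExecutor
from urllib import request

API_URL = "http://localhost:8092/v1/chat/completions"  # port from the serve command


def build_payload(prompt: str, seed: int = 42) -> dict:
    """Mirror the curl request body: one user message plus image parameters."""
    return {
        "messages": [{"role": "user", "content": prompt}],
        "extra_body": {
            "height": 1024,
            "width": 1024,
            "num_inference_steps": 50,
            "guidance_scale": 4.0,
            "seed": seed,
        },
    }


def send(prompt: str) -> bytes:
    """POST one request and return the raw JSON response body."""
    req = request.Request(
        API_URL,
        data=json.dumps(build_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return resp.read()


if __name__ == "__main__":
    # Four concurrent requests; the server decides how to batch them.
    prompts = [f"A lighthouse at dawn, variation {i}" for i in range(4)]
    with ThreadPoolExecutor(max_workers=4) as pool:
        for body in pool.map(send, prompts):
            print(len(body))
```

With more workers than the server can process at once, this also doubles as a quick smoke test that concurrent requests do not interfere with each other's seeds or image sizes.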
  3. Benchmark
python3 benchmarks/diffusion/diffusion_benchmark_serving.py \
	--base-url http://localhost:8092 \
	--model /workspace/cache/ymttest/johnjan/models/black-forest-labs/FLUX___1-dev/ \
	--task t2i \
	--dataset vbench \
	--num-prompts 100

Test Result

  1. Offline inference: result image attached
  2. Online inference: result image attached
  3. Benchmark: result image attached

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft.


@nuclearwu nuclearwu mentioned this pull request Jan 28, 2026
@david6666666
Collaborator

@fhfuih ptal thx


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f108c48697


@hsliuustc0106
Collaborator

add test plan&results please

@ZJY0516 ZJY0516 added this to the v0.14.0 milestone Jan 28, 2026
@fhfuih
Contributor

fhfuih commented Jan 29, 2026

Looks good to me. Thanks for keeping up with the recent code update.

@nuclearwu
Contributor Author

nuclearwu commented Jan 29, 2026

add test plan&results please

@hsliuustc0106 done, please review, thanks

@nuclearwu
Contributor Author

cc @david6666666 @ZJY0516

@hsliuustc0106 hsliuustc0106 added the ready label to trigger buildkite CI label Jan 29, 2026
@nuclearwu
Contributor Author

could you provide benchmark results using https://github.com/vllm-project/vllm-omni/tree/main/benchmarks/diffusion

@hsliuustc0106 done, ptal thx

@david6666666
Collaborator

LGTM, I've already tested it locally, thanks for the fix.

@david6666666 david6666666 merged commit ee0f7a7 into vllm-project:main Jan 29, 2026
7 checks passed
dongbo910220 pushed a commit to dongbo910220/vllm-omni that referenced this pull request Feb 1, 2026