
[Hardware] [Feat] Setup platform dependent package installation#1046

Merged
Isotr0py merged 24 commits into vllm-project:main from EmbeddedLLM:hardwareinstallationomni
Feb 5, 2026

Conversation

@tjtanaa
Contributor

@tjtanaa tjtanaa commented Jan 29, 2026


Purpose

This PR implements the RFC #997 .
It fixes #909 as well.

The dockerfiles and documentation have been updated to reflect this.

Current behaviour

Default behaviour (CUDA)

By default, pip install and uv pip install will install the CUDA configuration.

Other platforms

Dependencies for other platforms (CPU/ROCm/NPU/XPU) must be installed using one of the following approaches:

VLLM_OMNI_TARGET_DEVICE=<cpu,rocm,npu,xpu> pip install -e .

VLLM_OMNI_TARGET_DEVICE=<cpu,rocm,npu,xpu> uv pip install -e .

# using platform auto-detection
pip install -e . --no-build-isolation

uv pip install -e . --no-build-isolation
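For context, the platform auto-detection described above could look roughly like the sketch below. This is a hedged illustration based on the PR description, not the merged code; the function name detect_target_device and the exact fallback order are assumptions:

```python
import os

# Hypothetical sketch of how setup.py might pick the target platform.
SUPPORTED_DEVICES = ("cuda", "cpu", "rocm", "npu", "xpu")

def detect_target_device() -> str:
    # 1. An explicit override wins: VLLM_OMNI_TARGET_DEVICE=<cpu,rocm,npu,xpu>
    override = os.environ.get("VLLM_OMNI_TARGET_DEVICE", "").lower()
    if override:
        if override not in SUPPORTED_DEVICES:
            raise ValueError(f"Unsupported target device: {override!r}")
        return override
    # 2. Auto-detection requires --no-build-isolation so the build step can
    #    see the torch already installed in the user's environment.
    try:
        import torch
        if torch.version.cuda is not None:
            return "cuda"
        if getattr(torch.version, "hip", None) is not None:
            return "rocm"
    except ImportError:
        pass
    # 3. Fall back to the default (CUDA) configuration.
    return "cuda"
```

This mirrors the behavior stated above: the environment variable forces a platform, while the --no-build-isolation path lets the build inspect the installed torch to infer one.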

Test Plan

Validated locally on ROCm platform.

Test Result

Validated locally on the ROCm platform that it works; the --no-build-isolation flag is required.

CC @gcanlin @PopSoda2002 @david6666666

Simple bugfix

Fix the license deprecation warning

  /usr/local/lib/python3.12/dist-packages/setuptools/config/_apply_pyprojecttoml.py:82: SetuptoolsDeprecationWarning: `project.license` as a TOML table is deprecated
  !!
          ********************************************************************************
          Please use a simple string containing a SPDX expression for `project.license`. You can also use `project.license-files`. (Both options available on setuptools>=77.0.0).
          By 2026-Feb-18, you need to update your project and remove deprecated calls
          or your builds will no longer be supported.
          See https://packaging.python.org/en/latest/guides/writing-pyproject-toml/#license for details.
          ********************************************************************************
  !!
    corresp(dist, value, root_dir)
  /usr/local/lib/python3.12/dist-packages/setuptools/config/_apply_pyprojecttoml.py:61: SetuptoolsDeprecationWarning: License classifiers are deprecated.
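The fix the warning asks for, sketched as an assumption about the pyproject.toml change (the merged diff may differ in detail), replaces the deprecated TOML-table form with an SPDX expression string:

```toml
[project]
# deprecated TOML-table form:
#   license = { file = "LICENSE" }
# replacement (setuptools>=77.0.0): SPDX expression string plus license-files
license = "Apache-2.0"
license-files = ["LICENSE"]
```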

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft.



@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7eb9c59675


Comment on lines +121 to +125
device = detect_target_device()
requirements_dir = Path(__file__).parent / "requirements"
requirements_file = requirements_dir / f"{device}.txt"

print(f"Loading requirements from: {requirements_file}")


P1: Ensure requirements/*.txt are packaged in sdist/wheel

This setup logic reads requirements/{device}.txt at build time, but the new requirements/ directory isn’t included in package data (only vllm_omni.model_executor.stage_configs is listed) and there’s no MANIFEST.in, so source distributions will omit those files. In that case load_requirements returns an empty list and the built wheel installs with no runtime dependencies, which breaks pip install from sdist (e.g., when no wheel is available for a platform/Python). Consider adding the requirements files to the sdist/wheel inputs (MANIFEST.in or tool.setuptools package-data) or moving the lists back into pyproject.toml so they’re always available.
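One possible fix along the lines Codex suggests, sketched here as an assumption (the PR may have resolved it differently), is a MANIFEST.in so the per-device requirement lists ship with the source distribution:

```
# MANIFEST.in (hypothetical) -- include the per-device requirement lists in the sdist
recursive-include requirements *.txt
```

The alternative mentioned in the review, tool.setuptools package-data, would only work if the requirements/ directory lived inside the vllm_omni package rather than at the repository root.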


Signed-off-by: tjtanaa <[email protected]>
Co-authored-by: gcanlin <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
@tjtanaa
Contributor Author

tjtanaa commented Jan 29, 2026

To reviewers: please help ensure all three authors are credited. Thanks

Signed-off-by: tjtanaa <[email protected]>
Co-authored-by: gcanlin <[email protected]>
Co-authored-by: PopSoda2002 <[email protected]>

@david6666666 david6666666 mentioned this pull request Jan 29, 2026
5 tasks
@tjtanaa
Contributor Author

tjtanaa commented Jan 29, 2026

@david6666666 I have updated the PR so that fa-fwd is pinned.
@gcanlin and @faaany found there is no issue for NPU and XPU.

@hsliuustc0106 hsliuustc0106 added the Hardware Plugin support different hardware beyond cuda label Jan 29, 2026
@ZJY0516 ZJY0516 added the ready label to trigger buildkite CI label Jan 29, 2026
@gcanlin
Contributor

gcanlin commented Jan 30, 2026

CI failed. It looks like fa3-fwd didn't install successfully.

Exception: No module named 'fa3_fwd_interface'

Do we need --no-build-isolation on CUDA?



#### Installation of vLLM
Note: Pre-built wheels are currently only available for vLLM-Omni 0.11.0rc1, 0.12.0rc1, 0.14.0rc1. For the latest version, please [build from source](https://docs.vllm.ai/projects/vllm-omni/en/latest/getting_started/installation/gpu/#build-wheel-from-source).
Contributor


It seems this note should be placed above "Installation of vLLM".

Contributor Author


Done. I was following the instructions from CUDA. Since we can move this before the installation of vLLM, I have put it in gpu.md.

@david6666666 david6666666 added this to the v0.14.0 milestone Jan 30, 2026
@tjtanaa
Contributor Author

tjtanaa commented Jan 30, 2026

CI failed. It looks like fa3-fwd didn't install successfully.

Exception: No module named 'fa3_fwd_interface'

Do we need --no-build-isolation on CUDA?

I have added that to the ci dockerfile, let's see.

@gcanlin
Contributor

gcanlin commented Jan 30, 2026

CI failed. It looks like fa3-fwd didn't install successfully.

Exception: No module named 'fa3_fwd_interface'

Do we need --no-build-isolation on CUDA?

I have added that to the ci dockerfile, let's see.

My concern is that it will change the user behavior when installing. Is it acceptable?

My original point was to avoid introducing extra flags like [cuda] or [npu]. But now, even if we unify the installation across different hardware, we still have to add the longer flag. If we can't avoid it entirely, maybe the [cuda]-style installation should be considered.

@tjtanaa
Contributor Author

tjtanaa commented Jan 30, 2026

CI failed. It looks like fa3-fwd didn't install successfully.

Exception: No module named 'fa3_fwd_interface'

Do we need --no-build-isolation on CUDA?

I have added that to the ci dockerfile, let's see.

My concern is that it will change the user behavior when installing. Is it acceptable?

My original point is we avoid to introduce the extra flag like [cuda], [npu]. But now, even if we unify installation way in different hardware, we have to add the longer flag. If we couldn't avoid it totally, maybe [cuda] style installation way should be considered.

@gcanlin I think let's go for vllm-omni[cuda], vllm-omni[rocm]

I am also currently working on setting up the release pipeline, for both the Docker image and the wheel.

I noticed that if we use the setup.py approach, then when distributing on PyPI we cannot distribute a wheel, only the source tar.gz. The reason is that the dependency METADATA is determined at BUILD time, not INSTALL time.

If we go for vllm-omni[cuda] and vllm-omni[rocm], then this will work at INSTALL time.

Then for test-time dependencies, we can use the tags cuda-tests, rocm-tests, npu-tests, etc.
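The install-time extras idea could be sketched like this in pyproject.toml. The extra names come from the discussion above, but the dependency lists are placeholders, not the project's real requirements:

```toml
[project.optional-dependencies]
# per-platform runtime dependencies, resolved at INSTALL time
cuda = ["placeholder-cuda-dep"]
rocm = ["placeholder-rocm-dep"]
# per-platform test-time dependencies
cuda-tests = ["pytest"]
rocm-tests = ["pytest"]
npu-tests = ["pytest"]
```

Because extras are part of the wheel's static metadata, a single wheel can be published and the platform choice happens when the user runs pip install "vllm-omni[rocm]".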

@hsliuustc0106 hsliuustc0106 requested review from a team and ywang96 January 30, 2026 15:24
@hsliuustc0106
Collaborator

It's not optimal to require CUDA users to run "pip install vllm-omni[cuda]".

@david6666666
Collaborator

This PR will not be included in v0.14.0 for now; this will be explained in the documentation first, and a better method will be used later.

@david6666666 david6666666 removed this from the v0.14.0 milestone Jan 31, 2026
Signed-off-by: tjtanaa <[email protected]>
pyproject.toml Outdated
"setuptools>=77.0.3,<81.0.0",
"wheel",
"setuptools-scm>=8.0",
"torch == 2.9.1",
Member


Do we still need to include torch as a build dependency? I think vLLM-Omni doesn't need to build PyTorch kernels.

Contributor Author

@tjtanaa tjtanaa Jan 31, 2026


We need to include torch if we want a plain pip install . to install the CUDA dependencies, since setup.py uses if torch.version.cuda is not None:. Without torch in pyproject.toml, that check cannot run in the isolated build environment, so the CUDA dependencies would not be installed.
I am thinking of proposing not to support CPU for now and always falling back to the CUDA platform. Then we don't need to include torch in pyproject.toml.

…on platform, add onnxruntime uninstallation in rocm path

Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
@tjtanaa
Contributor Author

tjtanaa commented Feb 1, 2026

I have updated the PR description to reflect the current state of the code. Please take a look.

Member

@Isotr0py Isotr0py left a comment


LGTM now

Contributor

@PopSoda2002 PopSoda2002 left a comment


Good work!

@Isotr0py Isotr0py merged commit 78a5aae into vllm-project:main Feb 5, 2026
7 checks passed
gerayking pushed a commit to gerayking/vllm-omni that referenced this pull request Feb 12, 2026
…-project#1046)

Signed-off-by: tjtanaa <[email protected]>
Co-authored-by: PopSoda2002 <[email protected]>
Co-authored-by: gcanlin <[email protected]>
Signed-off-by: gerayking <[email protected]>
YanickSchraner pushed a commit to YanickSchraner/vllm-omni that referenced this pull request Feb 20, 2026

Labels

Hardware Plugin support different hardware beyond cuda ready label to trigger buildkite CI


10 participants