
[Migration] Sendnn inference rename#946

Merged
joerunde merged 10 commits into torch-spyre:main from joerunde:sendnn_inference_rename
Apr 22, 2026
Conversation

@joerunde
Collaborator

@joerunde joerunde commented Apr 21, 2026

Description

This PR renames the vllm_spyre package to sendnn_inference.

Docs and publication workflows have been updated in preparation for publishing this package as sendnn-inference==2.0.0.

❗❗❗

Breaking configuration changes!

  • The plugin name for VLLM_PLUGINS is now sendnn_inference
  • All config options are now SENDNN_INFERENCE_*
  • The precompiled model parser now expects sendnn_inference_version
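For anyone migrating an existing deployment, the renames amount to swapping the plugin name and the env var prefix. A minimal sketch (SENDNN_INFERENCE_DYNAMO_BACKEND is an assumed example of a renamed option, mirroring the existing VLLM_SPYRE_DYNAMO_BACKEND):

```shell
# Before (vllm-spyre 1.x):
#   export VLLM_PLUGINS=vllm_spyre
#   export VLLM_SPYRE_DYNAMO_BACKEND=eager

# After (sendnn-inference 2.0.0): plugin name and env prefix both change
export VLLM_PLUGINS=sendnn_inference
export SENDNN_INFERENCE_DYNAMO_BACKEND=eager
```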

Related Issues

Closes torch-spyre/spyre-inference#8

Test Plan

  • CPU unit tests
  • Spyre unit tests
  • Spot checks with full model integration tests
  • Ensure Test PyPI deploy works

Checklist

  • I have read the contributing guidelines
  • My code follows the project's code style (run bash format.sh)
  • I have added tests for my changes (if applicable)
  • I have updated the documentation (if applicable)
  • My commits include a Signed-off-by: line (DCO compliance)

Signed-off-by: Joe Runde <joe@joerun.de>
@github-actions

👋 Hi! Thank you for contributing.
Just a reminder: make sure that your code passes all the linting checks, otherwise your PR can't be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

Signed-off-by: Joe Runde <joe@joerun.de>
@joerunde
Collaborator Author

If we want to keep pushing docker images (that nobody uses?) then we'll also need to update our quay configs. I've set things up so that they could run, but disabled the build for now.

The PyPI and Test PyPI projects have been updated to publish the new sendnn-inference project once this PR merges.

@joerunde
Collaborator Author

bot:test
MARKERS="spyre and prefix_caching and not quantized and not multi"

Signed-off-by: Joe Runde <joe@joerun.de>
@joerunde joerunde mentioned this pull request Apr 21, 2026
5 tasks
@joerunde
Collaborator Author

bot:test
MARKERS="spyre and chunked_prefill"

@Nhan-Hoang
Collaborator

@joerunde yes, I think it is overlapping. We can close this as a duplicate. I actually had this ready last week just for the documentation, but failed to submit it.

@sducouedic
Collaborator

sducouedic commented Apr 22, 2026

I think these two changes are still missing:

  • _local_envs_for_test.sh line 10 VLLM_SPYRE_DYNAMO_BACKEND
  • .yapfignore: line 1 vllm_spyre/model_executor/...

Collaborator

@rafvasq rafvasq left a comment


Assuming docker/Dockerfile.amd64 is being left alone for now, a few other references to "vllm spyre" include:

@tjohnson31415
Collaborator

tjohnson31415 commented Apr 22, 2026

Adding some of my searching as well, with commands (overlaps with above comments except for some more changes in .github):

# General search (excluding dot files/dirs)
# excluding docs since that intends to use old names in some places
$ grep -ri 'vllm.spyre' * | grep -v -e '.egg-info' -e '__pycache__' -e 'Binary file' -e 'docs/' -e 'docker/'
_local_envs_for_test.sh:export VLLM_SPYRE_DYNAMO_BACKEND=eager
CONTRIBUTING.md:# Contributing to vLLM Spyre
CONTRIBUTING.md:For details on contributing to vLLM-Spyre, see the **[contributing guide](https://docs.vllm.ai/projects/spyre/en/latest/contributing)**.
examples/online_inference/spyre_vllm_benchmark.py:    parser = argparse.ArgumentParser(description="VLLM Spyre inference benchmarking script.")
tests/aftu/test_compare_graphs.py:"""Compare graphs generated by vLLM-Spyre vs AFTU.
tests/aftu/test_compare_graphs.py:This test compares computation graphs generated by vLLM-Spyre against those
tests/aftu/graph_compare_utils.py:"""Utilities for comparing computation graphs between vLLM-Spyre and AFTU.
tests/e2e/test_spyre_mm.py:    # and vllm spyre running with the eager backend.

# Search dotfiles
$ grep -d skip -i 'vllm.spyre' .*
.yapfignore:vllm_spyre/model_executor/model_loader/spyre_setup.py

# Recurse into specific . directories (avoid .git, .venv, etc)
$ grep -ir 'vllm.spyre' .github/ .vscode/
.github/CODEOWNERS:# This lists cover the "core" components of vLLM-Spyre that require careful review
.github/ISSUE_TEMPLATE/config.yml:    about: Read the vLLM Spyre documentation
.github/ISSUE_TEMPLATE/config.yml:    about: Learn how to contribute to vLLM Spyre
.github/ISSUE_TEMPLATE/feature-request.yml:      Why is this feature important? How would it benefit vLLM Spyre users?

Signed-off-by: Joe Runde <joe@joerun.de>
@joerunde
Collaborator Author

Travis's commands are now clean. Y'all think this is good to go?

Collaborator

@sducouedic sducouedic left a comment


lgtm!

@joerunde joerunde merged commit 105dead into torch-spyre:main Apr 22, 2026
12 checks passed
@joerunde
Collaborator Author

good luck us!

@joerunde
Collaborator Author

test pypi deployment was successful! 🎉

Development

Successfully merging this pull request may close these issues.

[Migration] Rename vllm-spyre to sendnn-inference

5 participants