A vLLM platform plugin for IBM Spyre AI accelerators.
spyre-inference is the successor to sendnn-inference, integrating IBM's Spyre hardware accelerators with vLLM for high-performance large language model inference.
This plugin uses torch-spyre and PyTorch's native Inductor compiler backend to optimize model execution on Spyre devices through vLLM's plugin architecture.
- Python >= 3.11
- Access to IBM Spyre hardware with the Spyre Runtime stack
- PyTorch 2.10.0 (CPU backend)
```bash
# Clone the repository
git clone https://github.com/torch-spyre/spyre-inference
cd spyre-inference

# Install with uv (recommended)
uv sync --frozen
```

Note: torch-spyre compilation requires access to IBM Spyre hardware with the Spyre Runtime stack. See internal development documentation for environment setup.
The plugin automatically registers with vLLM when installed. Enable it by setting `VLLM_PLUGINS=spyre_inference`.
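For example, the variable can be exported before launching the process (a minimal sketch; the script name below is only a placeholder):

```bash
# Load only the Spyre plugin when vLLM starts
export VLLM_PLUGINS=spyre_inference

# Run your inference script (placeholder name)
python offline_inference.py
```

With the plugin loaded, the standard vLLM API can be used as usual: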
```python
from vllm import LLM

llm = LLM(
    model="ibm-ai-platform/micro-g3.3-8b-instruct-1b",
    max_model_len=128,
    max_num_seqs=2,
)
```

The test suite includes:
- Local tests (`-m spyre`) - Spyre-specific functionality validation
- Upstream tests (`-m upstream`) - vLLM compatibility verification
Upstream tests are automatically synced from the vLLM repository at the commit specified in `pyproject.toml`.
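Assuming pytest is the test runner (the `-m` marker syntax above suggests it), each group can be selected on its own; this is a sketch rather than a documented command:

```bash
# Run only the Spyre-specific local tests
pytest -m spyre

# Run only the upstream vLLM compatibility tests
pytest -m upstream
```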
See the Contributing Guide for:
- Issue reporting and feature requests
- Development setup
- Testing guidelines
- Pull request process
Licensed under Apache 2.0.

Related projects:
- torch-spyre - PyTorch backend for Spyre accelerators
- vLLM - High-throughput LLM inference engine
- sendnn-inference - Previous generation Spyre vLLM plugin