🚀 The feature, motivation and pitch
Following up on our earlier discussion under the roadmap, I’m opening this issue to propose LoRA support for the vLLM alignment workflow. Multimodal RL projects (e.g., mm_grpo) are also looking to adopt vllm-omni as a rollout engine. Since MM RL typically fine-tunes only the LoRA adapters (e.g., FlowGRPO, DiffusionNFT), this integration would directly enable RL training workflows in which weights are updated dynamically.
Having LoRA in vllm-omni also aligns with the base vLLM design and brings additional benefits:
- Dynamic adaptation: load/unload adapters without restart
- Memory efficiency: smaller memory footprint than full model copies
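To make the memory-efficiency point concrete, here is a back-of-the-envelope comparison of adapter size vs. a full weight copy for a single projection matrix (the dimensions and rank below are hypothetical, not tied to any specific model):

```python
# Illustrative arithmetic only (not vLLM code): parameter count of one
# LoRA adapter vs. a full copy of the weight matrix it adapts.
d, k = 4096, 4096   # hypothetical output/input dims of one projection
r = 16              # hypothetical LoRA rank

full_params = d * k           # full-rank weight copy: d x k
lora_params = r * (d + k)     # low-rank factors B (d x r) and A (r x k)

print(full_params)                  # 16777216
print(lora_params)                  # 131072
print(full_params // lora_params)   # 128x smaller for this layer
```

At rank 16 the adapter holds ~0.8% of the parameters of the matrix it modifies, which is why swapping adapters between RL training steps is far cheaper than swapping full model copies.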
Happy to take on the work if no one else is currently assigned.
Alternatives
No response
Additional context
No response
Before submitting a new issue...