[RFC]: Remove VL Modeling Files #4084

@shen-shanshan

Motivation.

To avoid maintaining a variety of modeling files in vllm-ascend, we propose removing all files in the models dir of vllm-ascend. After this, the only thing a vllm plugin needs to do when adding a new model is register its custom device-specific OOT (out-of-tree) ops with vllm.

To achieve this, some refactoring needs to be done in both vllm and vllm-ascend, such as extracting some general layers as CustomOp.

Proposed Change.

vllm:

vllm-ascend:

Other related:

Feedback Period.

No response

CC List.

@Yikun @wangxiyuan @gcanlin

Any Other Things.

No response

Labels: RFC (Request For Comments)
