The specific version of vllm to use isn't mentioned in your README.md file. The potential conflict is that, the pinned version of transformers==4.28.1, tells us that we should use vllm=0.1.2 which is really old but supports that version of transformers. But then in your README.md you use the vllm serve command which was only introduced in post 0.6.5, but that requires transformers>=4.45.2. Can you please resolve this issue?
The specific version of
vllmto use isn't mentioned in yourREADME.mdfile. The potential conflict is that, the pinned version oftransformers==4.28.1, tells us that we should usevllm=0.1.2which is really old but supports that version oftransformers. But then in yourREADME.mdyou use thevllm servecommand which was only introduced in post 0.6.5, but that requirestransformers>=4.45.2. Can you please resolve this issue?