[Usage]: if I want to run a 34B model，like yi-34B-chat,how can I use  multi GPU,I just have A100 40G

### Your current environment

```text
The output of `python collect_env.py`
```


### How would you like to use vllm

I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.