Commit 5bd5ed6

update
1 parent 7109bd3 commit 5bd5ed6

1 file changed, +1 -1 lines changed

vllm/model_executor/models/llama.py

Lines changed: 1 addition & 1 deletion
@@ -147,7 +147,7 @@ def get_quantized_layer(in_features, out_features, quant_config):
         in_features=in_features,
         out_features=out_features,
         bias=None,
-        dev=0 ## TODO: fix this
+        dev=0 ## TODO: fix this without large spike in memory
     )
     return layer
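
Note on the TODO: the change only rewords the comment; `dev=0` stays hard-coded. One possible reading is that the device index should eventually come from the caller or the worker's environment rather than a literal. The following is a minimal sketch under that assumption, not the project's implementation. The helper names `resolve_device_index` and `quantized_layer_kwargs` are hypothetical, and using the `LOCAL_RANK` environment variable as a fallback is an assumption. It does not address the memory spike the comment mentions.

```python
import os


def resolve_device_index(explicit=None, env_var="LOCAL_RANK", default=0):
    """Pick a device index: explicit argument first, then env var, then default.

    Hypothetical helper for illustration; not part of the file in the diff.
    """
    if explicit is not None:
        index = int(explicit)
    else:
        index = int(os.environ.get(env_var, default))
    if index < 0:
        raise ValueError(f"device index must be non-negative, got {index}")
    return index


def quantized_layer_kwargs(in_features, out_features, dev=None):
    """Build the keyword arguments shown in the diff, with `dev` resolved
    instead of hard-coded (assumed design, for illustration only)."""
    return {
        "in_features": in_features,
        "out_features": out_features,
        "bias": None,
        "dev": resolve_device_index(dev),
    }
```

A caller would pass `dev` explicitly when it knows the target device and otherwise fall back to the environment, so the literal `0` remains only as a last-resort default.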
