Skip to content

[ROCm] [Feature] [Doc] [Dockerfile] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing#12499

Closed
tjtanaa wants to merge 104 commits intovllm-project:mainfrom
EmbeddedLLM:ptpc-fp8-rocm
Closed

[ROCm] [Feature] [Doc] [Dockerfile] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing#12499
tjtanaa wants to merge 104 commits intovllm-project:mainfrom
EmbeddedLLM:ptpc-fp8-rocm

Commits

Commits on Jan 28, 2025