Use NVFP4 Marlin for CompressedTensorsW4A16Fp4 #18000
Merged
mgoin merged 2 commits into vllm-project:main on May 13, 2025