-
Notifications
You must be signed in to change notification settings - Fork 16.2k
Compile bug: CUDA build for mmq breaks for compute capability 120 #18363
Copy link
Copy link
Closed
Labels
Description
Git commit
$ git rev-parse HEAD
c54bba8
Operating systems
Linux
GGML backends
CUDA
Problem description & steps to reproduce
When CUDA compute 120 is enabled, the build breaks on mmq; see commands and logs in the fields below
First Bad Commit
Compile command
cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="80;86;90;120" -DCMAKE_EXE_LINKER_FLAGS="-L/usr/local/cuda/lib64/stubs" -DCMAKE_SHARED_LINKER_FLAGS="-L/usr/local/cuda/lib64/stubs"
make --build build --config Release -j 6Relevant log output
27%] Building CUDA object ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_0.cu.o
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error : Instruction 'mma with block scale' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error : Feature '.kind::mxf4' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error : Feature '.block_scale' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error : Feature '.scale_vec::2X' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1039; error : Instruction 'mma with block scale' not supported on .target 'sm_120'
(many, many other similar lines)Reactions are currently unavailable