Compile bug: CUDA build for mmq breaks for compute capability 120 #18363

@mramendi

Git commit

$ git rev-parse HEAD
c54bba8

Operating systems

Linux

GGML backends

CUDA

Problem description & steps to reproduce

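When CUDA compute capability 120 is enabled (120 listed in CMAKE_CUDA_ARCHITECTURES), the build fails in the mmq template instances: ptxas rejects the MXFP4 instance when targeting sm_120. See the compile command and log output in the fields below.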

First Bad Commit

c8a2417

Compile command

cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="80;86;90;120" -DCMAKE_EXE_LINKER_FLAGS="-L/usr/local/cuda/lib64/stubs" -DCMAKE_SHARED_LINKER_FLAGS="-L/usr/local/cuda/lib64/stubs"

cmake --build build --config Release -j 6

Relevant log output

[ 27%] Building CUDA object ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_0.cu.o
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error   : Instruction 'mma with block scale' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error   : Feature '.kind::mxf4' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error   : Feature '.block_scale' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1025; error   : Feature '.scale_vec::2X' not supported on .target 'sm_120'
ptxas /tmp/tmpxft_00007d6d_00000000-6_mmq-instance-mxfp4.compute_120.ptx, line 1039; error   : Instruction 'mma with block scale' not supported on .target 'sm_120'
(many, many other similar lines)
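
A possible workaround, not verified against this commit: as far as I understand the PTX ISA documentation, the block-scaled mma features named in the log (.kind::mxf4, .block_scale, .scale_vec::2X) are architecture-specific, and ptxas only accepts them for the sm_120a target, not plain sm_120. Under that assumption, requesting 120a instead of 120 may let the MXFP4 instance compile. The helper name fix_archs below is hypothetical and only illustrates rewriting the architecture list; the script prints the configure command rather than running it.

```shell
#!/bin/sh
# Hypothetical helper (not part of llama.cpp): rewrite the first bare "120"
# entry in a semicolon-separated CUDA architecture list to "120a".
# Entries such as "1200" or "120a" are left unchanged.
fix_archs() {
    printf '%s\n' "$1" | sed -E 's/(^|;)120(;|$)/\1120a\2/'
}

# Architecture list taken from the compile command in this report.
archs="80;86;90;120"
archs_fixed=$(fix_archs "$archs")

# Print the adjusted configure command instead of executing it,
# so the rewrite can be checked before a full rebuild.
echo "cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=\"$archs_fixed\""
```

Whether 120a is the right target depends on the installed CUDA toolkit and on how the project's CMake files treat the architecture list, so treat this as a starting point for testing rather than a confirmed fix.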
