Skip to content

[DOC] CuTeDSL elementwise_add.py referring float16 as 32 bit value #2949

@ziereis

Description

@ziereis

The example here: https://github.com/NVIDIA/cutlass/blob/main/examples/python/CuTeDSL/notebooks/elementwise_add.ipynb

Is made with float16 values, however the documentation refers to "32-bit" wide values and also only uses 4 element vectorization instead of 8. The dtype should either be changed to float32 or the docs/code should be updated

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions