Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add SM75_16x8x8_F16F16F16F16_TN
#2851 opened Dec 6, 2025 by jinzhen-lin Loading…
use cp.async.bulk for per-row data; quiets synccheck
#2850 opened Dec 5, 2025 by v0i0 Loading…
Add spin_lock_atom_cas_acquire_wait function
#2846 opened Dec 5, 2025 by aleozlx Loading…
Remove deprecated newshape argument.
#2844 opened Dec 4, 2025 by Artem-B Loading…
Add alignment checks for mx dtypes under the Auto policy
#2827 opened Dec 1, 2025 by Hyaloid Loading…
[CuTeDSL] Feature/fp8e4m3 to fp16 conversion
#2822 opened Nov 28, 2025 by arseniivanov Loading…
Fix processing of relative imports in AST preprocessing
#2821 opened Nov 28, 2025 by danieldk Loading…
Remove x premission of CMakeLists.txt
#2811 opened Nov 25, 2025 by Rtoax Loading…
DOC: Fixing fundamental types codeblock
#2803 opened Nov 23, 2025 by SwayamInSync Loading…
DOC: Update CUTLASS quickstart, remove FP8 GEMM link
#2799 opened Nov 22, 2025 by SwayamInSync Loading…
add cute.union
#2788 opened Nov 21, 2025 by v0i0 Loading…
Fix: print subbyte<T> compilation error
#2783 opened Nov 19, 2025 by chrisHuxi Loading…
WIP: OSS CI Testing
#2776 opened Nov 15, 2025 by zekunf-nv Loading…
add dump_patch.py
#2767 opened Nov 13, 2025 by JayceSu98 Loading…
Remove prints from fmha fwd kernels
#2765 opened Nov 12, 2025 by milesvant Loading…
Fix example in CuTe tutorials
#2752 opened Nov 6, 2025 by StevenYangCC Loading…
Minor fix cute dsl example paths
#2741 opened Nov 1, 2025 by Edenzzzz Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.