Skip to content

Reduce warpspeed scan to 256 threads/block on NVHPC#7892

Open
bernhardmgruber wants to merge 1 commit intoNVIDIA:mainfrom
bernhardmgruber:warpspeed_nvhpc
Open

Reduce warpspeed scan to 256 threads/block on NVHPC#7892
bernhardmgruber wants to merge 1 commit intoNVIDIA:mainfrom
bernhardmgruber:warpspeed_nvhpc

Conversation

@bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Mar 5, 2026

@miscco
Copy link
Contributor

miscco commented Mar 5, 2026

There is currently no NVHPC running in CI, please use the override matrix to make a test run against CUB

@bernhardmgruber
Copy link
Contributor Author

There is currently no NVHPC running in CI, please use the override matrix to make a test run against CUB

Or I'll wait until #7684 is merged.

@trxcllnt
Copy link
Member

trxcllnt commented Mar 5, 2026

The previously-failing NVHPC CUB tests are passing after merging this change into #7684: https://github.com/NVIDIA/cccl/actions/runs/22730042381?pr=7684

@github-actions
Copy link
Contributor

github-actions bot commented Mar 6, 2026

🥳 CI Workflow Results

🟩 Finished in 1d 05h: Pass: 100%/249 | Total: 6d 03h | Max: 3h 40m | Hits: 88%/155141

See results here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: In Review

Development

Successfully merging this pull request may close these issues.

[BUG]: ptxas error: Entry function with max regcount of 168 calls function with regcount of 254 on sm_120 (Blackwell)

3 participants