CUDA Compiler Refactor #875

pvelesko · 2024-06-18T10:53:59Z

Looked into rewriting cucc in C++ but I really don't see any benefit for it. We have a python dependency through unit testing anyways.

Fixes #877
Fixes #874

Incorporate vector operation overloading fix from Jenny
CUDA compiler drop cmake configuration
nvcc symbolink link
Implement __shfl_sync variants partially (mask off/off or print error)
cuda_bf16.h support
cuda_fp16.h support
math_constants.h
map cudaMallocAsync, cudaFreeAsync to serial versions

Future work:

cuda_runtime.h is not C compatible and some HeCBench tests fail to compile for that reason.
Implement _sync support for masks other than all 0 or 1

with preprocessor directives to prevent double definitions

This reverts commit 39d657e.

error out of not c++

pvelesko mentioned this pull request Jun 18, 2024

wrap vector operator overloading with preprocessor directives to prev… #873

Closed

pvelesko force-pushed the cucc-cpp branch from b31a987 to 2403b0b Compare June 21, 2024 08:46

pvelesko mentioned this pull request Jun 21, 2024

cucc is a fork bomb if a symlink called nvcc pointing to cucc exists #877

Closed

jjennychen and others added 19 commits June 25, 2024 12:04

wrap vector operator overloading

dd876eb

with preprocessor directives to prevent double definitions

add cucc.cc

6a159f6

Cmake switch to C++ cucc

8eebca0

Revert "Cmake switch to C++ cucc"

bc09ac8

This reverts commit 39d657e.

-include cuda_runtime

c513ae0

hipcc - disable nvcc find

fc1af4b

--gencode

edc008d

missing cuda_runtime.h defines

cf700c8

update HIPCC use_fast_math

e761bc2

update HIP for bf16

7098b98

add spirv_hip_bf16.h

4e2acf1

add math_constants.h

7118362

Implement shfl_sync (partial)

b1c8bbc

HIP submodule

48c00ce

spirv_hip_bf16.h

7bf1877

missing cudaMalloc templates

6c87806

cuda_runtime.h:

82de638

error out of not c++

fix --gencode arg

e834f9e

additional _sync devicelibs

e757814

pvelesko force-pushed the cucc-cpp branch from b0bf4f9 to e757814 Compare June 25, 2024 09:05

pvelesko added 7 commits June 25, 2024 12:11

remove cucc.cc

1273772

remove extra definition from cuda_runtime.h

e72e134

set target props for shlf_sync

c1f4538

cucc.in -> cucc.py

809982e

cuda throw warning about C compat instead of err

f14c7ca

remove cuspv

f032aeb

switch activemask test to cucc

285fdb3

pvelesko added 5 commits June 25, 2024 14:28

exclude shfl_sync

18d93c1

clean up some warnings

04b3969

remove dead cuspv

024a299

refactor cucc/nvcc symlinks

d64465e

cmake install typo

6ea7e18

pvelesko marked this pull request as ready for review June 25, 2024 13:30

pvelesko requested review from franz and pjaaskel June 25, 2024 13:47

Kerilk requested a review from jjennychen June 27, 2024 14:49

pvelesko merged commit 4edbcb6 into main Jun 29, 2024

pvelesko deleted the cucc-cpp branch June 29, 2024 08:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA Compiler Refactor #875

CUDA Compiler Refactor #875

Uh oh!

pvelesko commented Jun 18, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CUDA Compiler Refactor #875

CUDA Compiler Refactor #875

Uh oh!

Conversation

pvelesko commented Jun 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pvelesko commented Jun 18, 2024 •

edited

Loading