Remove customized address space tracking and address class emission in debug info #669

Merged
gmarkall merged 8 commits into NVIDIA:main from jiel-nv:shared-pointer on Dec 19, 2025

jiel-nv (Contributor) commented Dec 17, 2025

Fix issue #627

Remove the dwarfAddressSpace attribute from DIDerivedType for shared memory pointers.
Clean up dead code (DCE), since customized address space tracking is no longer needed.

The DwarfAddressClass enum and the corresponding mapping function (which maps an NVVM address space to a DWARF address class) are intentionally retained for potential future use.
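
For readers unfamiliar with the retained pieces, here is a minimal sketch of what an enum plus an address-space mapping of this kind could look like. The names `NvvmAddressSpace`, `ExampleDwarfAddressClass`, and `example_nvvm_to_dwarf` are hypothetical, and the DWARF class values are assumptions based on the commonly documented NVPTX numbering. None of this is copied from the numba-cuda source.

```python
from enum import IntEnum


class NvvmAddressSpace(IntEnum):
    """NVVM IR address space numbers (names are illustrative)."""

    GENERIC = 0
    GLOBAL = 1
    SHARED = 3
    CONSTANT = 4
    LOCAL = 5


class ExampleDwarfAddressClass(IntEnum):
    """Assumed DWARF address class values; not numba-cuda's definitions."""

    CONSTANT = 4
    GLOBAL = 5
    LOCAL = 6
    SHARED = 8


# Hypothetical lookup table from NVVM address space to DWARF address class.
_NVVM_TO_DWARF = {
    NvvmAddressSpace.GLOBAL: ExampleDwarfAddressClass.GLOBAL,
    NvvmAddressSpace.SHARED: ExampleDwarfAddressClass.SHARED,
    NvvmAddressSpace.CONSTANT: ExampleDwarfAddressClass.CONSTANT,
    NvvmAddressSpace.LOCAL: ExampleDwarfAddressClass.LOCAL,
}


def example_nvvm_to_dwarf(addrspace):
    """Return the assumed DWARF class, or None when there is no mapping."""
    try:
        space = NvvmAddressSpace(addrspace)
    except ValueError:
        # Unknown address space number: nothing to emit.
        return None
    # The generic space has no entry in the table, so it also yields None.
    return _NVVM_TO_DWARF.get(space)
```

With this PR, a mapping like this is no longer consulted when emitting debug info for shared memory pointers; it is only kept around in case it becomes useful again.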

copy-pr-bot bot commented Dec 17, 2025

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

jiel-nv (Contributor, Author) commented Dec 17, 2025

/ok to test ab37265

jiel-nv (Contributor, Author) commented Dec 17, 2025

/ok to test 31dde8a

jiel-nv (Contributor, Author) commented Dec 17, 2025

/ok to test 7a76f2f

jiel-nv changed the title from "[Draft] Disable customized address space tracking and address class emission in debug info" to "Remove customized address space tracking and address class emission in debug info" Dec 17, 2025
jiel-nv added the "3 - Ready for Review" (Ready for review by team) label Dec 17, 2025
jiel-nv requested a review from gmarkall December 17, 2025 22:52

jiel-nv (Contributor, Author) commented Dec 18, 2025

/ok to test 17ecf4b

jiel-nv (Contributor, Author) commented Dec 19, 2025

/ok to test cd4edef

return False
return fnty.typing_key is cuda.shared.array

def _lower_call_normal(self, fnty, expr, signature):
Contributor

I think this function can now be deleted as it entirely defers to Lower._lower_call_normal().

Contributor Author

Thanks for catching this!
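
To illustrate the review point, here is a small sketch of why an override that only forwards to its parent can be dropped. `BaseLower`, `WithRedundantOverride`, and `WithoutOverride` are hypothetical stand-ins, not the actual numba-cuda lowering classes, and the method body is a placeholder.

```python
class BaseLower:
    """Hypothetical stand-in for the parent lowering class."""

    def _lower_call_normal(self, fnty, expr, signature):
        # Placeholder result; real lowering would emit IR here.
        return ("lowered", fnty, expr, signature)


class WithRedundantOverride(BaseLower):
    def _lower_call_normal(self, fnty, expr, signature):
        # Only forwards to the parent, so it adds no behaviour.
        return super()._lower_call_normal(fnty, expr, signature)


class WithoutOverride(BaseLower):
    # Same behaviour with the override deleted: the method is inherited.
    pass


args = ("fnty", "expr", "sig")
same = (
    WithRedundantOverride()._lower_call_normal(*args)
    == WithoutOverride()._lower_call_normal(*args)
)
```

Here `same` is `True`, which is the sense in which the subclass method "entirely defers" to the parent.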

gmarkall (Contributor)

/ok to test

gmarkall (Contributor) left a comment

Looks good. I just pushed a minor tidy-up rather than iterating on the PR.

gmarkall added the "4 - Waiting on CI" (Waiting for a CI run to finish successfully) label and removed the "3 - Ready for Review" (Ready for review by team) label Dec 19, 2025
gmarkall enabled auto-merge (squash) December 19, 2025 09:52
gmarkall merged commit 368a87c into NVIDIA:main Dec 19, 2025
190 of 192 checks passed
gmarkall added a commit to gmarkall/numba-cuda that referenced this pull request Jan 12, 2026
- Add arch specific target support (NVIDIA#549)
- chore: disable `locked` flag to bypass prefix-dev/pixi#5256 (NVIDIA#714)
- ci: relock pixi (NVIDIA#712)
- ci: remove redundant conda build in ci (NVIDIA#711)
- chore(deps): bump numba-cuda version and relock pixi (NVIDIA#707)
- Dropping bits in the old CI & Propagating recent changes from cuda-python (NVIDIA#683)
- Fix `test_wheel_deps_wheels.sh` to actually uninstall `nvvm` and `nvrtc` packages for CUDA 13 (NVIDIA#701)
- perf: remove some exception control flow and buffer-exception penalization for arrays (NVIDIA#700)
- perf: let CAI fall through instead of calling from_cuda_array_interface (NVIDIA#694)
- chore: perf lint (NVIDIA#697)
- chore(deps): bump deps in pixi lockfile (NVIDIA#693)
- fix: use freethreading-supported `_PySet_NextItemRef` where possible (NVIDIA#682)
- Support python `3.14` (NVIDIA#599)
- Remove customized address space tracking and address class emission in debug info (NVIDIA#669)
- Drop `experimental` from cuda.core namespace imports (NVIDIA#676)
- Remove dangling references to NUMBA_CUDA_ENABLE_MINOR_VERSION_COMPATIBILITY (NVIDIA#675)
- Use `rapidsai/sccache` in CI (NVIDIA#674)
- chore(dev-deps): remove ipython and pyinstrument (NVIDIA#670)
- Set up a new VM-based CI infrastructure  (NVIDIA#604)
gmarkall mentioned this pull request Jan 12, 2026
gmarkall added a commit that referenced this pull request Jan 12, 2026
- Add arch specific target support (#549)
- chore: disable `locked` flag to bypass prefix-dev/pixi#5256 (#714)
- ci: relock pixi (#712)
- ci: remove redundant conda build in ci (#711)
- chore(deps): bump numba-cuda version and relock pixi (#707)
- Dropping bits in the old CI & Propagating recent changes from cuda-python (#683)
- Fix `test_wheel_deps_wheels.sh` to actually uninstall `nvvm` and `nvrtc` packages for CUDA 13 (#701)
- perf: remove some exception control flow and buffer-exception penalization for arrays (#700)
- perf: let CAI fall through instead of calling from_cuda_array_interface (#694)
- chore: perf lint (#697)
- chore(deps): bump deps in pixi lockfile (#693)
- fix: use freethreading-supported `_PySet_NextItemRef` where possible (#682)
- Support python `3.14` (#599)
- Remove customized address space tracking and address class emission in debug info (#669)
- Drop `experimental` from cuda.core namespace imports (#676)
- Remove dangling references to NUMBA_CUDA_ENABLE_MINOR_VERSION_COMPATIBILITY (#675)
- Use `rapidsai/sccache` in CI (#674)
- chore(dev-deps): remove ipython and pyinstrument (#670)
- Set up a new VM-based CI infrastructure (#604)
Labels

4 - Waiting on CI Waiting for a CI run to finish successfully

2 participants