[PHI] Add uint8/int16 CUDA atomic mul/min/max and upgraded take/put_along_axis (input types) #74693

Enigmatisms · 2025-08-18T06:56:25Z

PR Category

Operator Mechanism

PR Types

Improvements

Description

在 gpu_primitives 中增加了 uint8/int16 两种类型的 CUDA atomic functions，针对如下三种操作：

min
max
multiply

对应的单测通过 put_along_axis 相关的单测实现并进行。同时兼容升级了 put_along_axis 以及 take_along_axis，增加了 int16/uint8 的支持（除了上述三种op，add也增加了支持，只不过add的atomic uint8 int16操作原本就存在），删除了某些文件中 SFINAE 实现的 uint8/int16 绕过。除此之外，这些 atomic primitives 本身由于无法导出单测，本人在 Enigmatisms/atomic_playground 中导出了 Python 接口，与 host 端 std::accumulation 结果进行了大量对比测试。

Pcard-89620

paddle-bot · 2025-08-18T06:56:32Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Enigmatisms · 2025-08-19T00:47:49Z

/re-run all-failed

…long_axis (input types) (PaddlePaddle#74693) * [PHI] Aligned uint8 and int16 atomic funcs * [PHI] Removed some of the GPU only constraints. * [PHI] Fixed put_along_axis CPU end test error

[PHI] Aligned uint8 and int16 atomic funcs

62f3acd

Enigmatisms added 2 commits August 18, 2025 07:29

[PHI] Removed some of the GPU only constraints.

8bf3128

[PHI] Fixed put_along_axis CPU end test error

23f9ee8

zhangbo9674 approved these changes Aug 19, 2025

View reviewed changes

zhangbo9674 merged commit a412420 into PaddlePaddle:develop Aug 19, 2025
72 of 73 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PHI] Add uint8/int16 CUDA atomic mul/min/max and upgraded take/put_along_axis (input types) #74693

[PHI] Add uint8/int16 CUDA atomic mul/min/max and upgraded take/put_along_axis (input types) #74693

Uh oh!

Enigmatisms commented Aug 18, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Aug 18, 2025

Uh oh!

Enigmatisms commented Aug 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[PHI] Add uint8/int16 CUDA atomic mul/min/max and upgraded take/put_along_axis (input types) #74693

[PHI] Add uint8/int16 CUDA atomic mul/min/max and upgraded take/put_along_axis (input types) #74693

Uh oh!

Conversation

Enigmatisms commented Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Uh oh!

paddle-bot bot commented Aug 18, 2025

Uh oh!

Enigmatisms commented Aug 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Enigmatisms commented Aug 18, 2025 •

edited

Loading