@hxzd5568
Contributor

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

Pcard-67164

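  1. The cuBLAS SCAL and VCOPY paths call wrapped functions such as CUBLASAPI cublasStatus_t CUBLASWINAPI cublasScalEx(...), whose index parameters are of type int. For large tensors, Paddle's custom scal function is therefore used instead.
  2. After this change the behavior matches torch, which also uses custom functions to handle vcopy and scale
    (torch: pytorch/tools/autograd/derivatives.yaml:258)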
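
To illustrate point 1, here is a minimal CPU-side sketch of the idea, not the code in this PR. The names `ExceedsInt32`, `ScalInt64`, and `VCopyInt64` are hypothetical and do not come from Paddle or cuBLAS. The sketch only assumes that the wrapped cuBLAS call takes an `int` length, so an element count above `INT_MAX` cannot be passed through it and a 64-bit-indexed routine is needed instead.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical helper: true when the element count does not fit in the
// 32-bit `int` length parameter assumed for the cuBLAS wrapper.
inline bool ExceedsInt32(std::int64_t n) {
  return n > static_cast<std::int64_t>(std::numeric_limits<int>::max());
}

// Hypothetical stand-in for a custom scal: x[i] *= alpha, indexed with
// int64_t so the loop counter cannot overflow on large tensors.
template <typename T>
void ScalInt64(std::int64_t n, T alpha, T* x) {
  assert(n >= 0);
  for (std::int64_t i = 0; i < n; ++i) {
    x[i] *= alpha;
  }
}

// Hypothetical stand-in for a custom vcopy: dst[i] = src[i], same indexing.
template <typename T>
void VCopyInt64(std::int64_t n, const T* src, T* dst) {
  assert(n >= 0);
  for (std::int64_t i = 0; i < n; ++i) {
    dst[i] = src[i];
  }
}
```

A caller could check `ExceedsInt32(numel)` and fall back to the 64-bit routines only for large tensors, keeping the library call for the common case. Whether the PR dispatches this way or always uses the custom path is not stated in the description.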

@paddle-bot

paddle-bot bot commented Jul 29, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@hxzd5568
Contributor Author

/re-run all-failed

Contributor

@wanghuancoder wanghuancoder left a comment

LGTM

@lshpku lshpku merged commit a6f0874 into PaddlePaddle:develop Jul 30, 2025
69 of 71 checks passed