Skip to content

Update fp8_gemm_blockwise_kernel.cu check cuda version#73581

Merged
lshpku merged 4 commits intoPaddlePaddle:developfrom
co63oc:f47
Jun 27, 2025
Merged

Update fp8_gemm_blockwise_kernel.cu check cuda version#73581
lshpku merged 4 commits intoPaddlePaddle:developfrom
co63oc:f47

Conversation

@co63oc
Copy link
Contributor

@co63oc co63oc commented Jun 24, 2025

PR Category

User Experience

PR Types

Improvements

Description

cuda 12.8 cublas 12.8.4 没有CUBLASLT_MATMUL_MATRIX_SCALE_VEC128_32F定义,编译会有未定义错误
在 cuda 12.9 cublas 12.9.1中有,修改版本检查
image
image

@paddle-bot
Copy link

paddle-bot bot commented Jun 24, 2025

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Jun 24, 2025
@luotao1 luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Jun 24, 2025
@co63oc
Copy link
Contributor Author

co63oc commented Jun 25, 2025

/re-run all-failed

@co63oc
Copy link
Contributor Author

co63oc commented Jun 25, 2025

@risemeup1 CI已完成需要review

&operationDesc, CUBLAS_COMPUTE_32F, CUDA_R_32F));

#if CUBLAS_VERSION >= 120804 && CUDA_VERSION >= 12060
#if CUBLAS_VERSION >= 120901 && CUDA_VERSION >= 12090
Copy link
Contributor

@lshpku lshpku Jun 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

感谢你发现这个问题!该特性是12.8.5引入的,原代码确实有疏忽,但是限制成12.9也不合适,如果你想继续合入这个PR,可以找找cublas官方文档依据,然后修改这里的版本

Copy link
Contributor Author

@co63oc co63oc Jun 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

使用apt-get 默认安装的12.8.4, 没有更新的版本,那安装12.8.5是怎么安装
image

PR这里先按12.8.5修改

Copy link
Contributor

@lshpku lshpku left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

感谢你的贡献

@lshpku lshpku merged commit 2e52b04 into PaddlePaddle:develop Jun 27, 2025
45 of 47 checks passed
@co63oc co63oc deleted the f47 branch July 10, 2025 05:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers HappyOpenSource 快乐开源活动issue与PR

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants