Skip to content

Conversation

@linjieccc
Copy link
Contributor

@linjieccc linjieccc commented Nov 8, 2021

PR types

Bug fixes

PR changes

APIs

Describe

半精度fp16 的两个一维向量做matmul时,使用Eigen进行计算EigenMetaKernel周期过长,替换为blas进行实现。

Original(Profiled by Nsight System):
image

image

Optimized(Profiled by Nsight System):
image

image

@paddle-bot-old
Copy link

paddle-bot-old bot commented Nov 8, 2021

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copy link
Contributor

@zyfncg zyfncg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@linjieccc linjieccc changed the title Optimize Matmul_v2 Optimize dot product of Matmul_v2 Nov 9, 2021
@wawltor wawltor self-requested a review November 15, 2021 02:39
Copy link
Contributor

@wawltor wawltor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@wawltor wawltor merged commit 444a735 into PaddlePaddle:develop Nov 15, 2021
@linjieccc linjieccc deleted the fix_matmul branch November 15, 2021 02:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants