

@zhanglirong1999
Contributor

PR types

Performance optimization

PR changes

Others

Description

We found that creating the deconvolution primitive desc is time-consuming in OneDNN. This PR adds a cache to reduce the number of times it is created.
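To illustrate the idea in the description, here is a minimal sketch of a descriptor cache keyed by the parameters that determine the descriptor. It is not the code in this PR: `FakeDeconvDesc`, `MakeDeconvKey`, and `DeconvDescCache` are hypothetical names, and the stand-in struct takes the place of a real OneDNN primitive desc so that the example stays self-contained.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <mutex>
#include <sstream>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for an expensive-to-build primitive descriptor.
// A real implementation would hold the OneDNN object instead.
struct FakeDeconvDesc {
  std::vector<long> src_dims;
  std::vector<long> weight_dims;
  std::vector<long> strides;
};

// Serialize every parameter that influences the descriptor into one key.
// Two calls with identical parameters must produce identical keys.
inline std::string MakeDeconvKey(const std::vector<long>& src_dims,
                                 const std::vector<long>& weight_dims,
                                 const std::vector<long>& strides) {
  std::ostringstream key;
  auto append = [&key](const char* tag, const std::vector<long>& dims) {
    key << tag;
    for (long d : dims) key << d << ',';
    key << ';';
  };
  append("src:", src_dims);
  append("w:", weight_dims);
  append("s:", strides);
  return key.str();
}

// Assumed design: a mutex-guarded map from key to shared descriptor.
class DeconvDescCache {
 public:
  using Creator = std::function<std::shared_ptr<FakeDeconvDesc>()>;

  // Return the cached descriptor for `key`, calling `create` only on a miss.
  std::shared_ptr<FakeDeconvDesc> GetOrCreate(const std::string& key,
                                              const Creator& create) {
    std::lock_guard<std::mutex> lock(mu_);
    auto it = cache_.find(key);
    if (it != cache_.end()) return it->second;  // hit: skip the costly build
    auto desc = create();                       // miss: build once and store
    ++creations_;
    cache_.emplace(key, desc);
    return desc;
  }

  // Number of times the creator actually ran (useful for measuring hits).
  std::size_t creations() const {
    std::lock_guard<std::mutex> lock(mu_);
    return creations_;
  }

 private:
  mutable std::mutex mu_;
  std::unordered_map<std::string, std::shared_ptr<FakeDeconvDesc>> cache_;
  std::size_t creations_ = 0;
};
```

In this sketch the lock is held while the creator runs, which keeps the logic simple but serializes concurrent misses. A production cache would also need an eviction policy if the set of shapes is unbounded.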

@paddle-bot

paddle-bot bot commented Jan 18, 2024

Your PR has been submitted. Thanks for your contribution to this open-source project!
Please wait for the CI results first. See the Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Jan 18, 2024
@zhanglirong1999
Contributor Author

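With the deconv cache added, models such as det_mv3_db_v2_0 and ocrv4_det_fp32_perf see a 10-15% performance improvement under OneDNN v3.2. With OneDNN v3.4, we found that creating the deconv primitive desc takes longer than in v3.2, so the cache should yield an even larger improvement once we upgrade to v3.4.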

@zhanglirong1999
Contributor Author

cc @vivienfanghuagood, @yuanlehome

@xinyu-intel xinyu-intel merged commit 5e87a34 into PaddlePaddle:develop Jan 19, 2024
Labels

contributor External developers Intel
