Split op oneDNN AVX2 fix #33944
Thanks for your contribution!
arlesniak
left a comment
LGTM
lidanqing-vv
left a comment
LGTM
#33944 fixes this error on AVX2. However, running this model in an AVX2 environment may not give as large a speedup as on AVX512 (the 6148 itself supports AVX512), because oneDNN is much better optimized for AVX512, while on AVX2 some oneDNN ops fall back to the slower reference implementation. To achieve the expected performance, please use AVX512. @juncaipeng Please approve and merge this PR.
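For anyone unsure which instruction set their machine exposes, here is a minimal sketch, assuming a Linux host where `/proc/cpuinfo` lists CPU feature flags such as `avx2` and `avx512f`. The helper name `simd_level` is made up for this illustration and is not part of Paddle or oneDNN.

```python
import os


def simd_level(cpuinfo_text: str) -> str:
    """Return 'avx512', 'avx2', or 'other' based on /proc/cpuinfo-style text."""
    flags = set()
    for line in cpuinfo_text.splitlines():
        # On x86 Linux the feature list is on lines that start with "flags".
        if line.startswith("flags") and ":" in line:
            flags.update(line.split(":", 1)[1].split())
    if "avx512f" in flags:  # AVX-512 Foundation
        return "avx512"
    if "avx2" in flags:
        return "avx2"
    return "other"


if __name__ == "__main__":
    # Assumption: Linux only; other platforms do not provide this file.
    if os.path.exists("/proc/cpuinfo"):
        with open("/proc/cpuinfo") as f:
            print(simd_level(f.read()))
```

If this reports `avx2`, the model should run without the error once this PR is merged, but the AVX512 speedup should not be expected.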
juncaipeng
left a comment
LGTM
PR types
Bug fixes
PR changes
OPs
Describe
Fixed a bug that occurred when the AVX2 instruction set was used. In that scenario a blocked format was chosen, which the split op does not support. This is a fix for #33918.
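To illustrate the kind of guard described here, below is a rough sketch in Python rather than the actual C++ change. It assumes formats are identified by oneDNN-style tag names, where plain layouts look like `nchw` or `nhwc` and blocked layouts carry an inner block size such as `nChw8c` or `nChw16c`. The functions `is_blocked` and `pick_split_kernel` and the return values `"native"` and `"onednn"` are hypothetical, and the real fix in this PR may work differently.

```python
import re

# Blocked tags contain a block size followed by a dimension letter,
# e.g. the "8c" in nChw8c or the "16c" in nChw16c.
_INNER_BLOCK = re.compile(r"\d+[a-z]")


def is_blocked(format_tag: str) -> bool:
    """True if the tag names a blocked layout rather than a plain one."""
    return bool(_INNER_BLOCK.search(format_tag))


def pick_split_kernel(input_format_tag: str) -> str:
    """Hypothetical dispatch: avoid the oneDNN split path for blocked inputs."""
    if is_blocked(input_format_tag):
        return "native"  # assumed fallback, since split cannot handle blocked data
    return "onednn"
```

With this sketch, `pick_split_kernel("nChw8c")` selects the fallback, while `pick_split_kernel("nchw")` keeps the oneDNN path.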