-
Notifications
You must be signed in to change notification settings - Fork 40
Add more teams per communication buffer #1271
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
|
@pgrete: This should now be at the point you can test and see what sort of a performance impact this has for you. |
|
Based on the tests, it looks like there is some issue on gpu. I will take a look next week. |
Yurlungur
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Assuming this improves performance as we learned in the hackathon, LGTM
|
I'll test this next week. I finally have the benchmark/performance pipeline setup, which will make this easier |
|
@pgrete: I didn't think carefully about |
|
Here's even more detailed timing info (based on the 4.5 year old WIP PR #388) 😄 Apart from the obvious changes, I'm also surprised about the impact of the reduction across many blocks. Given that I now have a test env, I'll do some more test to get a sense of what's most relevant (e.g., with regard to the choice on whether to set a fixed number or a minumum number as you say). |
Do you mean the reduction in the |
|
@pgrete: What is the status of your review here? |
Sorry, this was not clear. I was referring to the |
I'm in favor of some "min number of teams" variable (as it's more generally applicable) and we may even be able to tie it to some hardware info that we query from Kokkos. |
@pgrete: If you want to take a crack at it, that sounds good to me. |
|
@pgrete: What is the status of this PR? Are you still thinking about changing how you specify the number of teams per buffer? |
Yes, just didn't get to it yet. |


PR Summary
Does something similar to #1196.
PR Checklist