[SPARK-28593][CORE] Rename ShuffleClient to BlockStoreClient which more close to its usage #25327
Test build #108522 has finished for PR 25327 at commit
...network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockStoreClient.java
  val blockTransferService: BlockTransferService,
  securityManager: SecurityManager,
- externalShuffleClient: Option[ExternalShuffleClient])
+ externalShuffleClient: Option[ExternalBlockStoreClient])
rename the variable as well?
Initially I tried this, but plenty of names are bound to Shuffle, so in ae64538 I changed only this param name and the member var shuffleClient.
core/src/main/scala/org/apache/spark/storage/BlockManager.scala
The new name looks clearer. Also cc @attilapiros @vanzin @tgravescs
Test build #108544 has finished for PR 25327 at commit
Test build #108547 has finished for PR 25327 at commit
retest this please.
Test build #108560 has finished for PR 25327 at commit
I'm OK with the renaming; it is clearer. My only concern is changing public APIs when it may not be truly needed, although this is 3.0, so now is the time to do it, and I don't think this will impact very many users.
AFAIK the
attilapiros left a comment
I think the new names are much better, even self-explanatory.
But what about going one step further and renaming the ExternalShuffleService to ExternalBlockStoreService?
I know it will generate a lot of documentation changes and a temporary mapping in the minds of the operators running Spark, but if we would ever like to fix this, then this is the best time to do so.
cc @squito
Thanks for the review and comments.
It takes time to educate users that the external shuffle service is now the external block service. We need to update documents, configs, etc., and probably need a few blog posts, presentations, etc. I don't think it's a good idea to do all of that in one single PR. I'd like to merge this PR first, as a start of the internal renaming.
thanks, merging to master!
OK, I see that skipping its rename was the result of a conscious decision. That's fine.
Yep, thanks for the advice and review :)
What changes were proposed in this pull request?
After SPARK-27677, the shuffle client not only handles shuffle blocks but is also responsible for locally persisted RDD blocks. For better code scalability and more precise semantics (as per the discussion), we made several changes:
Note: we still keep the name BlockTransferService, because the Service contains both client and server; also, the name BlockTransferService does not reference the shuffle client only.
How was this patch tested?
Existing UT.
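For readers skimming the description, the motivation for the rename can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Spark's actual code: the interface `BlockStoreClient` here, its `fetchBlock` method, and the `InMemoryBlockStoreClient` class are all hypothetical names chosen for the example, and the block ids are only illustrative. The point is that once one client serves both shuffle blocks and persisted RDD blocks, a shuffle-specific name no longer describes it.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, NOT Spark's real API: names and signatures are assumptions.
public class BlockStoreClientSketch {

    /** Hypothetical client abstraction: fetches any stored block by its id. */
    interface BlockStoreClient {
        byte[] fetchBlock(String blockId);
    }

    /** Hypothetical in-memory stand-in for a remote block store. */
    static class InMemoryBlockStoreClient implements BlockStoreClient {
        private final Map<String, byte[]> blocks = new HashMap<>();

        void put(String blockId, byte[] data) {
            blocks.put(blockId, data);
        }

        @Override
        public byte[] fetchBlock(String blockId) {
            byte[] data = blocks.get(blockId);
            if (data == null) {
                throw new IllegalArgumentException("Unknown block: " + blockId);
            }
            return data;
        }
    }

    public static void main(String[] args) {
        InMemoryBlockStoreClient client = new InMemoryBlockStoreClient();
        // The same client serves a shuffle block and a persisted RDD block
        // (illustrative ids), which is why a shuffle-only name is too narrow.
        client.put("shuffle_0_1_2", new byte[] {1, 2, 3});
        client.put("rdd_5_0", new byte[] {4, 5});
        System.out.println(client.fetchBlock("shuffle_0_1_2").length);
        System.out.println(client.fetchBlock("rdd_5_0").length);
    }
}
```

The sketch deliberately leaves out networking, authentication, and registration with an external service; it only shows why a name tied to block storage fits the broadened responsibility better than one tied to shuffle.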